Informatika i Ee Primeneniya [Informatics and its Applications]
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive
Impact factor

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Inform. Primen.:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Informatika i Ee Primeneniya [Informatics and its Applications], 2019, Volume 13, Issue 3, Pages 34–40
DOI: https://doi.org/10.14357/19922264190306
(Mi ia607)
 

This article is cited in 2 scientific papers (total in 2 papers)

Hybrid extreme gradient boosting models to impute the missing data in precipitation records

A. K. Gorsheninab, O. P. Martynovb

a Institute of Informatics Problems, Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation
b Faculty of Computational Mathematics and Cybernetics, M. V. Lomonosov Moscow State University, GSP-1, Leninskie Gory, Moscow, 119991, Russian Federation
Full-text PDF (195 kB) Citations (2)
References:
Abstract: The article compares the classical method of extreme gradient boosting implemented in the XGBoost (eXtreme Gradient Boosting) framework with the new modification CatBoost (Categorial Boosting), which is rarely involved in scientific researches. Some hybrid classification-regression models are proposed to improve the accuracy of imputation in missing values in real data using 14 meteorological stations in Germany. The achieved accuracy of the classification is up to 92% and the root-mean-square errors are quite moderate. The hybrid methods outperformed both simple classification and regression models in prediction accuracy. The proposed approaches can be successfully used for meteorological data analysis by machine learning methods as well as for improving the forecasting accuracy in physical models of atmospheric processes.
Keywords: data imputation, precipitation, classification, regression, gradient boosting, XGBoost, CatBoost.
Funding agency Grant number
Russian Science Foundation 18-71-00156
The problems formulation and analysis of results through the paper were performed by A. K. Gorshenin whose research was supported by the Russian Science Foundation (project 18-71-00156).
Received: 08.07.2019
Document Type: Article
Language: Russian
Citation: A. K. Gorshenin, O. P. Martynov, “Hybrid extreme gradient boosting models to impute the missing data in precipitation records”, Inform. Primen., 13:3 (2019), 34–40
Citation in format AMSBIB
\Bibitem{GorMar19}
\by A.~K.~Gorshenin, O.~P.~Martynov
\paper Hybrid extreme gradient boosting models to impute the missing data in precipitation records
\jour Inform. Primen.
\yr 2019
\vol 13
\issue 3
\pages 34--40
\mathnet{http://mi.mathnet.ru/ia607}
\crossref{https://doi.org/10.14357/19922264190306}
Linking options:
  • https://www.mathnet.ru/eng/ia607
  • https://www.mathnet.ru/eng/ia/v13/i3/p34
  • This publication is cited in the following 2 articles:
    Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Информатика и её применения
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024