Preprints of the Keldysh Institute of Applied Mathematics
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Keldysh Institute preprints:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Preprints of the Keldysh Institute of Applied Mathematics, 2024, 053, 12 pp.
DOI: https://doi.org/10.20948/prepr-2024-53
(Mi ipmp3263)
 

On the estimation of integral risk of predictor Lipschitz functions in machine learning models

Yu. S. Chaynikov, V. A. Sudakov
References:
Abstract: Class imbalance in available training samples for solving machine learning problems in most practical cases complicates the training of predictors that effectively generalize patterns from the training dataset to the general population. This paper investigates the theoretical foundations of the effectiveness of adding synthetic data to the training set. In the assessment of overall risk, two types of errors are highlighted: representation error and deviation error. Practical recommendations are formulated for creating synthetic samples that deviate in their distribution from the representative ones by the density distribution of the argument, with more frequent samples in those areas where the density distribution of the argument has relatively low values, leading to a reduction in the size of the corresponding Voronoi cells and a reduction in the contribution of deviation error to total risk.
Keywords: synthetic data, machine learning, Voronoi cells, predictor, training sample, total risk, empirical risk, representation error, deviation error.
Document Type: Preprint
Language: Russian
Citation: Yu. S. Chaynikov, V. A. Sudakov, “On the estimation of integral risk of predictor Lipschitz functions in machine learning models”, Keldysh Institute preprints, 2024, 053, 12 pp.
Citation in format AMSBIB
\Bibitem{ChaSud24}
\by Yu.~S.~Chaynikov, V.~A.~Sudakov
\paper On the estimation of integral risk of predictor Lipschitz functions in machine learning models
\jour Keldysh Institute preprints
\yr 2024
\papernumber 053
\totalpages 12
\mathnet{http://mi.mathnet.ru/ipmp3263}
\crossref{https://doi.org/10.20948/prepr-2024-53}
Linking options:
  • https://www.mathnet.ru/eng/ipmp3263
  • https://www.mathnet.ru/eng/ipmp/y2024/p53
  • Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Препринты Института прикладной математики им. М. В. Келдыша РАН
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2025