|
This article is cited in 1 scientific paper (total in 1 paper)
Supervised learning classification of data taking into account principal component analysis
M. P. Krivenko Institute of Informatics Problems, Federal Research Center “Computer Science and Control” of the Russian
Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation
Abstract:
The article examines questions of supervised learning classification of data taking into account principal component analysis (PCA) results. Ñonstruction of a Bayesian classifier becomes possible after representation of covariances through the parameters of the probabilistic PCA model. The case of singular data distributions is singled out; for this case, it is suggested to estimate the parameters of the model under constraints on the eigenvalues of covariance matrices. The quality of classification is studied in respect to the actual data dimension. It is demonstrated that, when correctly assigned, the classifier has the least error probabilities. Exceeding the best value of the dimension usually worsens the quality of the classification to a lesser extent than its underestimation. The mixture of probabilistic principal component analyzer allows modeling big data by means of a relatively small number of free parameters. The number of free parameters can be controlled by choosing the latent dimension of the data.
Keywords:
principal component analysis; mixtures of normal distributions; EM algorithm; supervised learning classification.
Received: 30.05.2018
Citation:
M. P. Krivenko, “Supervised learning classification of data taking into account principal component analysis”, Inform. Primen., 12:3 (2018), 56–61
Linking options:
https://www.mathnet.ru/eng/ia547 https://www.mathnet.ru/eng/ia/v12/i3/p56
|
|