This article is cited in 3 scientific papers
Supervised learning classification of incomplete clinical data
M. P. Krivenko
Institute of Informatics Problems, Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation
Abstract:
The article examines the effectiveness of classification methods for incomplete clinical data. A Bayesian classifier is trained by the maximum likelihood method under a model of a mixture of normal distributions. A rigorous derivation of the formulas for the steps of the EM algorithm makes it possible to apply the iterative procedure for estimating the mixture parameters correctly. For incomplete data, methods are proposed for selecting initial values and for correcting degenerate covariance matrices of the mixture components. The experimental part of the work analyzes how the quality of classification depends on the number of missing individual values, using enzyme data obtained for patients with liver diseases. On these real data, simple and complex methods of handling missing values give almost identical classification errors when the number of randomly missing individual values is low.
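As an illustration of how incomplete observations enter the estimation, the sketch below fills the missing coordinates of one observation with their conditional expectation given the observed coordinates, under a single normal component with known mean and covariance. This is the standard textbook result for the multivariate normal distribution, not the formulas derived in the article; the function name `conditional_mean_impute` and the example parameters are assumptions made for illustration only.

```python
import numpy as np

def conditional_mean_impute(x, mu, sigma):
    """Fill NaN entries of x with E[x_miss | x_obs] under N(mu, sigma).

    Hypothetical helper for illustration; mu and sigma are assumed known.
    """
    x = np.asarray(x, dtype=float)
    mu = np.asarray(mu, dtype=float)
    sigma = np.asarray(sigma, dtype=float)

    miss = np.isnan(x)
    obs = ~miss
    filled = x.copy()

    if not miss.any():
        return filled              # nothing to impute
    if not obs.any():
        filled[:] = mu             # nothing observed: fall back to the mean
        return filled

    s_oo = sigma[np.ix_(obs, obs)]   # covariance of the observed part
    s_mo = sigma[np.ix_(miss, obs)]  # cross-covariance missing/observed

    # E[x_m | x_o] = mu_m + S_mo S_oo^{-1} (x_o - mu_o)
    filled[miss] = mu[miss] + s_mo @ np.linalg.solve(s_oo, x[obs] - mu[obs])
    return filled

# Example with made-up parameters: the second coordinate is missing.
mu = np.array([0.0, 1.0])
sigma = np.array([[1.0, 0.5],
                  [0.5, 2.0]])
x = np.array([2.0, np.nan])
result = conditional_mean_impute(x, mu, sigma)
```

In a full EM iteration for a mixture, a step of this kind would be carried out per component and weighted by the posterior component probabilities, and the conditional covariance of the missing part would also contribute to the covariance update; both are omitted here for brevity.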
Keywords:
missing data; EM algorithm; mixtures of normal distributions.
Received: 14.06.2017
Citation:
M. P. Krivenko, “Supervised learning classification of incomplete clinical data”, Inform. Primen., 11:3 (2017), 27–33
Linking options:
https://www.mathnet.ru/eng/ia482 https://www.mathnet.ru/eng/ia/v11/i3/p27