|
Zhurnal Vychislitel'noi Matematiki i Matematicheskoi Fiziki, 2009, Volume 49, Number 11, Pages 2066–2080
(Mi zvmmf4790)
|
|
|
|
This article is cited in 2 scientific papers (total in 2 papers)
An efficient method for feature selection in linear regression based on an extended Akaike's information criterion
D. P. Vetrova, D. A. Kropotovb, N. O. Ptashkoa a Faculty of Computational Mathematics and Cybernetics, Moscow State University, Moscow, 119992, Russia
b Dorodnicyn Computing Center, Russian Academy of Sciences,
ul. Vavilova 40, Moscow, 119333, Russia
Abstract:
A method for feature selection in linear regression based on an extension of Akaike's information criterion is proposed. The use of classical Akaike's information criterion (AIC) for feature selection assumes the exhaustive search through all the subsets of features, which has unreasonably high computational and time cost. A new information criterion is proposed that is a continuous extension of AIC. As a result, the feature selection problem is reduced to a smooth optimization problem. An efficient procedure for solving this problem is derived. Experiments show that the proposed method enables one to efficiently select features in linear regression. In the experiments, the proposed procedure is compared with the relevance vector machine, which is a feature selection method based on Bayesian approach. It is shown that both procedures yield similar results. The main distinction of the proposed method is that certain regularization coefficients are identical zeros. This makes it possible to avoid the underfitting effect, which is a characteristic feature of the relevance vector machine. A special case (the so-called nondiagonal regularization) is considered in which both methods are identical.
Key words:
pattern recognition, linear regression, feature selection, Akaike's information criterion.
Received: 12.05.2009
Citation:
D. P. Vetrov, D. A. Kropotov, N. O. Ptashko, “An efficient method for feature selection in linear regression based on an extended Akaike's information criterion”, Zh. Vychisl. Mat. Mat. Fiz., 49:11 (2009), 2066–2080; Comput. Math. Math. Phys., 49:11 (2009), 1972–1985
Linking options:
https://www.mathnet.ru/eng/zvmmf4790 https://www.mathnet.ru/eng/zvmmf/v49/i11/p2066
|
Statistics & downloads: |
Abstract page: | 760 | Full-text PDF : | 370 | References: | 77 | First page: | 10 |
|