|
This article is cited in 2 scientific papers (total in 2 papers)
On relevant features selection based on information theory
A. V. Bulinski Lomonosov Moscow State University, Faculty of Mechanics and Mathematics
Abstract:
It is shown that widely used suboptimal algorithms of feature selection based
on information theory concepts do not necessarily identify a collection of
features (relevant in a sense) affecting the studied random response. This
can be considered as a reflection of the epistasis phenomenon known in
genetics, when individual features have little effect on increased risk
of complex disease, whereas certain combinations of features have
significant impact on risk. It is demonstrated that a similar effect is also
manifested in inferences employing statistical estimates of mutual
information.
Keywords:
feature selection, mutual information, interaction information, sequential selection of features, epistasis effect.
Received: 21.02.2023
Citation:
A. V. Bulinski, “On relevant features selection based on information theory”, Teor. Veroyatnost. i Primenen., 68:3 (2023), 483–508; Theory Probab. Appl., 68:3 (2023), 392–410
Linking options:
https://www.mathnet.ru/eng/tvp5640https://doi.org/10.4213/tvp5640 https://www.mathnet.ru/eng/tvp/v68/i3/p483
|
Statistics & downloads: |
Abstract page: | 228 | Full-text PDF : | 21 | References: | 40 | First page: | 25 |
|