Matematicheskaya Biologiya i Bioinformatika
Matematicheskaya Biologiya i Bioinformatika, 2020, Volume 15, Issue 1, Pages 4–19
DOI: https://doi.org/10.17537/2020.15.4
(Mi mbb419)
 

This article is cited in 1 scientific paper (total in 1 paper)

Bioinformatics

Approach to the selection of significant features in solving biomedical problems of binary classification of microarray data

I. Yu. Boiko, D. S. Anisimov, L. L. Smolyakova, M. A. Ryazanov

Altay State University, Barnaul, Russian Federation
Abstract: Modern biomedical research on methods for the early diagnosis of cancer uses microarrays that carry biological information about patients. Based on these data, patients are assigned to one of two classes, corresponding to the presence or absence of a diagnosis. In solving this problem, one of the steps with a decisive influence on classification quality is the selection of significant features. This paper proposes a feature selection criterion based on the ledge-coefficient of correlation, which was previously used to estimate the degree of association between numerical and binary features. For two microarray data sets, comparative examples of binary classification are presented using three feature selection algorithms, three dimensionality reduction methods, and six classification models. Feature selection with the ledge-criterion yielded a classification quality comparable to that obtained with common feature selection methods such as the $t$-test and the $U$-test. For the peptide microarray data set considered in the paper, the effectiveness of the method of projection to latent structures had been established previously. Combining this method with feature selection by the ledge-criterion yielded a higher classification quality measure.
Key words: feature selection, ledge-coefficient, binary classification, microarrays, ROC-curve, projection to latent structures.
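
Illustrative sketch (not from the paper): the abstract names the $t$-test and the $U$-test as the common baselines for selecting significant features in two-class microarray data. The Python fragment below shows one way such univariate ranking could look. It assumes synthetic Gaussian data, Welch's form of the $t$ statistic, and a Mann-Whitney $U$ score rescaled to the distance of the per-feature AUC from 0.5; all function names are hypothetical. It does not implement the ledge-criterion, whose definition is not given on this page.

```python
import math
import random
from statistics import mean, variance


def welch_t(a, b):
    """Welch's t statistic for two independent samples."""
    se = math.sqrt(variance(a) / len(a) + variance(b) / len(b))
    return (mean(a) - mean(b)) / se


def mann_whitney_u(a, b):
    """Mann-Whitney U statistic of sample a against b (ties count 0.5)."""
    return sum(1.0 if x > y else 0.5 if x == y else 0.0 for x in a for y in b)


def t_score(pos, neg):
    """Absolute t statistic: larger means better class separation."""
    return abs(welch_t(pos, neg))


def u_score(pos, neg):
    """|U / (n1 * n2) - 0.5|, i.e. distance of the single-feature AUC from chance."""
    return abs(mann_whitney_u(pos, neg) / (len(pos) * len(neg)) - 0.5)


def rank_features(X, y, score):
    """Return feature indices sorted from most to least discriminative."""
    n_features = len(X[0])
    scores = []
    for j in range(n_features):
        pos = [row[j] for row, label in zip(X, y) if label == 1]
        neg = [row[j] for row, label in zip(X, y) if label == 0]
        scores.append(score(pos, neg))
    return sorted(range(n_features), key=lambda j: scores[j], reverse=True)


# Synthetic stand-in for a microarray (assumption): 40 samples, 50 features,
# of which only features 0, 1 and 2 differ between the two classes.
rng = random.Random(0)
y = [i % 2 for i in range(40)]
X = [[rng.gauss(3.0 * label if j < 3 else 0.0, 1.0) for j in range(50)]
     for label in y]

top_t = rank_features(X, y, t_score)[:3]
top_u = rank_features(X, y, u_score)[:3]
print("top features by t-test:", sorted(top_t))
print("top features by U-test:", sorted(top_u))
```

In a real pipeline the selected features would then be passed to a dimensionality reduction step and a classifier, and the ranking would be computed on training folds only to avoid selection bias.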
Funding agency: Russian Foundation for Basic Research
Grant number: 17-04-00321
Received 18.07.2019, 15.01.2020, Published 30.01.2020
Document Type: Article
Language: Russian
Citation: I. Yu. Boiko, D. S. Anisimov, L. L. Smolyakova, M. A. Ryazanov, “Approach to the selection of significant features in solving biomedical problems of binary classification of microarray data”, Mat. Biolog. Bioinform., 15:1 (2020), 4–19
Citation in format AMSBIB
\Bibitem{BoiAniSmo20}
\by I.~Yu.~Boiko, D.~S.~Anisimov, L.~L.~Smolyakova, M.~A.~Ryazanov
\paper Approach to the selection of significant features in solving biomedical problems of binary classification of microarray data
\jour Mat. Biolog. Bioinform.
\yr 2020
\vol 15
\issue 1
\pages 4--19
\mathnet{http://mi.mathnet.ru/mbb419}
\crossref{https://doi.org/10.17537/2020.15.4}
Linking options:
  • https://www.mathnet.ru/eng/mbb419
  • https://www.mathnet.ru/eng/mbb/v15/i1/p4