Artificial Intelligence and Decision Making
Artificial Intelligence and Decision Making, 2019, Issue 3, Pages 52–59
DOI: https://doi.org/10.14357/20718594190306
(Mi iipr180)
 

Natural language processing

Feature selection for text classification of a news flows based on topical importance characteristic

V. V. Zhebel (a), S-N. A. Zharikova (b), I. V. Sochenkov (a)

a Federal Research Center "Computer Science and Control" of Russian Academy of Sciences, Moscow, Russia
b Peoples' Friendship University of Russia named after Patrice Lumumba, Moscow, Russia
Abstract: The paper presents an approach for ranking the most valuable features in a text classification task. The proposed Topical Importance Characteristic drives a feature selection method that uses information about how words or phrases are distributed among topics. We compare this method with the well-known TF-IDF approach and apply the resulting word-ranking scheme in two classifiers: Random Forest and Multinomial Naïve Bayes. Classification accuracy was evaluated on the “20-Newsgroups” dataset. The developed approach outperforms TF-IDF-based methods and matches the accuracy achieved by more powerful state-of-the-art approaches, such as SVC, on the same dataset.
Keywords: topical text classification, machine learning, topical importance characteristic, 20-Newsgroups.
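Illustrative sketch: the abstract does not state the formula of the Topical Importance Characteristic, so the Python fragment below does not reproduce it. It only illustrates the general idea of ranking words by how unevenly they are distributed among topics, using a hypothetical stand-in score (the largest share of a word's documents that falls into a single topic). The function names, the toy corpus, and the tie-breaking rule are all assumptions made for this illustration.

```python
from collections import Counter, defaultdict


def topic_document_counts(docs):
    """Map each word to a Counter of {topic: number of documents containing it}."""
    counts = defaultdict(Counter)
    for topic, text in docs:
        # Count each word once per document (document frequency, not term frequency).
        for word in set(text.lower().split()):
            counts[word][topic] += 1
    return counts


def concentration_scores(docs):
    """Stand-in score (NOT the paper's formula): max over topics of P(topic | word)."""
    counts = topic_document_counts(docs)
    return {w: max(c.values()) / sum(c.values()) for w, c in counts.items()}


def select_features(docs, k):
    """Return the k highest-ranked words.

    Ties are broken by higher document frequency, then alphabetically
    (an assumption made so the ranking is deterministic).
    """
    counts = topic_document_counts(docs)

    def sort_key(word):
        c = counts[word]
        total = sum(c.values())
        return (-(max(c.values()) / total), -total, word)

    return sorted(counts, key=sort_key)[:k]


# Toy corpus of (topic, text) pairs; hypothetical data, not from 20-Newsgroups.
docs = [
    ("space", "the rocket reached orbit"),
    ("space", "the orbit of the moon"),
    ("hockey", "the team won the game"),
    ("hockey", "the game went to overtime"),
]

scores = concentration_scores(docs)
top_words = select_features(docs, 2)
```

In this toy corpus a word such as "the", which occurs equally often in both topics, receives the lowest score, while words confined to a single topic rank first. The selected words could then serve as the feature vocabulary for a classifier.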
Funding: Russian Foundation for Basic Research, grant numbers 15-29-06082 and 18-29-16172.
Document Type: Article
Language: Russian
Citation: V. V. Zhebel, S-N. A. Zharikova, I. V. Sochenkov, “Feature selection for text classification of a news flows based on topical importance characteristic”, Artificial Intelligence and Decision Making, 2019, no. 3, 52–59
Citation in format AMSBIB
\Bibitem{ZheZhaSoc19}
\by V.~V.~Zhebel, S-N.~A.~Zharikova, I.~V.~Sochenkov
\paper Feature selection for text classification of a news flows based on topical importance characteristic
\jour Artificial Intelligence and Decision Making
\yr 2019
\issue 3
\pages 52--59
\mathnet{http://mi.mathnet.ru/iipr180}
\crossref{https://doi.org/10.14357/20718594190306}
\elib{https://elibrary.ru/item.asp?id=41216283}
Linking options:
  • https://www.mathnet.ru/eng/iipr180
  • https://www.mathnet.ru/eng/iipr/y2019/i3/p52