Problemy Peredachi Informatsii
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive
Impact factor
Guidelines for authors

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Probl. Peredachi Inf.:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Problemy Peredachi Informatsii, 2017, Volume 53, Issue 3, Pages 100–111 (Mi ppi2248)  

This article is cited in 7 scientific papers (total in 7 papers)

Source Coding

Information-theoretic method for classification of texts

B. Ya. Ryabkoab, A. E. Gus'kovca, I. V. Selivanovabc

a Institute of Computational Technologies, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
b Novosibirsk State University, Novosibirsk, Russia
c Russian National Public Library for Science and Technnology, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
Full-text PDF (244 kB) Citations (7)
References:
Abstract: We consider a method for automatic (i.e., unmanned) text classification based on methods of universal source coding (or “data compression”). We show that under certain restrictions the proposed method is consistent, i.e., the classification error tends to zero with increasing text lengths. As an example of practical use of the method we consider the classification problem for scientific texts (research papers, books, etc.). The proposed method is experimentally shown to be highly efficient.
Received: 21.10.2015
Revised: 13.05.2017
English version:
Problems of Information Transmission, 2017, Volume 53, Issue 3, Pages 294–304
DOI: https://doi.org/10.1134/S0032946017030115
Bibliographic databases:
Document Type: Article
UDC: 621.391.1+519.72
Language: Russian
Citation: B. Ya. Ryabko, A. E. Gus'kov, I. V. Selivanova, “Information-theoretic method for classification of texts”, Probl. Peredachi Inf., 53:3 (2017), 100–111; Problems Inform. Transmission, 53:3 (2017), 294–304
Citation in format AMSBIB
\Bibitem{RyaGusSel17}
\by B.~Ya.~Ryabko, A.~E.~Gus'kov, I.~V.~Selivanova
\paper Information-theoretic method for classification of texts
\jour Probl. Peredachi Inf.
\yr 2017
\vol 53
\issue 3
\pages 100--111
\mathnet{http://mi.mathnet.ru/ppi2248}
\elib{https://elibrary.ru/item.asp?id=29966415}
\transl
\jour Problems Inform. Transmission
\yr 2017
\vol 53
\issue 3
\pages 294--304
\crossref{https://doi.org/10.1134/S0032946017030115}
\isi{https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcApp=Publons&SrcAuth=Publons_CEL&DestLinkType=FullRecord&DestApp=WOS_CPL&KeyUT=000412936700011}
\scopus{https://www.scopus.com/record/display.url?origin=inward&eid=2-s2.0-85031754667}
Linking options:
  • https://www.mathnet.ru/eng/ppi2248
  • https://www.mathnet.ru/eng/ppi/v53/i3/p100
  • This publication is cited in the following 7 articles:
    Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Проблемы передачи информации Problems of Information Transmission
    Statistics & downloads:
    Abstract page:345
    Full-text PDF :45
    References:42
    First page:30
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024