Preprints of the Keldysh Institute of Applied Mathematics

Preprints of the Keldysh Institute of Applied Mathematics, 2022, 043, 24 pp.
DOI: https://doi.org/10.20948/prepr-2022-43
(Mi ipmp3069)

Two-factor patterns construction in problems of texts classification

M. Yu. Voronina, A. A. Kislitsyn, Yu. N. Orlov
Abstract: Two-factor patterns of empirical bigram frequency distributions are constructed for machine classification of texts by author and subject. Text attributes are recognized by the nearest neighbor method with respect to reference distributions, where the proximity between distributions is measured in the L1 norm. The 'author-topic' pair of an unknown text is defined as that of its nearest-neighbor pattern. The problems of recognizing the author regardless of the topic and the topic regardless of the author are analyzed. The possibilities of aggregating and refining the classification features are also investigated.
Keywords: machine classification, text, bigram distribution, spectral portrait, clustering.
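
The scheme described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that bigrams are pairs of adjacent letters taken after stripping all non-letter characters, and that reference patterns are stored in a dictionary keyed by (author, topic) pairs. The function names bigram_distribution, l1_distance and classify are hypothetical.

```python
from collections import Counter


def bigram_distribution(text):
    """Empirical distribution of adjacent-letter bigrams.

    Assumption: character-level bigrams over letters only; the preprint
    may define bigrams and the alphabet differently.
    """
    letters = [ch for ch in text.lower() if ch.isalpha()]
    counts = Counter(zip(letters, letters[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {bigram: n / total for bigram, n in counts.items()}


def l1_distance(p, q):
    """L1 norm of the difference between two distributions stored as dicts."""
    return sum(abs(p.get(k, 0.0) - q.get(k, 0.0)) for k in set(p) | set(q))


def classify(text, patterns):
    """Return the (author, topic) key of the reference pattern nearest in L1."""
    dist = bigram_distribution(text)
    return min(patterns, key=lambda key: l1_distance(dist, patterns[key]))


if __name__ == "__main__":
    # Toy reference patterns; real ones would be built from large corpora.
    patterns = {
        ("author_1", "topic_x"): bigram_distribution("abab abab abab"),
        ("author_2", "topic_y"): bigram_distribution("cdcd cdcd cdcd"),
    }
    print(classify("cdcdcd", patterns))
```

In this sketch, recognizing a single factor amounts to taking one component of the returned pair; whether the preprint handles author-only and topic-only recognition this way is not stated in the abstract.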
Funding: Russian Foundation for Basic Research, grant no. 19-29-01174
Document Type: Preprint
Language: Russian
Citation: M. Yu. Voronina, A. A. Kislitsyn, Yu. N. Orlov, “Two-factor patterns construction in problems of texts classification”, Keldysh Institute preprints, 2022, 043, 24 pp.
Citation in format AMSBIB
\Bibitem{VorKisOrl22}
\by M.~Yu.~Voronina, A.~A.~Kislitsyn, Yu.~N.~Orlov
\paper Two-factor patterns construction in problems of texts classification
\jour Keldysh Institute preprints
\yr 2022
\papernumber 043
\totalpages 24
\mathnet{http://mi.mathnet.ru/ipmp3069}
\crossref{https://doi.org/10.20948/prepr-2022-43}
Linking options:
  • https://www.mathnet.ru/eng/ipmp3069
  • https://www.mathnet.ru/eng/ipmp/y2022/p43