Vestnik Sankt-Peterburgskogo Universiteta. Seriya 10. Prikladnaya Matematika. Informatika. Protsessy Upravleniya
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Vestnik S.-Petersburg Univ. Ser. 10. Prikl. Mat. Inform. Prots. Upr.:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Vestnik Sankt-Peterburgskogo Universiteta. Seriya 10. Prikladnaya Matematika. Informatika. Protsessy Upravleniya, 2021, Volume 17, Issue 4, Pages 389–396
DOI: https://doi.org/10.21638/11701/spbu10.2021.407
(Mi vspui505)
 

Computer science

Research of features of Dostoevsky's publicistic style by using $n$-grams based on the materials of the “Time” and “Epoch” magazines

R. V. Abramov, K. A. Kulakov, A. A. Lebedev, N. D. Moskin, A. A. Rogov

Petrozavodsk State University, 33, pr. Lenina, Petrozavodsk, 185910, Russian Federation
References:
Abstract: The paper is devoted to the study of the publicity style of F. M. Dostoevsky on the basis of publications in the journals “Time” and “Epoch” (1861–1865). For this, fragments of texts (including other authors: M. M. Dostoevsky, N. N. Strakhov, A. A. Golovachev, etc.) were selected in sizes of 500, 700 and 1000 words, on which the occurrence of bigrams and trigrams (encoded sequences of parts of speech) were counted. Decision trees were built on their basis and an analysis of the accuracy of text recognition was performed. If we consider the class cation at the rest level of the tree (fragment size 1000), then the accuracy was on average 87 resulting decision trees.
Keywords: publicity style, text attribution, decision tree, $n$-gram, F. M. Dostoevsky, information system “Statistical methods for analyzing literary texts”, tree matching.
Funding agency Grant number
Russian Foundation for Basic Research 18-012-90026
This work was supported by the Russian Foundation for Basic Research (project N 18-012-90026).
Received: December 25, 2020
Accepted: October 13, 2021
Document Type: Article
UDC: 004.8
MSC: 68T50
Language: English
Citation: R. V. Abramov, K. A. Kulakov, A. A. Lebedev, N. D. Moskin, A. A. Rogov, “Research of features of Dostoevsky's publicistic style by using $n$-grams based on the materials of the “Time” and “Epoch” magazines”, Vestnik S.-Petersburg Univ. Ser. 10. Prikl. Mat. Inform. Prots. Upr., 17:4 (2021), 389–396
Citation in format AMSBIB
\Bibitem{AbrKulLeb21}
\by R.~V.~Abramov, K.~A.~Kulakov, A.~A.~Lebedev, N.~D.~Moskin, A.~A.~Rogov
\paper Research of features of Dostoevsky's publicistic style by using $n$-grams based on the materials of the ``Time'' and ``Epoch'' magazines
\jour Vestnik S.-Petersburg Univ. Ser. 10. Prikl. Mat. Inform. Prots. Upr.
\yr 2021
\vol 17
\issue 4
\pages 389--396
\mathnet{http://mi.mathnet.ru/vspui505}
\crossref{https://doi.org/10.21638/11701/spbu10.2021.407}
Linking options:
  • https://www.mathnet.ru/eng/vspui505
  • https://www.mathnet.ru/eng/vspui/v17/i4/p389
  • Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Вестник Санкт-Петербургского университета. Серия 10. Прикладная математика. Информатика. Процессы управления
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024