Problemy Peredachi Informatsii
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive
Impact factor
Guidelines for authors

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Probl. Peredachi Inf.:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Problemy Peredachi Informatsii, 2001, Volume 37, Issue 2, Pages 96–109 (Mi ppi520)  

This article is cited in 94 scientific papers (total in 94 papers)

Source Coding

Using Literal and Grammatical Statistics for Authorship Attribution

O. V. Kukushkina, A. A. Polikarpov, D. V. Khmelev
References:
Abstract: Markov chains are used as a formal mathematical model for sequences of elements of a text. This model is applied for authorship attribution of texts. As elements of a text, we consider sequences of letters or sequences of grammatical classes of words. It turns out that the frequencies of occurrences of letter pairs and pairs of grammatical classes in a Russian text are rather stable characteristics of an author and, apparently, they could be used in disputed authorship attribution. A comparison of results for various modifications of the method using both letters and grammatical classes is given. Experimental research involves 385 texts of 82 writers. In the Appendix, the research of D. V. Khmelev is described, where data compression algorithms are applied to authorship attribution.
Received: 08.08.2000
Revised: 11.01.2001
English version:
Problems of Information Transmission, 2001, Volume 37, Issue 2, Pages 172–184
DOI: https://doi.org/10.1023/A:1010478226705
Bibliographic databases:
Document Type: Article
UDC: 621.391.1
Language: Russian
Citation: O. V. Kukushkina, A. A. Polikarpov, D. V. Khmelev, “Using Literal and Grammatical Statistics for Authorship Attribution”, Probl. Peredachi Inf., 37:2 (2001), 96–109; Problems Inform. Transmission, 37:2 (2001), 172–184
Citation in format AMSBIB
\Bibitem{KukPolKhm01}
\by O.~V.~Kukushkina, A.~A.~Polikarpov, D.~V.~Khmelev
\paper Using Literal and Grammatical Statistics for Authorship Attribution
\jour Probl. Peredachi Inf.
\yr 2001
\vol 37
\issue 2
\pages 96--109
\mathnet{http://mi.mathnet.ru/ppi520}
\mathscinet{http://mathscinet.ams.org/mathscinet-getitem?mr=2099901}
\zmath{https://zbmath.org/?q=an:1008.62118}
\transl
\jour Problems Inform. Transmission
\yr 2001
\vol 37
\issue 2
\pages 172--184
\crossref{https://doi.org/10.1023/A:1010478226705}
Linking options:
  • https://www.mathnet.ru/eng/ppi520
  • https://www.mathnet.ru/eng/ppi/v37/i2/p96
  • This publication is cited in the following 94 articles:
    Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Проблемы передачи информации Problems of Information Transmission
    Statistics & downloads:
    Abstract page:2224
    Full-text PDF :1065
    References:85
    First page:1
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024