Preprints of the Keldysh Institute of Applied Mathematics, 2013, 027, 26 pp.
(Mi ipmp1777)
Identification of a text author by the letter frequency empirical distribution
L. A. Borisov, Yu. N. Orlov, K. P. Osminin
Abstract:
The distributions of distances between empirical triplet distributions are investigated. An accuracy estimate for these distributions is obtained as a function of text length. The author identification method is tested on a broad class of literary texts. The stabilization length of the triplet distributions is approximately one half of the text, regardless of author and text length. An example of the cluster method is given for the philosophical texts of E. I. Roerich.
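The idea of comparing texts through their empirical triplet distributions can be illustrated with a minimal sketch. It rests on assumptions not stated in the abstract: a "triplet" is taken to mean three consecutive letters, the distance is taken to be the L1 norm, and the function names `triplet_distribution`, `l1_distance` and `nearest_author` are hypothetical. It is not the authors' implementation.

```python
from collections import Counter


def triplet_distribution(text):
    """Empirical distribution of overlapping letter triplets in `text`.

    Assumption: a triplet is three consecutive letters; case is folded
    and non-letter characters are dropped before counting.
    """
    letters = [ch for ch in text.lower() if ch.isalpha()]
    counts = Counter(
        "".join(letters[i:i + 3]) for i in range(len(letters) - 2)
    )
    total = sum(counts.values())
    if total == 0:
        return {}
    return {triplet: n / total for triplet, n in counts.items()}


def l1_distance(p, q):
    """L1 distance between two distributions (assumed choice of norm)."""
    keys = set(p) | set(q)
    return sum(abs(p.get(k, 0.0) - q.get(k, 0.0)) for k in keys)


def nearest_author(sample, references):
    """Return the key of the reference text closest to `sample`.

    `references` maps a (hypothetical) author label to a reference text.
    """
    sample_dist = triplet_distribution(sample)
    return min(
        references,
        key=lambda name: l1_distance(
            sample_dist, triplet_distribution(references[name])
        ),
    )
```

With this choice of norm the distance lies between 0 (identical distributions) and 2 (no triplets in common), so a sample would be attributed to the reference author with the smallest distance.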
Keywords:
empirical probability, minimal text length, author identification.
Citation:
L. A. Borisov, Yu. N. Orlov, K. P. Osminin, “Identification of a text author by the letter frequency empirical distribution”, Keldysh Institute preprints, 2013, 027, 26 pp.
Linking options:
https://www.mathnet.ru/eng/ipmp1777 https://www.mathnet.ru/eng/ipmp/y2013/p27