This article is cited in 1 scientific paper (total in 1 paper)
Theoretical and Applied Mathematics
Approximation of distributions of text characters bigrams frequencies for alphabetic characters identification
Yu. A. Kotov
Novosibirsk State Technical University (NSTU)
Abstract:
The article examines how frequency ordering and approximation methods apply to the problem of identifying text characters. It defines the conditions under which Jacobsen's method achieves the smallest identification error, and proposes a method for approximating one- and two-dimensional distributions of character bigram frequencies in a text and in a language. Experimental data on the errors of Jacobsen's method and of the proposed approximation method are reported for Russian-language texts.
The error of the proposed method is lower than that of Jacobsen's method. The method can be used to identify text characters in any language for which a reference distribution of alphabetic character bigram frequencies is available.
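The general idea of matching a text's bigram frequencies against a reference distribution can be illustrated with a minimal sketch. It is not the approximation method proposed in the article, whose formulas the abstract does not give. It assumes a one-to-one substitution over a single shared alphabet and scores a candidate mapping by the sum of absolute differences between reference and observed bigram frequencies, a common objective for this kind of problem. The function names `bigram_matrix` and `mismatch` are hypothetical.

```python
from collections import Counter


def bigram_matrix(text, alphabet):
    """Relative frequencies of adjacent character pairs, keyed by (a, b).

    Pairs containing a character outside `alphabet` are ignored.
    """
    allowed = set(alphabet)
    pairs = [(a, b) for a, b in zip(text, text[1:])
             if a in allowed and b in allowed]
    counts = Counter(pairs)
    total = len(pairs)
    return {(a, b): (counts[(a, b)] / total if total else 0.0)
            for a in alphabet for b in alphabet}


def mismatch(observed, reference, mapping, alphabet):
    """Score a candidate identification (assumed objective, lower is better).

    `mapping` sends each character of the analysed text to the reference
    character it is identified with; it must be a permutation of `alphabet`.
    """
    inverse = {ref_ch: text_ch for text_ch, ref_ch in mapping.items()}
    return sum(abs(reference[(a, b)] - observed[(inverse[a], inverse[b])])
               for a in alphabet for b in alphabet)
```

A search procedure would then vary `mapping` (for example by swapping pairs of characters) and keep the changes that lower the score. How the candidates are ordered and how the distributions are approximated is where the methods compared in the article differ.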
Keywords:
approximation; identification; character; bigram; one-to-one substitution; cypher.
Citation:
Yu. A. Kotov, “Approximation of distributions of text characters bigrams frequencies for alphabetic characters identification”, Tr. SPIIRAN, 50 (2017), 190–208
Linking options:
https://www.mathnet.ru/eng/trspy932 https://www.mathnet.ru/eng/trspy/v50/p190