Trudy SPIIRAN
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Informatics and Automation:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Trudy SPIIRAN, 2016, Issue 49, Pages 104–121
DOI: https://doi.org/10.15622/sp.49.6
(Mi trspy919)
 

This article is cited in 7 scientific papers (total in 7 papers)

Methods of Information Processing and Management

Method of the artificial texts identification based on the calculation of the belonging measure to the invariants

A. O. Shumskaya

Tomsk State University of Control Systems and Radioelectronics (TUSUR)
Full-text PDF (910 kB) Citations (7)
Abstract: The work is devoted to the identification of texts generated automatically (artificially) with the use of software algorithms. This is an important and topical issue, because such texts are being widely spread on the Internet. Created «copies» of the web pages are used to attract readers to online resources as well as to disseminate a large number of unique copies of pages with content specific orientation.
This article describes the features of determining the origin of the text by the example of working on texts generated by synonymization as the most common method of generating artificial web content. The author provides an invariant of artificial texts as a set of the values of text characteristics, which allows classification of texts according to the process of their creation. The article proposes a method of the artificial texts identification based on the calculation of the belonging measure to the invariants, which allows making a decision about the origin of the text. The article also presents values obtained from the experiments on identifying artificial texts.
Keywords: automatically generated texts; artificial texts; massively generated texts; text features; text attribution.
Bibliographic databases:
Document Type: Article
UDC: 004.072.7
Language: Russian
Citation: A. O. Shumskaya, “Method of the artificial texts identification based on the calculation of the belonging measure to the invariants”, Tr. SPIIRAN, 49 (2016), 104–121
Citation in format AMSBIB
\Bibitem{Shu16}
\by A.~O.~Shumskaya
\paper Method of the artificial texts identification based on the calculation of the belonging measure to the invariants
\jour Tr. SPIIRAN
\yr 2016
\vol 49
\pages 104--121
\mathnet{http://mi.mathnet.ru/trspy919}
\crossref{https://doi.org/10.15622/sp.49.6}
\elib{https://elibrary.ru/item.asp?id=27657125}
Linking options:
  • https://www.mathnet.ru/eng/trspy919
  • https://www.mathnet.ru/eng/trspy/v49/p104
  • This publication is cited in the following 7 articles:
    Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Informatics and Automation
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024