Sistemy i Sredstva Informatiki [Systems and Means of Informatics]
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive
Impact factor

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Sistemy i Sredstva Inform.:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Sistemy i Sredstva Informatiki [Systems and Means of Informatics], 2017, Volume 27, Issue 1, Pages 100–107
DOI: https://doi.org/10.14357/08696527170107
(Mi ssi505)
 

This article is cited in 2 scientific papers (total in 2 papers)

On the main types of relatedness between text documents

M. M. Charnine, N. V. Somin

Institute of Informatics Problems, Federal Research Center "Computer Science and Control" of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation
Full-text PDF (222 kB) Citations (2)
References:
Abstract: This paper considers the question of relatedness of natural language texts based on textual features (fragments). Two types of relatedness are revealed: first, explicit relatedness, when the texts are linked by bibliographic references, and, second, implicit relatedness, when the texts are linked through common text fragments. The advantages and applications of implicit relatedness are discussed. It is shown that the use of implicit relatedness increases the scope of text processing techniques based on relatedness of texts significantly. Measures of explicit and implicit relatedness are proposed. An experiment was conducted on a set of texts from the subject area of “computer graphics”. On the basis of the experiment, it was shown that both types of relatedness are correlated with each other. The authors found the parameters of text processing when the correlation was at maximum and reached about 55%. The plan for further development of the proposed method of texts comparison and refinement of the results is suggested.
Keywords: relatedness between texts; explicit relatedness; implicit relatedness; measure of relatedness; collection of texts; correlation.
Funding agency Grant number
Russian Foundation for Basic Research 16-07-00756_а
16-29-09527_офи_м
15-07-06586_а
The work was supported by the Russian Foundation for Basic Research (projects 16-07-00756, 16-29-09527, and 15-07-06586).
Received: 29.10.2016
Bibliographic databases:
Document Type: Article
Language: Russian
Citation: M. M. Charnine, N. V. Somin, “On the main types of relatedness between text documents”, Sistemy i Sredstva Inform., 27:1 (2017), 100–107
Citation in format AMSBIB
\Bibitem{ShaSom17}
\by M.~M.~Charnine, N.~V.~Somin
\paper On the main types of relatedness between text documents
\jour Sistemy i Sredstva Inform.
\yr 2017
\vol 27
\issue 1
\pages 100--107
\mathnet{http://mi.mathnet.ru/ssi505}
\crossref{https://doi.org/10.14357/08696527170107}
\elib{https://elibrary.ru/item.asp?id=29160548}
Linking options:
  • https://www.mathnet.ru/eng/ssi505
  • https://www.mathnet.ru/eng/ssi/v27/i1/p100
  • This publication is cited in the following 2 articles:
    Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Системы и средства информатики
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024