Abstract:
The article covers research in the field of Natural Language Processing. The method and the algorithm for searching thematically similar documents are presented. A comparison of various measures of thematic similarity and sets of features is performed.
Keywords:
text similarity, vector space model, TF, IDF, topic importance characteristic, measure of thematic similarity, assessment of methods for information retrieval, DCG.
Citation:
R. E. Suvorov, I. V. Sochenkov, “Method for detecting relationships between sci-tech documents based on topic importance characteristic”, Artificial Intelligence and Decision Making, 2013, no. 1, 33–40; Scientific and Technical Information Processing, 42:5 (2015), 321–327
\Bibitem{SuvSoc13}
\by R.~E.~Suvorov, I.~V.~Sochenkov
\paper Method for detecting relationships between sci-tech documents based on topic importance characteristic
\jour Artificial Intelligence and Decision Making
\yr 2013
\issue 1
\pages 33--40
\mathnet{http://mi.mathnet.ru/iipr387}
\elib{https://elibrary.ru/item.asp?id=19096186}
\transl
\jour Scientific and Technical Information Processing
\yr 2015
\vol 42
\issue 5
\pages 321--327
\crossref{https://doi.org/10.3103/S0147688215050081}
Linking options:
https://www.mathnet.ru/eng/iipr387
https://www.mathnet.ru/eng/iipr/y2013/i1/p33
This publication is cited in the following 3 articles:
Yulia Otmakhova, Dmitry Devyatkin, Lecture Notes in Information Systems and Organisation, 54, Digital Transformation in Industry, 2022, 481
Sergey Volkov, Dmitry Devyatkin, Ilya Tikhomirov, Ilya Sochenkov, Communications in Computer and Information Science, 1427, Data Analytics and Management in Data Intensive Domains, 2021, 204
V. N. Shvedenko, O. V. Shchekochikhin, Y. A. Sinkevich, “A Methodology of Constructing a Distributed Information System for Searching for Scientific and Technical Information Based on an Object Data Model”, Autom. Doc. Math. Linguist., 54:5 (2020), 243