|
This article is cited in 1 scientific paper (total in 1 paper)
Matrix text models. Text corpora models
M. G. Kreines, E. M. Kreines BaseTech Llc, Moscow
Abstract:
The models of text corpora, formed on the basis of the matrix model of texts in natural
languages, are presented. As methods to form models of collections we consider the
techniques of computational identification of the thematic structure of the collections.
We suggest to use the models for searching for thematically similar text collections and
thematic categorization of texts based on text models and text collections. The differences of the proposed models of text collections from the common approaches to their
analysis and modeling are analyzed.
Keywords:
natural language texts, text corpora, text corpora models, topic models, text models, text information retrieval.
Received: 16.05.2019 Revised: 16.05.2019 Accepted: 01.07.2019
Citation:
M. G. Kreines, E. M. Kreines, “Matrix text models. Text corpora models”, Matem. Mod., 32:2 (2020), 37–57; Math. Models Comput. Simul., 12:5 (2020), 779–790
Linking options:
https://www.mathnet.ru/eng/mm4154 https://www.mathnet.ru/eng/mm/v32/i2/p37
|
Statistics & downloads: |
Abstract page: | 358 | Full-text PDF : | 116 | References: | 37 | First page: | 10 |
|