This article is cited in 2 scientific papers (total in 2 papers)
Algorithms and Software
Automatic Categorization of Documents Using Latent Semantic Analysis and Fuzzy Inference Algorithm of Mamdani
A. D. Khomonenko (a), S. V. Logashev (b), S. A. Krasnov (b)
(a) Petersburg State Transport University
(b) Mozhaisky Military Space Academy
Abstract:
We propose an approach to the automatic categorization of text documents that combines latent semantic analysis (LSA) with the Mamdani fuzzy inference algorithm. LSA is used for the semantic analysis of information in electronic document management systems: it identifies semantic relationships between the terms of documents and yields a degree of correspondence between the compared vectors. We propose a rule base for the Mamdani fuzzy inference algorithm that performs automatic rubrication of documents over a given set of topics. Based on the results of latent semantic analysis, it also enables automated monitoring of the distribution of documents that are not relevant to the specified topics or that are similar to several thematic categories.
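The abstract does not give the rule base itself, so the following is only a minimal sketch of how Mamdani inference could turn LSA similarity scores into a rubrication decision. Everything in it is an assumption made for illustration: the membership functions `low` and `high`, the three rules, the decision thresholds, and the names `mamdani_score` and `categorize` are hypothetical and are not taken from the paper. The inputs are assumed to be similarity values in [0, 1] between a document vector and each topic vector, as LSA would produce.

```python
# Hypothetical sketch: Mamdani fuzzy inference over LSA similarity scores.
# Membership functions, rules, and thresholds are illustrative assumptions.

def clamp(v):
    """Restrict a value to the interval [0, 1]."""
    return max(0.0, min(1.0, v))

def low(x):
    """Input term 'low similarity': 1 below 0.2, falling to 0 at 0.6."""
    return clamp((0.6 - x) / 0.4)

def high(x):
    """Input term 'high similarity': 0 below 0.4, rising to 1 at 0.8."""
    return clamp((x - 0.4) / 0.4)

# Output terms on a decision axis y in [0, 1].
OUTPUT_TERMS = {
    "reject": lambda y: clamp(1.0 - 2.0 * y),
    "ambiguous": lambda y: clamp(1.0 - 2.0 * abs(y - 0.5)),
    "assign": lambda y: clamp(2.0 * y - 1.0),
}

def mamdani_score(best, second, steps=200):
    """Return a crisp decision score from the two highest similarities."""
    # Rule firing strengths (AND is modelled by min):
    #   IF best is low                     THEN reject
    #   IF best is high AND second is low  THEN assign
    #   IF best is high AND second is high THEN ambiguous
    strengths = {
        "reject": low(best),
        "assign": min(high(best), low(second)),
        "ambiguous": min(high(best), high(second)),
    }
    num = den = 0.0
    for i in range(steps + 1):
        y = i / steps
        # Clip each output term by its rule strength, aggregate with max.
        mu = max(min(s, OUTPUT_TERMS[name](y)) for name, s in strengths.items())
        num += y * mu
        den += mu
    # Centroid defuzzification; fall back to the midpoint if no rule fires.
    return num / den if den > 0 else 0.5

def categorize(similarities):
    """Map {topic: similarity} to (decision, best_topic)."""
    ranked = sorted(similarities.items(), key=lambda kv: kv[1], reverse=True)
    best_topic, best = ranked[0]
    second = ranked[1][1] if len(ranked) > 1 else 0.0
    score = mamdani_score(best, second)
    if score < 1 / 3:
        return "reject", None
    if score > 2 / 3:
        return "assign", best_topic
    return "ambiguous", best_topic
```

For example, `categorize({"finance": 0.9, "law": 0.1})` assigns the document to "finance", while two comparably high similarities lead to the "ambiguous" decision, which corresponds to a document resembling several thematic categories.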
Keywords:
rubrication of documents; fuzzy inference; latent semantic analysis; rule base; Mamdani fuzzy inference algorithm.
Citation:
A. D. Khomonenko, S. V. Logashev, S. A. Krasnov, “Automatic Categorization of Documents Using Latent Semantic Analysis and Fuzzy Inference Algorithm of Mamdani”, Tr. SPIIRAN, 44 (2016), 5–19
Linking options:
https://www.mathnet.ru/eng/trspy851 https://www.mathnet.ru/eng/trspy/v44/p5