Computer Optics
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Computer Optics:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Computer Optics, 2015, Volume 39, Issue 3, Pages 429–438
DOI: https://doi.org/10.18287/0134-2452-2015-39-3-429-438
(Mi co106)
 

This article is cited in 2 scientific papers (total in 2 papers)

IMAGE PROCESSING, PATTERN RECOGNITION

An approach based on TF-IDF metrics to extract the knowledge and relevant linguistic means on subject-oriented text sets

D. V. Mikhaylov, A. P. Kozlov, G. M. Emelyanov

Yaroslav-the-Wise Novgorod State University, Novgorod, Russia
Full-text PDF (251 kB) Citations (2)
References:
Abstract: In this paper we look at a problem of extracting knowledge units from the sets of subject-oriented texts. Each such text set is considered as a corpus. The main practical goal here is finding the most rational variant to express the knowledge fragment in a given natural language for further reflection in the thesaurus and ontology of a subject area. The problem is of importance when constructing systems for processing, analysis, estimation and understanding of information represented, in particular, by images. In this paper, by applying the TF-IDF metrics to classify words of the initial phrase in relation to given text corpora we address the task of selecting phrases closest to the initial one in terms of the described fragment of actual knowledge or forms of its expression in a given natural language.
Keywords: pattern recognition, intelligent data analysis, information theory, open-form test assignment, natural-language expression of expert knowledge.
Funding agency Grant number
Russian Foundation for Basic Research 13-01-00055
Ministry of Education and Science of the Russian Federation
This work was supported by RFBR (project №13-01-00055) and the Ministry of Education of the Russian Federation (the base portion goszadaniya).
Received: 22.04.2015
Revised: 02.06.2015
Document Type: Article
Language: Russian
Citation: D. V. Mikhaylov, A. P. Kozlov, G. M. Emelyanov, “An approach based on TF-IDF metrics to extract the knowledge and relevant linguistic means on subject-oriented text sets”, Computer Optics, 39:3 (2015), 429–438
Citation in format AMSBIB
\Bibitem{MikKozEme15}
\by D.~V.~Mikhaylov, A.~P.~Kozlov, G.~M.~Emelyanov
\paper An approach based on TF-IDF metrics to extract the knowledge and relevant linguistic means on subject-oriented text sets
\jour Computer Optics
\yr 2015
\vol 39
\issue 3
\pages 429--438
\mathnet{http://mi.mathnet.ru/co106}
\crossref{https://doi.org/10.18287/0134-2452-2015-39-3-429-438}
Linking options:
  • https://www.mathnet.ru/eng/co106
  • https://www.mathnet.ru/eng/co/v39/i3/p429
  • This publication is cited in the following 2 articles:
    Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Computer Optics
    Statistics & downloads:
    Abstract page:204
    Full-text PDF :73
    References:45
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024