Vestnik of Astrakhan State Technical University. Series: Management, Computer Sciences and Informatics
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive
Impact factor

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Vestn. Astrakhan State Technical Univ. Ser. Management, Computer Sciences and Informatics:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Vestnik of Astrakhan State Technical University. Series: Management, Computer Sciences and Informatics, 2012, Number 1, Pages 136–141 (Mi vagtu45)  

COMPUTER SOFTWARE AND COMPUTING EQUIPMENT

Principles of construction of the multidimensional space of terms in the analysis of object-oriented collection of documents

R. V. Khrunichev

Ryazan State of Radio Engineering University
References:
Abstract: The paper considers the problem of information retrieval in object-oriented collection of documents, the possibility of searching for documents by means of the application of the modified search model, based on the vector model. Modernization of the vector model is the ability to use object-oriented glossary of terms at the stage of preliminary processing of the text, allowing to reduce the number of terms for subsequent frequency analysis of the text. Zipf's rule and the principle of Luhn, used during the frequency analysis, can also significantly reduce the number of analyzed terms. The paper shows the principle of construction of the multidimensional space of terms, based on the vectors that describe the document. The principles of these vectors formation are given. The article also lists the advantages of the object-oriented vocabulary application in the process of constructing the space of terms, consisting in the possibility of separating of composite terms, and through this, more accurate positioning of the document in its issue upon request.
Keywords: object-oriented collection of documents, frequency analysis of the text, data warehouse, space of terms.
Received: 30.11.2011
Revised: 19.12.2011
Document Type: Article
UDC: 002.513.5
Language: Russian
Citation: R. V. Khrunichev, “Principles of construction of the multidimensional space of terms in the analysis of object-oriented collection of documents”, Vestn. Astrakhan State Technical Univ. Ser. Management, Computer Sciences and Informatics, 2012, no. 1, 136–141
Citation in format AMSBIB
\Bibitem{Khr12}
\by R.~V.~Khrunichev
\paper Principles of construction of the multidimensional space of terms in the analysis of object-oriented collection of documents
\jour Vestn. Astrakhan State Technical Univ. Ser. Management, Computer Sciences and Informatics
\yr 2012
\issue 1
\pages 136--141
\mathnet{http://mi.mathnet.ru/vagtu45}
Linking options:
  • https://www.mathnet.ru/eng/vagtu45
  • https://www.mathnet.ru/eng/vagtu/y2012/i1/p136
  • Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Вестник Астраханского государственного технического университета. Серия: Управление, вычислительная техника и информатика
    Statistics & downloads:
    Abstract page:130
    Full-text PDF :58
    References:20
    First page:1
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024