D. V. Bondarchuk, “Vector space model of knowledge representation based on semantic relatedness”, Vestn. YuUrGU. Ser. Vych. Matem. Inform., 6:3 (2017), 73

Vestnik Yuzhno-Ural'skogo Gosudarstvennogo Universiteta. Seriya "Vychislitelnaya Matematika i Informatika"

RUS ENG

JOURNALS PEOPLE ORGANISATIONS CONFERENCES SEMINARS VIDEO LIBRARY PACKAGE AMSBIB

JavaScript is disabled in your browser. Please switch it on to enable full functionality of the website

	General information
	Latest issue
	Archive

	Search papers
	Search references

	RSS
	Latest issue
	Current issues
	Archive issues
	What is RSS

Vestn. YuUrGU. Ser. Vych. Matem. Inform.:
Year:
Volume:
Issue:
Page:
	Find

Personal entry:
Login:
Password:
	Save password
	Enter
	Forgotten password?
	Register

Vestnik Yuzhno-Ural'skogo Gosudarstvennogo Universiteta. Seriya "Vychislitelnaya Matematika i Informatika", 2017, Volume 6, Issue 3, Pages 73–83
DOI: https://doi.org/10.14529/cmse170305 (Mi vyurv172)

Computer Science, Engineering and Control

Vector space model of knowledge representation based on semantic relatedness

D. V. Bondarchuk

Ural State University of Railway Transport (st. Kolmogorova 66, Yekaterinburg, 620034 Russia)

Full-text PDF (510 kB)

References:

PDF

HTML

DOI: https://doi.org/10.14529/cmse170305

Abstract: Most of text mining algorithms uses vector space model of knowledge representation. Vector space model uses the frequency (weight) of term to determine its importance in the document. Terms can be semantically similarbut different lexicographically, which in turn will lead to the fact that the classification is based on the frequencyof the terms does not give the desired result.
Analysis of a low-quality results shows that errors occur due to the characteristics of natural language, which were not taken into account. Neglect of these features, namely, synonymy and polysemy, increases the dimension ofsemantic space, which determines the performance of the final software product developed based on the algorithm.Furthermore, the results of many complex algorithms perceived domain expert to prepare training sample, whichin turn also affects quality issue algorithm.
We propose a model that in addition to the weight of a term in a document also uses semantic weight of the term. Semantic weight terms, the higher they are semantically closer to each other.
To calculate the semantic similarity of terms we propose to use a adaptation of the extended Lesk algorithm. The method of calculating semantic similarity lies in the fact that for each value of the word in question is countedas the number of words referred to the dictionary definition of this value (assuming that the dictionary definitiondescribes several meanings of the word), and in the immediate context of the word in question. As the mostprobable meaning of the word is selected such that this intersection was more. Vector model based on semanticproximity of terms solves the problem of the ambiguity of synonyms.

Keywords: text-mining, vector space model, semantic relatedness.

Received: 26.07.2015

Bibliographic databases:

Document Type: Article

UDC: 004.822

Language: Russian

Citation: D. V. Bondarchuk, “Vector space model of knowledge representation based on semantic relatedness”, Vestn. YuUrGU. Ser. Vych. Matem. Inform., 6:3 (2017), 73–83

Citation in format AMSBIB

\Bibitem{Bon17}

\by D.~V.~Bondarchuk

\paper Vector space model of knowledge representation based on semantic relatedness

\jour Vestn. YuUrGU. Ser. Vych. Matem. Inform.

\yr 2017

\vol 6

\issue 3

\pages 73--83

\mathnet{http://mi.mathnet.ru/vyurv172}

\crossref{https://doi.org/10.14529/cmse170305}

\elib{https://elibrary.ru/item.asp?id=30016529}

Linking options:

https://www.mathnet.ru/eng/vyurv172

https://www.mathnet.ru/eng/vyurv/v6/i3/p73

Citing articles in Google Scholar: Russian citations, English citations
Related articles in Google Scholar: Russian articles, English articles

Vestnik Yuzhno-Ural'skogo Gosudarstvennogo Universiteta. Seriya "Vychislitelnaya Matematika i Informatika"

Statistics & downloads:
Abstract page:	145
Full-text PDF :	113
References:	24

Что такое QR-код?

Registration to the website

Logotypes