M. P. Krivenko, “Noisy text analytics”, Sistemy i Sredstva Inform., 32:4 (2022), 45

Sistemy i Sredstva Informatiki [Systems and Means of Informatics]

RUS ENG

JOURNALS PEOPLE ORGANISATIONS CONFERENCES SEMINARS VIDEO LIBRARY PACKAGE AMSBIB

JavaScript is disabled in your browser. Please switch it on to enable full functionality of the website

	General information
	Latest issue
	Archive
	Impact factor

	Search papers
	Search references

	RSS
	Latest issue
	Current issues
	Archive issues
	What is RSS

Sistemy i Sredstva Inform.:
Year:
Volume:
Issue:
Page:
	Find

Personal entry:
Login:
Password:
	Save password
	Enter
	Forgotten password?
	Register

Sistemy i Sredstva Informatiki [Systems and Means of Informatics], 2022, Volume 32, Issue 4, Pages 45–58
DOI: https://doi.org/10.14357/08696527220405 (Mi ssi855)

Noisy text analytics

M. P. Krivenko

Federal Research Center "Computer Science and Control" of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation

Full-text PDF (200 kB)

References:

PDF

HTML

DOI: https://doi.org/10.14357/08696527220405

Abstract: The article is devoted to an overview of methods for interpreting noisy text data in order to obtain significant information from them. Analytics allows one to isolate useful concepts, draw conclusions from the collected data, and form a forecast. It is assumed that the texts being processed may not correspond to the target and selected reference language. Such deviations can be caused by measurement and fixation errors, be the result of the influence of random or unforeseen factors, or arise as a result of incorrect choice or tuning of the model. The article lists the types of distortions. The areas of application of methods of intellectual text processing are considered: scientific publications; blogging; e-mails; social media; speech messages; and web analytics. The methods focused on the processing of noisy texts are indicated. Promising directions for further research are formulated: clarification of the concepts of “noise” and “dirty” texts; development of ways to measure the degree of anomaly of the text; systematization of analytical tasks of text processing; and formation of criteria for the effectiveness of methods of intellectual analysis of the text to facilitate the selection of suitable technologies.

Keywords: text mining, noisy text, dirty text, analytics, review.

Received: 22.06.2022

Document Type: Article

Language: Russian

Citation: M. P. Krivenko, “Noisy text analytics”, Sistemy i Sredstva Inform., 32:4 (2022), 45–58

Citation in format AMSBIB

\Bibitem{Kri22}

\by M.~P.~Krivenko

\paper Noisy text analytics

\jour Sistemy i Sredstva Inform.

\yr 2022

\vol 32

\issue 4

\pages 45--58

\mathnet{http://mi.mathnet.ru/ssi855}

\crossref{https://doi.org/10.14357/08696527220405}

Linking options:

https://www.mathnet.ru/eng/ssi855

https://www.mathnet.ru/eng/ssi/v32/i4/p45

Citing articles in Google Scholar: Russian citations, English citations
Related articles in Google Scholar: Russian articles, English articles

Что такое QR-код?

Registration to the website

Logotypes