I. S. Kipyatkova, “Software for Creation of Sintactico-Statistical Russian Language Model Based on the Text Corpus”, Tr. SPIIRAN, 24 (2013), 332

Loading [MathJax]/jax/output/SVG/config.js

Trudy SPIIRAN

RUS ENG

JOURNALS PEOPLE ORGANISATIONS CONFERENCES SEMINARS VIDEO LIBRARY PACKAGE AMSBIB

JavaScript is disabled in your browser. Please switch it on to enable full functionality of the website

	General information
	Latest issue
	Archive

	Search papers
	Search references

	RSS
	Latest issue
	Current issues
	Archive issues
	What is RSS

Informatics and Automation:
Year:
Volume:
Issue:
Page:
	Find

Personal entry:
Login:
Password:
	Save password
	Enter
	Forgotten password?
	Register

Trudy SPIIRAN, 2013, Issue 24, Pages 332–348 (Mi trspy571)

Software for Creation of Sintactico-Statistical Russian Language Model Based on the Text Corpus

I. S. Kipyatkova

St. Petersburg Institute for Informatics and Automation of RAS

Full-text PDF (768 kB)

References:

PDF

HTML

Abstract: Creation of the language model is one of the stages of training of a continuous speech recognition system. In the paper, the developed software for creation of syntactic-statistical Russian language model based on a text corpus is described. The main stages of the algorithm are preliminary text material processing, creation of statistical n-gram language model, extension of the statistical model by n-grams obtained by syntactical analysis. Syntactical analysis permits to increase the quantity of different bigrams created during text processing and to improve the quality of the language model by extracting grammatically-connected word pairs. The results of the testing of the language models created with the help of the software module are presented.

Keywords: automatic speech recognition, statistical language model, syntactical analysis.

Received: 01.02.2013

Document Type: Article

UDC: 004.522

PACS: 43.71.Sy

MSC: 68T50

Language: Russian

Citation: I. S. Kipyatkova, “Software for Creation of Sintactico-Statistical Russian Language Model Based on the Text Corpus”, Tr. SPIIRAN, 24 (2013), 332–348

Citation in format AMSBIB

\Bibitem{Kip13}

\by I.~S.~Kipyatkova

\paper Software for Creation of Sintactico-Statistical Russian Language Model Based on the Text Corpus

\jour Tr. SPIIRAN

\yr 2013

\vol 24

\pages 332--348

\mathnet{http://mi.mathnet.ru/trspy571}

Linking options:

https://www.mathnet.ru/eng/trspy571

https://www.mathnet.ru/eng/trspy/v24/p332

Citing articles in Google Scholar: Russian citations, English citations
Related articles in Google Scholar: Russian articles, English articles

Statistics & downloads:
Abstract page:	259
Full-text PDF :	120
References:	46
First page:	1

Registration to the website

Logotypes