D. Obukhov, “Speech recognition system for russian-language telephone speech”, UBS, 89 (2021), 106

Upravlenie Bol'shimi Sistemami

Upravlenie Bol'shimi Sistemami, 2021, Issue 89, Pages 106–122
DOI: https://doi.org/10.25728/ubs.2021.89.4 (Mi ubs1070)

Simulation Tools for Control Systems and Controlled Objects

Speech recognition system for russian-language telephone speech

D. Obukhov

Novosibirsk State Technical University

Abstract: We describe a system designed to recognize Russian-language speech. Our focus is on the domain of telephone conversations, where the input is a single-channel, noisy audio signal with a sample rate of 8 kHz. Additionally, data from YouTube is used for training. We consider a number of acoustic models and techniques for building a lexicon and a language model. In addition, we conduct experiments on the influence of speaker information. We also show that augmentation techniques such as reverberation, changing the speed and volume of the signal, and masking in the frequency and time domains significantly improve recognition quality. We achieve a word error rate of 24.21 on our validation dataset.
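
For readers unfamiliar with the metric reported in the abstract, word error rate is conventionally defined as the word-level edit distance (substitutions, deletions, insertions) between the recognized text and the reference transcript, divided by the number of reference words. The following is a minimal sketch of that conventional definition only. The function name `word_error_rate` and the whitespace tokenization are illustrative assumptions, not taken from the paper, and the paper's own scoring procedure (for example, text normalization) may differ.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the number of reference words.

    Illustrative helper (hypothetical name); tokenization is plain whitespace
    splitting, which is an assumption and not the paper's stated procedure.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # substitution or exact match
            )
        prev = curr
    return prev[-1] / len(ref)
```

For example, scoring the hypothesis "a x c" against the reference "a b c d" counts one substitution and one deletion over four reference words, giving 0.5. Whether a figure such as the one in the abstract is expressed as a fraction or a percentage depends on the reporting convention used.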

Keywords: speech recognition, Russian-language speech, acoustic model, language model, speech augmentation, speaker embedding.

Received: May 9, 2020
Published: January 31, 2021

Document Type: Article

UDC: 004.934.1

BBC: 32.813

Language: Russian

Citation: D. Obukhov, “Speech recognition system for russian-language telephone speech”, UBS, 89 (2021), 106–122

Citation in AMSBIB format

\Bibitem{Obu21}
\by D.~Obukhov
\paper Speech recognition system for russian-language telephone speech
\jour UBS
\yr 2021
\vol 89
\pages 106--122
\mathnet{http://mi.mathnet.ru/ubs1070}
\crossref{https://doi.org/10.25728/ubs.2021.89.4}

Linking options:

https://www.mathnet.ru/eng/ubs1070

https://www.mathnet.ru/eng/ubs/v89/p106
