A. G. Shishkin, S. D. Protserov, “Segmentation of noisy speech signals”, Artificial Intelligence and Decision Making, 2021, no. 1, 75–85; Scientific and Technical Information Processing, 49:5 (2022), 356–363

Artificial Intelligence and Decision Making

Artificial Intelligence and Decision Making, 2021, Issue 1, Pages 75–85
DOI: https://doi.org/10.14357/20718594210107 (Mi iipr93)

Analysis of signals, audio and video information

Segmentation of noisy speech signals

A. G. Shishkin, S. D. Protserov

Lomonosov Moscow State University, Moscow, Russia

Abstract: One of the most important problems in digital speech signal processing is determining which parts of an input acoustic signal contain speech and which contain background noise or silence. This problem arises in many practical applications, such as speech analysis in voice command systems, transmission of speech over a network, and automated speech recognition. However, most existing systems for automated speech analysis cannot solve this problem efficiently when the signal-to-noise ratio is too low. Moreover, their parameters have to be tuned separately for different noise levels, which prevents fully automated segmentation of noisy speech signals. In this paper we design a system for automated segmentation of speech signals distorted by additive noise of different types and intensities. The system is based on three different convolutional neural network models and can efficiently determine speech and silence segments in noisy signals across a wide range of noise intensities and noise types.

Keywords: speech signal, convolutional neural network, segmentation, digital signal processing.
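
The two notions the abstract relies on, additive noise at a given signal-to-noise ratio and a division of the signal into speech and silence segments, can be illustrated with a minimal sketch. This is not the authors' system: the function names `mix_at_snr` and `frames_to_segments`, the per-frame 0/1 labels, and the frame length are all assumptions made for illustration, and the per-frame labels stand in for the output of any classifier (such as a convolutional network).

```python
import math


def mix_at_snr(clean, noise, snr_db):
    """Add noise to a clean signal so the mixture has the requested SNR in dB.

    Hypothetical helper: both inputs are equal-length lists of samples,
    and the noise must have non-zero power.
    """
    p_clean = sum(x * x for x in clean) / len(clean)
    p_noise = sum(n * n for n in noise) / len(noise)
    # Choose gain so that 10 * log10(p_clean / (gain**2 * p_noise)) == snr_db.
    gain = math.sqrt(p_clean / (p_noise * 10 ** (snr_db / 10)))
    return [x + gain * n for x, n in zip(clean, noise)]


def frames_to_segments(labels, frame_len):
    """Merge per-frame labels (1 = speech, 0 = silence) into segments.

    Returns a list of (start_sample, end_sample, label) tuples, where
    end_sample is exclusive. The labels are assumed to come from some
    frame-level classifier; how they are produced is outside this sketch.
    """
    segments = []
    start = 0
    for i in range(1, len(labels) + 1):
        # Close the current run at the end of the list or when the label changes.
        if i == len(labels) or labels[i] != labels[start]:
            segments.append((start * frame_len, i * frame_len, labels[start]))
            start = i
    return segments
```

For example, `frames_to_segments([0, 0, 1, 1, 1, 0], 160)` groups six frames of 160 samples into a silence segment, a speech segment, and a final silence segment. Lowering `snr_db` in `mix_at_snr` produces the low-SNR conditions under which, according to the abstract, most existing systems fail.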

English version:
Scientific and Technical Information Processing, 2022, Volume 49, Issue 5, Pages 356–363
DOI: https://doi.org/10.3103/S0147688222050100

Document Type: Article

Language: Russian

Citation: A. G. Shishkin, S. D. Protserov, “Segmentation of noisy speech signals”, Artificial Intelligence and Decision Making, 2021, no. 1, 75–85; Scientific and Technical Information Processing, 49:5 (2022), 356–363

Citation in AMSBIB format

\Bibitem{ShiPro21}
\by A.~G.~Shishkin, S.~D.~Protserov
\paper Segmentation of noisy speech signals
\jour Artificial Intelligence and Decision Making
\yr 2021
\issue 1
\pages 75--85
\mathnet{http://mi.mathnet.ru/iipr93}
\crossref{https://doi.org/10.14357/20718594210107}
\elib{https://elibrary.ru/item.asp?id=45149130}
\transl
\jour Scientific and Technical Information Processing
\yr 2022
\vol 49
\issue 5
\pages 356--363
\crossref{https://doi.org/10.3103/S0147688222050100}

Linking options:

https://www.mathnet.ru/eng/iipr93

https://www.mathnet.ru/eng/iipr/y2021/i1/p75
