Artificial Intelligence and Decision Making
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive
Guidelines for authors

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Artificial Intelligence and Decision Making:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Artificial Intelligence and Decision Making, 2021, Issue 1, Pages 75–85
DOI: https://doi.org/10.14357/20718594210107
(Mi iipr93)
 

Analysis of signals, audio and video information

Segmentation of noisy speech signals

A. G. Shishkin, S. D. Protserov

Lomonosov Moscow State University, Moscow, Russia
Abstract: One of the most important problems in digital speech signal processing is to determine which parts of input acoustic signal contain speech, and which contain background noise or silence. This problem arises in many important practical applications, such as speech analysis in voice command systems, transmission of speech over the network and automated speech recognition. However, most of the existing systems designed for automated speech analysis are unable to solve this problem efficiently if the signal-to-noise ratio is too low. Moreover, their parameters have to be tuned separately for different noise levels. This prevents fully automated segmentation of noisy speech signals. In this paper we design a system for automated segmentation of speech signals distorted by additive noise of different type and intensity. Our system is based on three different convolutional neural network models and is capable of efficiently determining speech and silence segments in noisy signals with a wide range of noise intensity and different noise types.
Keywords: speech signal, convolutional neural network, segmentation, digital signal processing.
English version:
Scientific and Technical Information Processing, 2022, Volume 49, Issue 5, Pages 356–363
DOI: https://doi.org/10.3103/S0147688222050100
Bibliographic databases:
Document Type: Article
Language: Russian
Citation: A. G. Shishkin, S. D. Protserov, “Segmentation of noisy speech signals”, Artificial Intelligence and Decision Making, 2021, no. 1, 75–85; Scientific and Technical Information Processing, 49:5 (2022), 356–363
Citation in format AMSBIB
\Bibitem{ShiPro21}
\by A.~G.~Shishkin, S.~D.~Protserov
\paper Segmentation of noisy speech signals
\jour Artificial Intelligence and Decision Making
\yr 2021
\issue 1
\pages 75--85
\mathnet{http://mi.mathnet.ru/iipr93}
\crossref{https://doi.org/10.14357/20718594210107}
\elib{https://elibrary.ru/item.asp?id=45149130}
\transl
\jour Scientific and Technical Information Processing
\yr 2022
\vol 49
\issue 5
\pages 356--363
\crossref{https://doi.org/10.3103/S0147688222050100}
Linking options:
  • https://www.mathnet.ru/eng/iipr93
  • https://www.mathnet.ru/eng/iipr/y2021/i1/p75
  • Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Artificial Intelligence and Decision Making
    Statistics & downloads:
    Abstract page:23
    Full-text PDF :12
    References:1
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024