Trudy SPIIRAN
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Informatics and Automation:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Trudy SPIIRAN, 2020, Issue 19, volume 2, Pages 249–276
DOI: https://doi.org/10.15622/sp.2020.19.2.1
(Mi trspy1098)
 

This article is cited in 1 scientific paper (total in 1 paper)

Artificial Intelligence, Knowledge and Data Engineering

Voice pathology detection based on analysis of modulation spectrum in critical bands

M. Vashkevich, E. Azarov

Belarusian State University of Informatics and Radioelectronics (BSUIR)
Abstract: The paper presents an approach to the analysis of the modulation spectrum of a voice signal, in which the primary acoustic analysis is performed in bands of unequal width. Nonuniform analysis corresponds to the psychoacoustic laws of human perception of sound information. In the context of the analysis of the modulation spectrum, the considered approach can significantly reduce the resulting number of parameters, which greatly simplifies the task of detecting pathological changes in the voice signal based on the analysis of the parameters of the modulation spectrum. For frequency decomposition of a signal into bands of unequal width, two methods are considered: 1) DFT with channel combination and 2) the use of an nonuniform filter bank. The first method is characterized by a fixed time window for the analysis of all frequency components, while in the second method the time-frequency analysis plan is consistent with the critical frequency scale of the barks. For each method, a practical signal analysis circuit has been developed and described. The paper presents the experimental data on the application of the developed schemes for the analysis of the modulation spectrum to the problem of detecting pathology in a speech signal. The parameters of the modulation spectrum acted as information signs for a classifier built on the basis of linear discriminant analysis. Three different voice bases were used in the experiment (in two cases, the pathology was neurological ALS disease (amyotrophic lateral sclerosis), and in the third case, diseases of the larynx). The parameters of the modulation spectrum obtained in the DFT-based scheme with channel combining turned out to be more preferable for classification with a small number of features, however, greater accuracy (with an increase in the number of features) made it possible to obtain the parameters obtained in the scheme based on an unequal filter bank. In all cases, the obtained classifiers were highly accurate (more than 97%). The obtained results show that the use of nonuniform time-frequency representation is preferable in the case when the analyzed signal is a sustained vowel phonation, since it provides higher accuracy of pathology detection using fewer modulation parameters.
Keywords: speech signal analysis, critical bands, modulation spectra, feature extraction, voice pathology detection.
Received: 18.03.2020
Document Type: Article
UDC: 616.71 + 612.78
Language: Russian
Citation: M. Vashkevich, E. Azarov, “Voice pathology detection based on analysis of modulation spectrum in critical bands”, Tr. SPIIRAN, 19:2 (2020), 249–276
Citation in format AMSBIB
\Bibitem{VasAza20}
\by M.~Vashkevich, E.~Azarov
\paper Voice pathology detection based on analysis of modulation spectrum in critical bands
\jour Tr. SPIIRAN
\yr 2020
\vol 19
\issue 2
\pages 249--276
\mathnet{http://mi.mathnet.ru/trspy1098}
\crossref{https://doi.org/10.15622/sp.2020.19.2.1}
Linking options:
  • https://www.mathnet.ru/eng/trspy1098
  • https://www.mathnet.ru/eng/trspy/v19/i2/p249
  • This publication is cited in the following 1 articles:
    Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Informatics and Automation
    Statistics & downloads:
    Abstract page:194
    Full-text PDF :61
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024