Loading [MathJax]/jax/output/SVG/config.js
Journal of Siberian Federal University. Mathematics & Physics
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive
Impact factor
Guidelines for authors

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



J. Sib. Fed. Univ. Math. Phys.:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Journal of Siberian Federal University. Mathematics & Physics, 2016, Volume 9, Issue 4, Pages 518–523
DOI: https://doi.org/10.17516/1997-1397-2016-9-4-518-523
(Mi jsfu514)
 

This article is cited in 7 scientific papers (total in 7 papers)

Speech-based emotion recognition and speaker identification: static vs. dynamic mode of speech representation

Maxim Sidorova, Wolfgang Minkera, Eugene S. Semenkinb

a Institute of Communications Engineering, Ulm University, Albert-Einstein-Allee, 43, Ulm, 89081
b Informatics and Telecommunications Institute, Reshetnev Siberian State Aerospace University, Krasnoyarskiy Rabochiy, 31, Krasnoyarsk, 660037, Russia
Full-text PDF (87 kB) Citations (7)
References:
Abstract: In this paper we present the performance of different machine learning algorithms for the problems of speech-based Emotion Recognition (ER) and Speaker Identification (SI) in static and dynamic modes of speech signal representation. We have used a multi-corporal, multi-language approach in the study. 3 databases for the problem of SI and 4 databases for the ER task of 3 different languages (German, English and Japanese) have been used in our study to evaluate the models. More than 45 machine learning algorithms were applied to these tasks in both modes and the results alongside discussion are presented here.
Keywords: emotion recognition from speech, speaker identification from speech, machine learning algorithms, speaker adaptive emotion recognition from speech.
Received: 28.12.2015
Received in revised form: 24.02.2016
Accepted: 15.09.2016
Bibliographic databases:
Document Type: Article
UDC: 519.87
Language: English
Citation: Maxim Sidorov, Wolfgang Minker, Eugene S. Semenkin, “Speech-based emotion recognition and speaker identification: static vs. dynamic mode of speech representation”, J. Sib. Fed. Univ. Math. Phys., 9:4 (2016), 518–523
Citation in format AMSBIB
\Bibitem{SidMinSem16}
\by Maxim~Sidorov, Wolfgang~Minker, Eugene~S.~Semenkin
\paper Speech-based emotion recognition and speaker identification: static vs. dynamic mode of speech representation
\jour J. Sib. Fed. Univ. Math. Phys.
\yr 2016
\vol 9
\issue 4
\pages 518--523
\mathnet{http://mi.mathnet.ru/jsfu514}
\crossref{https://doi.org/10.17516/1997-1397-2016-9-4-518-523}
\isi{https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcApp=Publons&SrcAuth=Publons_CEL&DestLinkType=FullRecord&DestApp=WOS_CPL&KeyUT=000412010800016}
Linking options:
  • https://www.mathnet.ru/eng/jsfu514
  • https://www.mathnet.ru/eng/jsfu/v9/i4/p518
  • This publication is cited in the following 7 articles:
    Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Журнал Сибирского федерального университета. Серия "Математика и физика"
    Statistics & downloads:
    Abstract page:455
    Full-text PDF :80
    References:44
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2025