This article is cited in 7 scientific papers.
Speech-based emotion recognition and speaker identification: static vs. dynamic mode of speech representation
Maxim Sidorov (a), Wolfgang Minker (a), Eugene S. Semenkin (b)
(a) Institute of Communications Engineering, Ulm University, Albert-Einstein-Allee 43, Ulm, 89081, Germany
(b) Informatics and Telecommunications Institute, Reshetnev Siberian State Aerospace University, Krasnoyarskiy Rabochiy 31, Krasnoyarsk, 660037, Russia
Abstract:
In this paper we present the performance of different machine learning algorithms on the problems of speech-based Emotion Recognition (ER) and Speaker Identification (SI) in static and dynamic modes of speech signal representation. We have taken a multi-corpus, multi-language approach: three databases for the SI problem and four databases for the ER task, covering three different languages (German, English, and Japanese), have been used to evaluate the models. More than 45 machine learning algorithms were applied to these tasks in both modes, and the results are presented and discussed here.
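The static/dynamic distinction in the abstract can be sketched as follows. This is a minimal illustration, not the paper's actual feature set: the frame values are synthetic, and the choice of mean and standard deviation as utterance-level functionals is an assumption about what a typical static representation looks like.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical frame-level acoustic features for one utterance:
# T frames, D coefficients per frame (e.g. MFCC-like); values are synthetic.
T, D = 120, 13
frames = rng.normal(size=(T, D))

# Dynamic mode: keep the whole time series (a T x D sequence),
# suitable for sequence models such as HMMs or recurrent networks.
dynamic_repr = frames

# Static mode: collapse the sequence into one fixed-length vector of
# utterance-level functionals (here: per-coefficient mean and standard
# deviation), suitable for conventional classifiers.
static_repr = np.concatenate([frames.mean(axis=0), frames.std(axis=0)])

print(dynamic_repr.shape)  # (120, 13)
print(static_repr.shape)   # (26,)
```

The dynamic representation varies in length with the utterance, while the static one has a fixed dimensionality regardless of duration, which is why the two modes pair naturally with different families of classifiers.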
Keywords:
emotion recognition from speech, speaker identification from speech, machine learning algorithms, speaker adaptive emotion recognition from speech.
Received: 28.12.2015. Received in revised form: 24.02.2016. Accepted: 15.09.2016
Citation:
Maxim Sidorov, Wolfgang Minker, Eugene S. Semenkin, “Speech-based emotion recognition and speaker identification: static vs. dynamic mode of speech representation”, J. Sib. Fed. Univ. Math. Phys., 9:4 (2016), 518–523
Linking options:
https://www.mathnet.ru/eng/jsfu514
https://www.mathnet.ru/eng/jsfu/v9/i4/p518