This article is cited in 1 scientific paper (total in 1 paper)
Data mining
Application of deep learning methods to recognize the emotional state of a person in a video image
A. G. Shishkin, A. A. Moskvin (Lomonosov Moscow State University, Moscow, Russia)
Abstract:
In this paper, deep neural networks are used to develop and implement a model that determines, in real time and with limited computing resources, a person's emotional state from a video sequence containing both the voice signal of the person whose state is to be determined and a frontal view of their face. Visual information is represented by 16 consecutive frames of 96 $\times$ 96 pixels, and the voice by 140 features for a sequence of 37 windows. Based on experimental studies, a model architecture using convolutional and recurrent neural networks is developed. For 7 classes corresponding to different emotional states (neutral, anger, sadness, fear, joy, disappointment, and surprise) the recognition accuracy is 59%. Experiments showed that using audio information together with visual information increases recognition accuracy by 12%. The resulting system is flexible with respect to the choice of parameters and to narrowing or expanding the number of classes, and it allows information from other external devices to be easily added, accumulated, and used for further development and improvement of classification accuracy.
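The pipeline described in the abstract (per-frame convolutional features fed to a recurrent encoder, a second recurrent encoder over the audio windows, late fusion, and a softmax over 7 emotion classes) can be sketched as follows. This is a minimal, dependency-free illustration of the data flow only: the random-projection "encoders", hidden sizes, grayscale frames, and late-fusion choice are assumptions for the sketch, not the authors' actual architecture.

```python
import numpy as np

rng = np.random.default_rng(0)

# Input shapes as stated in the abstract:
# visual: 16 consecutive frames of 96 x 96 pixels (grayscale assumed here)
video = rng.standard_normal((16, 96, 96))
# audio: 140 features for each of 37 windows
audio = rng.standard_normal((37, 140))

def conv_stand_in(frames, dim=128):
    """Stand-in for a convolutional encoder: one feature vector per frame.
    A real model would use learned convolutions; a fixed random projection
    followed by ReLU keeps the sketch self-contained."""
    w = rng.standard_normal((frames.shape[1] * frames.shape[2], dim)) * 0.01
    flat = frames.reshape(frames.shape[0], -1)
    return np.maximum(flat @ w, 0.0)

def recurrent_stand_in(seq, hidden=64):
    """Stand-in for a recurrent encoder: a simple Elman-style recurrence
    over the sequence, returning the final hidden state."""
    w_in = rng.standard_normal((seq.shape[1], hidden)) * 0.01
    w_h = rng.standard_normal((hidden, hidden)) * 0.01
    h = np.zeros(hidden)
    for x in seq:
        h = np.tanh(x @ w_in + h @ w_h)
    return h

v = recurrent_stand_in(conv_stand_in(video))  # visual branch: CNN -> RNN
a = recurrent_stand_in(audio)                 # audio branch: RNN over 37 windows
fused = np.concatenate([v, a])                # late fusion of the two modalities

w_out = rng.standard_normal((fused.size, 7)) * 0.01
logits = fused @ w_out
probs = np.exp(logits) / np.exp(logits).sum() # softmax over the 7 emotion classes
# probs has shape (7,) and sums to 1
```

The 12% gain reported for audiovisual over visual-only recognition corresponds, in this sketch, to classifying from `fused` rather than from the visual branch `v` alone.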
Keywords:
artificial neural networks, deep learning, emotion recognition, video, speech signal.
Citation:
A. G. Shishkin, A. A. Moskvin, “Application of deep learning methods to recognize the emotional state of a person in a video image”, Artificial Intelligence and Decision Making, 2019, no. 2, 3–14
Linking options:
https://www.mathnet.ru/eng/iipr165
https://www.mathnet.ru/eng/iipr/y2019/i2/p3