Trudy SPIIRAN
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Informatics and Automation:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Trudy SPIIRAN, 2016, Issue 44, Pages 98–113
DOI: https://doi.org/10.15622/sp.44.7
(Mi trspy857)
 

This article is cited in 4 scientific papers (total in 4 papers)

Methods of Information Processing and Management

An Analysis of Perspectives for Using High-Speed Cameras in Processing Dynamic Video Information

D. V. Ivankoa, A. A. Karpovb

a ITMO University (Saint Petersburg National Research University of Information Technologies, Mechanics and Optics)
b St. Petersburg Institute for Informatics and Automation of the Russian Academy of Sciences (SPIIRAS)
Full-text PDF (987 kB) Citations (4)
Abstract: In this paper, we review the actual and perspective areas of use of high-speed video cameras. We discuss the possibility of applying high-speed cameras in the field of human-computer interaction to detect dynamic video information (including visual speech). We also describe main tasks, which can be solved with high-speed cameras, such as: automatic lip-reading, eye blink detection, facial micro-expression recognition, etc. We identify potential challenges associated with the introduction of high-speed video cameras and analyze the conditions of research area. Besides, we analyze state-of-the-art in the field at the moment and prove that there is an urgent need for further scientific and technical developments in this area. We propose some advanced applications and tasks in the human-computer interaction domain, where high-speed video capturing can be useful, such as audio-visual continuous speech recognition and automatic reading speech by lips. In further research, we will implement such a multimodal system for audio-visual Russian speech recognition using a microphone and a high-speed video camera JAI Pulnix.
Keywords: high-speed video camera; computer vision; audio-visual speech recognition; audio-visual data corpus; lip-reading; dynamic video information.
Funding agency Grant number
Russian Foundation for Basic Research 15-07-04415_a
Ministry of Education and Science of the Russian Federation МД-3035.2015.8
The research is financially supported by the Russian Foundation for Basic Research (Project No. 15-07-04415-a) and by the Council for Grants of the President of Russia (Project No. MD-3035.2015.8).
Bibliographic databases:
Document Type: Article
UDC: 004.5
Language: Russian
Citation: D. V. Ivanko, A. A. Karpov, “An Analysis of Perspectives for Using High-Speed Cameras in Processing Dynamic Video Information”, Tr. SPIIRAN, 44 (2016), 98–113
Citation in format AMSBIB
\Bibitem{IvaKar16}
\by D.~V.~Ivanko, A.~A.~Karpov
\paper An Analysis of Perspectives for Using High-Speed Cameras in Processing Dynamic Video Information
\jour Tr. SPIIRAN
\yr 2016
\vol 44
\pages 98--113
\mathnet{http://mi.mathnet.ru/trspy857}
\crossref{https://doi.org/10.15622/sp.44.7}
\elib{https://elibrary.ru/item.asp?id=25616420}
Linking options:
  • https://www.mathnet.ru/eng/trspy857
  • https://www.mathnet.ru/eng/trspy/v44/p98
  • This publication is cited in the following 4 articles:
    Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Informatics and Automation
    Statistics & downloads:
    Abstract page:161
    Full-text PDF :57
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024