S. A. Komkov, M. D. Dzabraev, A. A. Petiushko, “Mutual modality learning for video action classification”, Computer Optics, 47:4 (2023), 637

Computer Optics

Computer Optics, 2023, Volume 47, Issue 4, Pages 637–649
DOI: https://doi.org/10.18287/2412-6179-CO-1277 (Mi co1165)

This article is cited in 1 scientific paper (total in 1 paper)

IMAGE PROCESSING, PATTERN RECOGNITION

Mutual modality learning for video action classification

S. A. Komkov^ab, M. D. Dzabraev^ab, A. A. Petiushko^ab

^a Lomonosov Moscow State University
^b Huawei Moscow Research Center, 121099, Russia, Moscow, Smolenskaya ploshchad 7–9

Abstract: Models for video action classification are improving rapidly. However, their performance can still be easily improved by ensembling them with the same models trained on different modalities (e.g., optical flow). Unfortunately, using several modalities during inference is computationally expensive. Recent works examine ways to integrate the advantages of multi-modality into a single RGB model, yet there is still room for improvement. In this paper, we explore various methods to embed the ensemble power into a single model. We show that proper initialization, as well as mutual modality learning, enhances single-modality models. As a result, we achieve state-of-the-art results on the Something-Something-v2 benchmark.
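
The abstract does not spell out the training objective, so the following is only a minimal sketch of the two general ideas it names, under stated assumptions: a two-model ensemble that averages class probabilities, and a generic mutual-learning loss in which each model's cross-entropy is augmented with a KL term toward the other model's prediction. It is not taken from the paper; all function names (`mutual_losses`, `ensemble`, and so on) and the unweighted sum of the two loss terms are hypothetical choices made for illustration.

```python
import math

def softmax(logits):
    # Numerically stable softmax over a list of floats.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(probs, label):
    # Negative log-likelihood of the true class.
    return -math.log(probs[label])

def kl_divergence(p, q):
    # KL(p || q) for two discrete distributions with q > 0 everywhere.
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def mutual_losses(rgb_logits, flow_logits, label):
    # Assumed generic mutual-learning objective (not the paper's exact loss):
    # each model fits the label and is also pulled toward its peer's output.
    p_rgb = softmax(rgb_logits)
    p_flow = softmax(flow_logits)
    loss_rgb = cross_entropy(p_rgb, label) + kl_divergence(p_flow, p_rgb)
    loss_flow = cross_entropy(p_flow, label) + kl_divergence(p_rgb, p_flow)
    return loss_rgb, loss_flow

def ensemble(rgb_logits, flow_logits):
    # Two-modality ensemble: average the per-class probabilities.
    p_rgb = softmax(rgb_logits)
    p_flow = softmax(flow_logits)
    return [(a + b) / 2 for a, b in zip(p_rgb, p_flow)]

if __name__ == "__main__":
    # Made-up logits for a 3-class toy problem, true class 0.
    rgb, flow = [2.0, 0.5, -1.0], [1.0, 1.5, -0.5]
    print(mutual_losses(rgb, flow, label=0))
    print(ensemble(rgb, flow))
```

In this sketch the ensemble needs both models at inference time, whereas the mutual loss is used only during training, so a single model (for example, the RGB one) could be deployed alone afterwards. That is the trade-off the abstract describes.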

Keywords: video recognition, video action classification, video labeling, mutual learning, optical flow

Received: 13.01.2023
Accepted: 29.03.2023

Document Type: Article

Language: English

Citation: S. A. Komkov, M. D. Dzabraev, A. A. Petiushko, “Mutual modality learning for video action classification”, Computer Optics, 47:4 (2023), 637–649

Citation in AMSBIB format

\Bibitem{KomDzaPet23}
\by S.~A.~Komkov, M.~D.~Dzabraev, A.~A.~Petiushko
\paper Mutual modality learning for video action classification
\jour Computer Optics
\yr 2023
\vol 47
\issue 4
\pages 637--649
\mathnet{http://mi.mathnet.ru/co1165}
\crossref{https://doi.org/10.18287/2412-6179-CO-1277}

Linking options:

https://www.mathnet.ru/eng/co1165

https://www.mathnet.ru/eng/co/v47/i4/p637
