O. O. Petrova, K. B. Bulatov, V. V. Arlazarov, V. L. Arlazarov

Компьютерная оптика

RUS ENG

ЖУРНАЛЫ ПЕРСОНАЛИИ ОРГАНИЗАЦИИ КОНФЕРЕНЦИИ СЕМИНАРЫ ВИДЕОТЕКА ПАКЕТ AMSBIB

JavaScript is disabled in your browser. Please switch it on to enable full functionality of the website

	Общая информация
	Последний выпуск
	Архив
	Правила для авторов

	Поиск публикаций
	Поиск ссылок

	RSS
	Последний выпуск
	Текущие выпуски
	Архивные выпуски
	Что такое RSS

Компьютерная оптика:
Год:
Том:
Выпуск:
Страница:
	Найти

Персональный вход:
Логин:
Пароль:
	Запомнить пароль
	Войти
	Забыли пароль?
	Регистрация

Компьютерная оптика, 2021, том 45, выпуск 1, статья опубликована в англоязычной версии журнала
DOI: https://doi.org/10.18287/2412-6179-CO-795 (Mi co884)

Эта публикация цитируется в 11 научных статьях (всего в 11 статьях)

INTERNATIONAL CONFERENCE ON MACHINE VISION

Weighted combination of per-frame recognition results for text recognition in a video stream

O. O. Petrova^ab, K. B. Bulatov^bac, V. V. Arlazarov^ba, V. L. Arlazarov^acb

^a FRC CSC RAS, Moscow, Russia
^b Smart Engines Service LLC, Moscow, Russia
^c Moscow Institute of Physics and Technology (State University), Moscow, Russia

PDF полного текста (2666 kB) Список цитирования (11)

Список литературы:

PDF

HTML

DOI: https://doi.org/10.18287/2412-6179-CO-795

Аннотация: The scope of uses of automated document recognition has extended and as a result, recognition techniques that do not require specialized equipment have become more relevant. Among such techniques, document recognition using mobile devices is of interest. However, it is not always possible to ensure controlled capturing conditions and, consequentially, high quality of input images. Unlike specialized scanners, mobile cameras allow using a video stream as an input, thus obtaining several images of the recognized object, captured with various characteristics. In this case, a problem of combining the information from multiple input frames arises. In this paper, we propose a weighing model for the process of combining the per-frame recognition results, two approaches to the weighted combination of the text recognition results, and two weighing criteria. The effectiveness of the proposed approaches is tested using datasets of identity documents captured with a mobile device camera in different conditions, including perspective distortion of the document image and low lighting conditions. The experimental results show that the weighting combination can improve the text recognition result quality in the video stream, and the per-character weighting method with input image focus estimation as a base criterion allows one to achieve the best results on the datasets analyzed.

Ключевые слова: mobile OCR, video stream, anytime algorithms, weighted combination, ensemble methods.

Финансовая поддержка	Номер гранта
Российский фонд фундаментальных исследований	17-29-03236 офи_м 18-07-01387 а
This work is partially supported by the Russian Foundation for Basic Research (projects 17-29-03236 and 18-07-01387).

Поступила в редакцию: 03.08.2020
Принята в печать: 30.12.2020

Тип публикации: Статья

Язык публикации: английский

Образец цитирования: O. O. Petrova, K. B. Bulatov, V. V. Arlazarov, V. L. Arlazarov

Цитирование в формате AMSBIB

\RBibitem{PetBulArl21}

\by O.~O.~Petrova, K.~B.~Bulatov, V.~V.~Arlazarov, V.~L.~Arlazarov

\mathnet{http://mi.mathnet.ru/co884}

\crossref{https://doi.org/10.18287/2412-6179-CO-795}

Образцы ссылок на эту страницу:

https://www.mathnet.ru/rus/co884

Эта публикация цитируется в следующих 11 статьяx:

Citing articles in Google Scholar: Russian citations, English citations
Related articles in Google Scholar: Russian articles, English articles

Статистика просмотров:
Страница аннотации:	116
PDF полного текста:	53
Список литературы:	17

Что такое QR-код?

Обратная связь:

Пользовательское соглашение

Регистрация посетителей портала

Логотипы