R. Sánchez-Rivero, P. V. Bezmaternykh, A. V. Gayer, A. Morales-González, F. J. Silva-Mata, K. B. Bulatov, “A joint study of deep learning-based methods for identity document image binarization and its influence on attribute recognition”, Компьютерная оптика, 47:4 (2023), 627

Компьютерная оптика

RUS ENG

ЖУРНАЛЫ ПЕРСОНАЛИИ ОРГАНИЗАЦИИ КОНФЕРЕНЦИИ СЕМИНАРЫ ВИДЕОТЕКА ПАКЕТ AMSBIB

JavaScript is disabled in your browser. Please switch it on to enable full functionality of the website

	Общая информация
	Последний выпуск
	Архив
	Правила для авторов

	Поиск публикаций
	Поиск ссылок

	RSS
	Последний выпуск
	Текущие выпуски
	Архивные выпуски
	Что такое RSS

Компьютерная оптика:
Год:
Том:
Выпуск:
Страница:
	Найти

Персональный вход:
Логин:
Пароль:
	Запомнить пароль
	Войти
	Забыли пароль?
	Регистрация

Компьютерная оптика, 2023, том 47, выпуск 4, страницы 627–636
DOI: https://doi.org/10.18287/2412-6179-CO-1207 (Mi co1164)

ОБРАБОТКА ИЗОБРАЖЕНИЙ, РАСПОЗНАВАНИЕ ОБРАЗОВ

A joint study of deep learning-based methods for identity document image binarization and its influence on attribute recognition

R. Sánchez-Rivero^a, P. V. Bezmaternykh^bc, A. V. Gayer^bc, A. Morales-González^a, F. J. Silva-Mata^a, K. B. Bulatov^bc

^a Advanced Technologies Application Center (CENATAV), Playa P.C.12200, Havana, Cuba, 7A, #21406 Siboney
^b Federal Research Center "Computer Science and Control" of Russian Academy of Sciences, Moscow
^c Smart Engines Service LLC, Moscow

PDF полного текста (1123 kB)

Список литературы:

PDF

HTML

DOI: https://doi.org/10.18287/2412-6179-CO-1207

Аннотация: Text recognition has benefited considerably from deep learning research, as well as the preprocessing methods included in its workflow. Identity documents are critical in the field of document analysis and should be thoroughly researched in relation to this workflow. We propose to examine the link between deep learning-based binarization and recognition algorithms for this sort of documents on the MIDV-500 and MIDV-2020 datasets. We provide a series of experiments to illustrate the relation between the quality of the collected images with respect to the binarization results, as well as the influence of its output on final recognition performance. We show that deep learning-based binarization solutions are affected by the capture quality, which implies that they still need significant improvements. We also show that proper binarization results can improve the performance for many recognition methods. Our retrained U-Net-bin outperformed all other binarization methods, and the best result in recognition was obtained by Paddle Paddle OCR v2.

Ключевые слова: document image binarization, identity document recognition, optical character recognition, deep learning, U-Net architecture

Поступила в редакцию: 13.09.2022
Принята в печать: 20.02.2023

Тип публикации: Статья

Язык публикации: английский

Образец цитирования: R. Sánchez-Rivero, P. V. Bezmaternykh, A. V. Gayer, A. Morales-González, F. J. Silva-Mata, K. B. Bulatov, “A joint study of deep learning-based methods for identity document image binarization and its influence on attribute recognition”, Компьютерная оптика, 47:4 (2023), 627–636

Цитирование в формате AMSBIB

\RBibitem{SanBezGay23}

\by R.~S{\' a}nchez-Rivero, P.~V.~Bezmaternykh, A.~V.~Gayer, A.~Morales-Gonz{\' a}lez, F.~J.~Silva-Mata, K.~B.~Bulatov

\paper A joint study of deep learning-based methods for identity document image binarization and its influence on attribute recognition

\jour Компьютерная оптика

\yr 2023

\vol 47

\issue 4

\pages 627--636

\mathnet{http://mi.mathnet.ru/co1164}

\crossref{https://doi.org/10.18287/2412-6179-CO-1207}

Образцы ссылок на эту страницу:

https://www.mathnet.ru/rus/co1164

https://www.mathnet.ru/rus/co/v47/i4/p627

Citing articles in Google Scholar: Russian citations, English citations
Related articles in Google Scholar: Russian articles, English articles

Статистика просмотров:
Страница аннотации:	9
PDF полного текста:	2
Список литературы:	1

Что такое QR-код?

Обратная связь:

Пользовательское соглашение

Регистрация посетителей портала

Логотипы