V. V. Arlazarov, E. I. Andreeva, K. B. Bulatov, D. P. Nikolaev, O. O. Petrova, B. I. Savelev, O. A. Slavin, “Document image analysis and recognition: a survey”, Компьютерная оптика, 46:4 (2022), 567

Компьютерная оптика (Computer Optics), 2022, Volume 46, Issue 4, Pages 567–589
DOI: https://doi.org/10.18287/2412-6179-CO-1020 (Mi co1048)

This publication is cited in 5 scientific papers (5 papers in total)

IMAGE PROCESSING, PATTERN RECOGNITION

Document image analysis and recognition: a survey

V. V. Arlazarov^ab, E. I. Andreeva^b, K. B. Bulatov^ab, D. P. Nikolaev^c, O. O. Petrova^b, B. I. Savelev^b, O. A. Slavin^a

^a Federal Research Center "Computer Science and Control" of Russian Academy of Sciences, Moscow
^b Smart Engines Service LLC, Moscow
^c Institute for Information Transmission Problems of the Russian Academy of Sciences (Kharkevich Institute), Moscow

Abstract: This paper analyzes the problems of document image recognition and the existing solutions. Document recognition algorithms have been studied for a long time, yet the topic remains relevant and research continues, as evidenced by the large number of associated publications and reviews. Most of these works and reviews, however, are devoted to individual recognition tasks. This review considers the entire set of methods, approaches, and algorithms necessary for document recognition. A preliminary systematization allowed us to distinguish groups of methods for extracting information from documents of different types: single-page and multi-page, with printed and handwritten content, with a fixed template and with a flexible structure, and digitized in different ways: by scanning, photographing, or video recording. We consider methods of document recognition and analysis applied to a wide range of tasks: identification and verification of identity, due diligence, machine learning algorithms, questionnaires, and audits. The groups of methods necessary for recognizing a single page image are examined: classical computer vision algorithms (keypoints, local feature descriptors, the Fast Hough Transform, image binarization) and modern neural network models for document boundary detection, document classification, document structure analysis (localization of text blocks and tables), extraction and recognition of document details, and post-processing of recognition results. The review describes publicly available datasets for training and testing recognition algorithms. Methods for optimizing the performance of document image analysis and recognition algorithms are also described.

Keywords: document recognition, image normalization, binarization, local features, segmentation, document boundary detection, artificial neural network, information extraction, document sorting, document comparison, video sequence recognition

Funding agency	Grant number
Russian Foundation for Basic Research	20-17-50177
This research was funded by the Russian Foundation for Basic Research (RFBR), project number 20-17-50177.

Received: 05.08.2021
Accepted: 26.10.2021

Publication type: Article

Language: English

Citation: V. V. Arlazarov, E. I. Andreeva, K. B. Bulatov, D. P. Nikolaev, O. O. Petrova, B. I. Savelev, O. A. Slavin, “Document image analysis and recognition: a survey”, Компьютерная оптика, 46:4 (2022), 567–589

Citation in AMSBIB format

\RBibitem{ArlAndBul22}
\by V.~V.~Arlazarov, E.~I.~Andreeva, K.~B.~Bulatov, D.~P.~Nikolaev, O.~O.~Petrova, B.~I.~Savelev, O.~A.~Slavin
\paper Document image analysis and recognition: a survey
\jour Компьютерная оптика
\yr 2022
\vol 46
\issue 4
\pages 567--589
\mathnet{http://mi.mathnet.ru/co1048}
\crossref{https://doi.org/10.18287/2412-6179-CO-1020}

Links to this page:

https://www.mathnet.ru/rus/co1048

https://www.mathnet.ru/rus/co/v46/i4/p567
