Computer Optics
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Computer Optics:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Computer Optics, 2022, Volume 46, Issue 4, Pages 567–589
DOI: https://doi.org/10.18287/2412-6179-CO-1020
(Mi co1048)
 

This article is cited in 5 scientific papers (total in 5 papers)

IMAGE PROCESSING, PATTERN RECOGNITION

Document image analysis and recognition: a survey

V. V. Arlazarovab, E. I. Andreevab, K. B. Bulatovab, D. P. Nikolaevc, O. O. Petrovab, B. I. Savelevb, O. A. Slavina

a Federal Research Center "Computer Science and Control" of Russian Academy of Sciences, Moscow
b Smart Engines Service LLC, Moscow
c Institute for Information Transmission Problems of the Russian Academy of Sciences (Kharkevich Institute), Moscow
Abstract: This paper analyzes the problems of document image recognition and the existing solutions. Document recognition algorithms have been studied for quite a long time, but despite this, currently, the topic is relevant and research continues, as evidenced by a large number of associated publications and reviews. However, most of these works and reviews are devoted to individual recognition tasks. In this review, the entire set of methods, approaches, and algorithms necessary for document recognition is considered. A preliminary systematization allowed us to distinguish groups of methods for extracting information from documents of different types: single-page and multi-page, with text and handwritten contents, with a fixed template and flexible structure, and digitalized via different ways: scanning, photographing, video recording. Here, we consider methods of document recognition and analysis applied to a wide range of tasks: identification and verification of identity, due diligence, machine learning algorithms, questionnaires, and audits. The groups of methods necessary for the recognition of a single page image are examined: the classical computer vision algorithms, i.e., keypoints, local feature descriptors, Fast Hough Transforms, image binarization, and modern neural network models for document boundary detection, document classification, document structure analysis, i.e., text blocks and tables localization, extraction and recognition of the details, post-processing of recognition results. The review provides a description of publicly available experimental data packages for training and testing recognition algorithms. Methods for optimizing the performance of document image analysis and recognition methods are described.
Keywords: document recognition, image normalization, binarization, local features, segmentation, document boundary detection, artificial neural network, information extraction, document sorting, document comparison, video sequence recognition
Funding agency Grant number
Russian Foundation for Basic Research 20-17-50177
The reported study was funded by RFBR, project number 20-17-50177.
Received: 05.08.2021
Accepted: 26.10.2021
Document Type: Article
Language: English
Citation: V. V. Arlazarov, E. I. Andreeva, K. B. Bulatov, D. P. Nikolaev, O. O. Petrova, B. I. Savelev, O. A. Slavin, “Document image analysis and recognition: a survey”, Computer Optics, 46:4 (2022), 567–589
Citation in format AMSBIB
\Bibitem{ArlAndBul22}
\by V.~V.~Arlazarov, E.~I.~Andreeva, K.~B.~Bulatov, D.~P.~Nikolaev, O.~O.~Petrova, B.~I.~Savelev, O.~A.~Slavin
\paper Document image analysis and recognition: a survey
\jour Computer Optics
\yr 2022
\vol 46
\issue 4
\pages 567--589
\mathnet{http://mi.mathnet.ru/co1048}
\crossref{https://doi.org/10.18287/2412-6179-CO-1020}
Linking options:
  • https://www.mathnet.ru/eng/co1048
  • https://www.mathnet.ru/eng/co/v46/i4/p567
  • This publication is cited in the following 5 articles:
    Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Computer Optics
    Statistics & downloads:
    Abstract page:8
    Full-text PDF :18
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024