Computer Optics
Computer Optics, 2023, Volume 47, Issue 4, Pages 627–636
DOI: https://doi.org/10.18287/2412-6179-CO-1207
(Mi co1164)
 

IMAGE PROCESSING, PATTERN RECOGNITION

A joint study of deep learning-based methods for identity document image binarization and its influence on attribute recognition

R. Sánchez-Rivero (a), P. V. Bezmaternykh (b, c), A. V. Gayer (b, c), A. Morales-González (a), F. J. Silva-Mata (a), K. B. Bulatov (b, c)

a Advanced Technologies Application Center (CENATAV), 7A, #21406 Siboney, Playa, P.C. 12200, Havana, Cuba
b Federal Research Center "Computer Science and Control" of Russian Academy of Sciences, Moscow
c Smart Engines Service LLC, Moscow
Abstract: Text recognition has benefited considerably from deep learning research, as have the preprocessing methods included in its workflow. Identity documents are a critical case in document analysis and deserve thorough study in the context of this workflow. We examine the link between deep learning-based binarization and recognition algorithms for this kind of document on the MIDV-500 and MIDV-2020 datasets. We present a series of experiments that illustrate how the quality of the captured images relates to the binarization results, and how the binarization output influences final recognition performance. We show that deep learning-based binarization solutions are affected by capture quality, which implies that they still need significant improvement. We also show that proper binarization can improve the performance of many recognition methods. Our retrained U-Net-bin outperformed all other binarization methods, and the best recognition result was obtained by PaddleOCR v2.
Keywords: document image binarization, identity document recognition, optical character recognition, deep learning, U-Net architecture
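To make the notion of binarization and its evaluation concrete, the following is a minimal sketch only. It uses the classical Otsu global threshold as a stand-in for a binarization method (it is not the U-Net-bin network studied in the paper), and it scores the result with a pixel-level F-measure, which is an assumed metric chosen for illustration. The function names `otsu_threshold`, `binarize`, and `pixel_f_measure` are hypothetical.

```python
import numpy as np


def otsu_threshold(gray: np.ndarray) -> int:
    """Return the Otsu threshold of an 8-bit grayscale image.

    Classical baseline used here for illustration; NOT the paper's method.
    """
    hist = np.bincount(gray.ravel(), minlength=256).astype(np.float64)
    prob = hist / hist.sum()
    omega = np.cumsum(prob)                  # probability of the dark class up to t
    mu = np.cumsum(prob * np.arange(256))    # cumulative mean up to t
    mu_total = mu[-1]

    # Between-class variance; skip thresholds where one class is empty.
    denom = omega * (1.0 - omega)
    sigma_b = np.zeros(256)
    valid = denom > 0
    sigma_b[valid] = (mu_total * omega[valid] - mu[valid]) ** 2 / denom[valid]
    return int(np.argmax(sigma_b))


def binarize(gray: np.ndarray) -> np.ndarray:
    """Return a boolean mask that is True where a pixel is classified as text.

    Assumes dark text on a lighter background.
    """
    return gray <= otsu_threshold(gray)


def pixel_f_measure(pred: np.ndarray, truth: np.ndarray) -> float:
    """Pixel-level F-measure between predicted and ground-truth text masks."""
    tp = np.logical_and(pred, truth).sum()
    fp = np.logical_and(pred, ~truth).sum()
    fn = np.logical_and(~pred, truth).sum()
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return float(2 * precision * recall / (precision + recall))


if __name__ == "__main__":
    # Synthetic "document": light background with one dark text-like block.
    gray = np.full((20, 20), 200, dtype=np.uint8)
    gray[5:10, 5:15] = 50
    truth = gray < 100
    print(pixel_f_measure(binarize(gray), truth))
```

On a clean synthetic image such as the one above, a global threshold separates text from background without error; degraded captures (glare, blur, perspective distortion) are the harder case where capture quality affects binarization.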
Received: 13.09.2022
Accepted: 20.02.2023
Document Type: Article
Language: English
Citation: R. Sánchez-Rivero, P. V. Bezmaternykh, A. V. Gayer, A. Morales-González, F. J. Silva-Mata, K. B. Bulatov, “A joint study of deep learning-based methods for identity document image binarization and its influence on attribute recognition”, Computer Optics, 47:4 (2023), 627–636
Citation in format AMSBIB
\Bibitem{SanBezGay23}
\by R.~S{\' a}nchez-Rivero, P.~V.~Bezmaternykh, A.~V.~Gayer, A.~Morales-Gonz{\' a}lez, F.~J.~Silva-Mata, K.~B.~Bulatov
\paper A joint study of deep learning-based methods for identity document image binarization and its influence on attribute recognition
\jour Computer Optics
\yr 2023
\vol 47
\issue 4
\pages 627--636
\mathnet{http://mi.mathnet.ru/co1164}
\crossref{https://doi.org/10.18287/2412-6179-CO-1207}
Linking options:
  • https://www.mathnet.ru/eng/co1164
  • https://www.mathnet.ru/eng/co/v47/i4/p627