Vestnik Yuzhno-Ural'skogo Universiteta. Seriya Matematicheskoe Modelirovanie i Programmirovanie
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive
Submit a manuscript

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Vestnik YuUrGU. Ser. Mat. Model. Progr.:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Vestnik Yuzhno-Ural'skogo Universiteta. Seriya Matematicheskoe Modelirovanie i Programmirovanie, 2022, Volume 15, Issue 4, Pages 80–89
DOI: https://doi.org/10.14529/mmp220407
(Mi vyuru663)
 

This article is cited in 2 scientific papers (total in 2 papers)

Programming & Computer Software

Method for analyzing the structure of noisy images of administrative documents

O. A. Slavina, E. L. Pliskinab

a Federal Research Center “Computer Science and Control” of the Russian Academy
b LLC “Smart Engines Service”, Moscow, Russian Federation
Full-text PDF (219 kB) Citations (2)
References:
Abstract: The problem of extracting content elements (fields) from the images of administrative documents via descriptions of anchoring elements is considered. Administrative documents contain static elements and content elements (filled information). The static objects of the document model are the lines of the document structure and the words. Sets of objects united by properties and relationships are described. The text descriptor can contain attributes that distinguish it from similar descriptors. We suggest using combined descriptors of line segments and words. We showed experimentally that the extraction of object sets improves the recognition accuracy of the document fields by 17% and the accuracy of information extraction by 16%. For optical character recognition, we employed SDK Smart Document Engine in the experiment.
Keywords: noisy image, document recognition, special text point, descriptor.
Funding agency Grant number
Russian Foundation for Basic Research 20-07-00934
The research was supported by the Russian Foundation for Basic Research (Project 20-07-00934 “Development, properties study and justification of anytime algorithms for computed tomography”).
Received: 15.09.2022
Document Type: Article
UDC: 004.932.72'1
MSC: 90C35, 90C27
Language: English
Citation: O. A. Slavin, E. L. Pliskin, “Method for analyzing the structure of noisy images of administrative documents”, Vestnik YuUrGU. Ser. Mat. Model. Progr., 15:4 (2022), 80–89
Citation in format AMSBIB
\Bibitem{SlaPli22}
\by O.~A.~Slavin, E.~L.~Pliskin
\paper Method for analyzing the structure of noisy images of administrative documents
\jour Vestnik YuUrGU. Ser. Mat. Model. Progr.
\yr 2022
\vol 15
\issue 4
\pages 80--89
\mathnet{http://mi.mathnet.ru/vyuru663}
\crossref{https://doi.org/10.14529/mmp220407}
Linking options:
  • https://www.mathnet.ru/eng/vyuru663
  • https://www.mathnet.ru/eng/vyuru/v15/i4/p80
  • This publication is cited in the following 2 articles:
    Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Statistics & downloads:
    Abstract page:50
    Full-text PDF :10
    References:11
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024