Computer Optics
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Computer Optics:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Computer Optics, 2021, Volume 45, Issue 1, paper published in the English version journal
DOI: https://doi.org/10.18287/2412-6179-CO-795
(Mi co884)
 

This article is cited in 11 scientific papers (total in 11 papers)

INTERNATIONAL CONFERENCE ON MACHINE VISION

Weighted combination of per-frame recognition results for text recognition in a video stream

O. O. Petrovaab, K. B. Bulatovbac, V. V. Arlazarovba, V. L. Arlazarovacb

a FRC CSC RAS, Moscow, Russia
b Smart Engines Service LLC, Moscow, Russia
c Moscow Institute of Physics and Technology (State University), Moscow, Russia
References:
Abstract: The scope of uses of automated document recognition has extended and as a result, recognition techniques that do not require specialized equipment have become more relevant. Among such techniques, document recognition using mobile devices is of interest. However, it is not always possible to ensure controlled capturing conditions and, consequentially, high quality of input images. Unlike specialized scanners, mobile cameras allow using a video stream as an input, thus obtaining several images of the recognized object, captured with various characteristics. In this case, a problem of combining the information from multiple input frames arises. In this paper, we propose a weighing model for the process of combining the per-frame recognition results, two approaches to the weighted combination of the text recognition results, and two weighing criteria. The effectiveness of the proposed approaches is tested using datasets of identity documents captured with a mobile device camera in different conditions, including perspective distortion of the document image and low lighting conditions. The experimental results show that the weighting combination can improve the text recognition result quality in the video stream, and the per-character weighting method with input image focus estimation as a base criterion allows one to achieve the best results on the datasets analyzed.
Keywords: mobile OCR, video stream, anytime algorithms, weighted combination, ensemble methods.
Funding agency Grant number
Russian Foundation for Basic Research 17-29-03236 офи_м
18-07-01387 а
This work is partially supported by the Russian Foundation for Basic Research (projects 17-29-03236 and 18-07-01387).
Received: 03.08.2020
Accepted: 30.12.2020
Document Type: Article
Language: English
Citation: O. O. Petrova, K. B. Bulatov, V. V. Arlazarov, V. L. Arlazarov
Citation in format AMSBIB
\Bibitem{PetBulArl21}
\by O.~O.~Petrova, K.~B.~Bulatov, V.~V.~Arlazarov, V.~L.~Arlazarov
\mathnet{http://mi.mathnet.ru/co884}
\crossref{https://doi.org/10.18287/2412-6179-CO-795}
Linking options:
  • https://www.mathnet.ru/eng/co884
  • This publication is cited in the following 11 articles:
    Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Computer Optics
    Statistics & downloads:
    Abstract page:107
    Full-text PDF :47
    References:16
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024