|
Analysis of textual and graphical information
Methods for combining multiple text recognition results
V. V. Arlazarovab a Federal Research Center "Computer Science and Control" of Russian Academy of Sciences, Moscow, Russia
b Smart Engines Service LLC, Moscow, Russia
Abstract:
The task of per-frame combination of text recognition results from multiple images is an important component of video stream document recognition systems. Currently there is no unified approach to solving this problem which would yield a high precision of text recognition. In this paper a comparative study is presented of known approaches to the combination of recognition results for identity document fields. It was demonstrated that different approaches are advantageous on different parts of the data sets, while a sepection of the potential best single result can still significantly outperform all the analyzed methods.
Keywords:
text recognition, document analysis, video stream recognition, combination methods, OCR, image processing.
Citation:
V. V. Arlazarov, “Methods for combining multiple text recognition results”, Artificial Intelligence and Decision Making, 2022, no. 3, 106–116; Scientific and Technical Information Processing, 50:5 (2023), 368–375
Linking options:
https://www.mathnet.ru/eng/iipr75 https://www.mathnet.ru/eng/iipr/y2022/i3/p106
|
Statistics & downloads: |
Abstract page: | 16 | Full-text PDF : | 17 |
|