|
This article is cited in 1 scientific paper (total in 1 paper)
INTELLIGENT SYSTEMS AND TECHNOLOGIES
Object descriptors for linking structural elements of noisy document images
O. A. Slavinab a Federal State Institution "Federal Research Center" Informatics and Management of the Russian Academy of Sciences
b "Smart Engines"
Abstract:
The problem of extracting filling elements (fields) from a recognized image of a document with the help of descriptors – descriptions of one or more structural elements is considered. Structural elements can be words of static text and scribble lines used to shape the design of a document. Business documents with a simplified structure and a limited vocabulary are considered. Flexible business documents that allow significant modifications to the page design are considered. Descriptors are created taking into account a significant number of possible errors in document page recognition. Combined descriptors consisting of several terms and line segments are described. A binding algorithm based on descriptors is given. It is experimentally shown that the extraction of combined descriptors improves the accuracy of recognition of document fields during recognition by 17%, and the accuracy of extracting information from the document image by 16%. The SDK Smart Document Engine was used as OCR in the experiment.
Keywords:
virtual reality, augmented reality, virtual reality helmet, immersiveness, virtual object, heptic technologies, content.
Citation:
O. A. Slavin, “Object descriptors for linking structural elements of noisy document images”, Informatsionnye Tekhnologii i Vychslitel'nye Sistemy, 2022, no. 4, 13–24
Linking options:
https://www.mathnet.ru/eng/itvs782 https://www.mathnet.ru/eng/itvs/y2022/i4/p13
|
Statistics & downloads: |
Abstract page: | 25 | Full-text PDF : | 1 | First page: | 1 |
|