|
Trudy SPIIRAN, 2011, Issue 16, Pages 110–122
(Mi trspy420)
|
|
|
|
Segmentation methods of OCR systems in problems of automatic processing of archival documents
S. V. Kuleshov, S. Smirnov St. Petersburg Institute for Informatics and Automation of RAS
Abstract:
This paper describes the comparison of the modern optical character recognition systems aimed to find the systems, which do more precise segmentation, and to detect the capabilities of systems to allocate different types of areas. The results of the segmentation methods of OCR systems are analyzed. The effectiveness of the process of segmentation is evaluated. Based on the results of studies and observations made, recommendations to use for different types of documents are made.
Keywords:
optical character recognition, segmentation, OCR systems, document layout analysis, digitization of archival documents.
Received: 24.01.2011
Citation:
S. V. Kuleshov, S. Smirnov, “Segmentation methods of OCR systems in problems of automatic processing of archival documents”, Tr. SPIIRAN, 16 (2011), 110–122
Linking options:
https://www.mathnet.ru/eng/trspy420 https://www.mathnet.ru/eng/trspy/v16/p110
|
Statistics & downloads: |
Abstract page: | 237 | Full-text PDF : | 159 | References: | 37 | First page: | 1 |
|