|
Эта публикация цитируется в 30 научных статьях (всего в 30 статьях)
ОБРАБОТКА ИЗОБРАЖЕНИЙ, РАСПОЗНАВАНИЕ ОБРАЗОВ
U-Net-bin: hacking the document image binarization contest
P. V. Bezmaternykhab, D. A. Ilina, D. P. Nikolaevca a Smart Engines Service LLC, 117312, Moscow, Russia
b Federal Research Center "Computer Science and Control" of RAS, 117312, Moscow, Russia
c Institute for Information Transmission Problems of RAS, 127051, Moscow, Russia
Аннотация:
Image binarization is still a challenging task in a variety of applications. In particular, Document Image Binarization Contest (DIBCO) is organized regularly to track the state-of-the-art techniques for the historical document binarization. In this work we present a binarization method that was ranked first in the DIBCO'17 contest. It is a convolutional neural network (CNN) based method which uses U-Net architecture, originally designed for biomedical image segmentation. We describe our approach to training data preparation and contest ground truth examination and provide multiple insights on its construction (so called hacking). It led to more accurate historical document binarization problem statement with respect to the challenges one could face in the open access datasets. A docker container with the final network along with all the supplementary data we used in the training process has been published on Github.
Ключевые слова:
historical document processing, binarization, DIBCO, deep learning, U-Net architecture, training dataset augmentation, document analysis.
Поступила в редакцию: 20.06.2019 Принята в печать: 01.08.2019
Образец цитирования:
P. V. Bezmaternykh, D. A. Ilin, D. P. Nikolaev, “U-Net-bin: hacking the document image binarization contest”, Компьютерная оптика, 43:5 (2019), 825–832
Образцы ссылок на эту страницу:
https://www.mathnet.ru/rus/co709 https://www.mathnet.ru/rus/co/v43/i5/p825
|
Статистика просмотров: |
Страница аннотации: | 201 | PDF полного текста: | 82 | Список литературы: | 24 |
|