Computer Optics
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Computer Optics:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Computer Optics, 2019, Volume 43, Issue 5, Pages 825–832
DOI: https://doi.org/10.18287/2412-6179-2019-43-5-825-832
(Mi co709)
 

This article is cited in 30 scientific papers (total in 30 papers)

IMAGE PROCESSING, PATTERN RECOGNITION

U-Net-bin: hacking the document image binarization contest

P. V. Bezmaternykhab, D. A. Ilina, D. P. Nikolaevca

a Smart Engines Service LLC, 117312, Moscow, Russia
b Federal Research Center "Computer Science and Control" of RAS, 117312, Moscow, Russia
c Institute for Information Transmission Problems of RAS, 127051, Moscow, Russia
References:
Abstract: Image binarization is still a challenging task in a variety of applications. In particular, Document Image Binarization Contest (DIBCO) is organized regularly to track the state-of-the-art techniques for the historical document binarization. In this work we present a binarization method that was ranked first in the DIBCO'17 contest. It is a convolutional neural network (CNN) based method which uses U-Net architecture, originally designed for biomedical image segmentation. We describe our approach to training data preparation and contest ground truth examination and provide multiple insights on its construction (so called hacking). It led to more accurate historical document binarization problem statement with respect to the challenges one could face in the open access datasets. A docker container with the final network along with all the supplementary data we used in the training process has been published on Github.
Keywords: historical document processing, binarization, DIBCO, deep learning, U-Net architecture, training dataset augmentation, document analysis.
Funding agency Grant number
Russian Foundation for Basic Research 17-29-07092 а
17-29-07093 а
The work was partially funded by Russian Foundation for Basic Research (projects 17-29-07092 and 17-29-07093).
Received: 20.06.2019
Accepted: 01.08.2019
Document Type: Article
Language: English
Citation: P. V. Bezmaternykh, D. A. Ilin, D. P. Nikolaev, “U-Net-bin: hacking the document image binarization contest”, Computer Optics, 43:5 (2019), 825–832
Citation in format AMSBIB
\Bibitem{BezIliNik19}
\by P.~V.~Bezmaternykh, D.~A.~Ilin, D.~P.~Nikolaev
\paper U-Net-bin: hacking the document image binarization contest
\jour Computer Optics
\yr 2019
\vol 43
\issue 5
\pages 825--832
\mathnet{http://mi.mathnet.ru/co709}
\crossref{https://doi.org/10.18287/2412-6179-2019-43-5-825-832}
Linking options:
  • https://www.mathnet.ru/eng/co709
  • https://www.mathnet.ru/eng/co/v43/i5/p825
  • This publication is cited in the following 30 articles:
    Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Computer Optics
    Statistics & downloads:
    Abstract page:194
    Full-text PDF :78
    References:23
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024