P. V. Bezmaternykh, D. A. Ilin, D. P. Nikolaev, “U-Net-bin: hacking the document image binarization contest”, Компьютерная оптика, 43:5 (2019), 825

Компьютерная оптика

RUS ENG

ЖУРНАЛЫ ПЕРСОНАЛИИ ОРГАНИЗАЦИИ КОНФЕРЕНЦИИ СЕМИНАРЫ ВИДЕОТЕКА ПАКЕТ AMSBIB

JavaScript is disabled in your browser. Please switch it on to enable full functionality of the website

	Общая информация
	Последний выпуск
	Архив
	Правила для авторов

	Поиск публикаций
	Поиск ссылок

	RSS
	Последний выпуск
	Текущие выпуски
	Архивные выпуски
	Что такое RSS

Компьютерная оптика:
Год:
Том:
Выпуск:
Страница:
	Найти

Персональный вход:
Логин:
Пароль:
	Запомнить пароль
	Войти
	Забыли пароль?
	Регистрация

Компьютерная оптика, 2019, том 43, выпуск 5, страницы 825–832
DOI: https://doi.org/10.18287/2412-6179-2019-43-5-825-832 (Mi co709)

Эта публикация цитируется в 30 научных статьях (всего в 30 статьях)

ОБРАБОТКА ИЗОБРАЖЕНИЙ, РАСПОЗНАВАНИЕ ОБРАЗОВ

U-Net-bin: hacking the document image binarization contest

P. V. Bezmaternykh^ab, D. A. Ilin^a, D. P. Nikolaev^ca

^a Smart Engines Service LLC, 117312, Moscow, Russia
^b Federal Research Center "Computer Science and Control" of RAS, 117312, Moscow, Russia
^c Institute for Information Transmission Problems of RAS, 127051, Moscow, Russia

PDF полного текста (3136 kB) Список цитирования (30)

Список литературы:

PDF

HTML

DOI: https://doi.org/10.18287/2412-6179-2019-43-5-825-832

Аннотация: Image binarization is still a challenging task in a variety of applications. In particular, Document Image Binarization Contest (DIBCO) is organized regularly to track the state-of-the-art techniques for the historical document binarization. In this work we present a binarization method that was ranked first in the DIBCO'17 contest. It is a convolutional neural network (CNN) based method which uses U-Net architecture, originally designed for biomedical image segmentation. We describe our approach to training data preparation and contest ground truth examination and provide multiple insights on its construction (so called hacking). It led to more accurate historical document binarization problem statement with respect to the challenges one could face in the open access datasets. A docker container with the final network along with all the supplementary data we used in the training process has been published on Github.

Ключевые слова: historical document processing, binarization, DIBCO, deep learning, U-Net architecture, training dataset augmentation, document analysis.

Финансовая поддержка	Номер гранта
Российский фонд фундаментальных исследований	17-29-07092 а 17-29-07093 а

Поступила в редакцию: 20.06.2019
Принята в печать: 01.08.2019

Тип публикации: Статья

Язык публикации: английский

Образец цитирования: P. V. Bezmaternykh, D. A. Ilin, D. P. Nikolaev, “U-Net-bin: hacking the document image binarization contest”, Компьютерная оптика, 43:5 (2019), 825–832

Цитирование в формате AMSBIB

\RBibitem{BezIliNik19}

\by P.~V.~Bezmaternykh, D.~A.~Ilin, D.~P.~Nikolaev

\paper U-Net-bin: hacking the document image binarization contest

\jour Компьютерная оптика

\yr 2019

\vol 43

\issue 5

\pages 825--832

\mathnet{http://mi.mathnet.ru/co709}

\crossref{https://doi.org/10.18287/2412-6179-2019-43-5-825-832}

Образцы ссылок на эту страницу:

https://www.mathnet.ru/rus/co709

https://www.mathnet.ru/rus/co/v43/i5/p825

Эта публикация цитируется в следующих 30 статьяx:

Citing articles in Google Scholar: Russian citations, English citations
Related articles in Google Scholar: Russian articles, English articles

Статистика просмотров:
Страница аннотации:	201
PDF полного текста:	82
Список литературы:	24

Что такое QR-код?

Обратная связь:

Пользовательское соглашение

Регистрация посетителей портала

Логотипы