|
PATTERN RECOGNITION
Achieving statistical dependence of the CNN response on the input data distortion for OCR problem
I. M. Janiszewskia, V. V. Arlazarovbcd, D. G. Sluginba a Federal Research Center Computer Science and Control of Russian Academy of Sciences, Moscow, Russia
b Smart Engines Service LLC, Moscow, Russia
c IInstitute for Information Transmission Problems of Russian Academy of Sciences, Moscow, Russia
d Moscow Institute of Physics and Technology (State University), Moscow, Russia
Abstract:
The paper proposes an approach to training a convolutional neural network using information on the level of distortion of input data. The learning process is modified with an additional layer, which is subsequently deleted, so the architecture of the original network does not change. OCR of data based on the MNIST dataset distorted with Gaussian blur using LeNet5 architecture network is considered. This approach does not have quality loss of the network and has a significant error-free zone in responses on the test data which is absent in the traditional approach to training. The responses are statistically dependent on the level of input image’s distortions and there is a presence of a strong relationship between them.
Keywords:
Convolutional neural networks, pattern recognition, machine learning, distortion, Gaussian blur, OCR, MNIST.
Citation:
I. M. Janiszewski, V. V. Arlazarov, D. G. Slugin, “Achieving statistical dependence of the CNN response on the input data distortion for OCR problem”, Informatsionnye Tekhnologii i Vychslitel'nye Sistemy, 2019, no. 4, 94–101
Linking options:
https://www.mathnet.ru/eng/itvs366 https://www.mathnet.ru/eng/itvs/y2019/i4/p94
|
Statistics & downloads: |
Abstract page: | 79 | Full-text PDF : | 40 |
|