|
Loss functions for train document image segmentation models
A. I. Perminovab, D. Yu. Turdakovab, O. V. Belyaevaa a Ivannikov Institute for System Programming of the RAS
b Lomonosov Moscow State University
Abstract:
The work is devoted to improving the quality of the results of image segmentation of documents of various scientific articles and legal acts by neural network models by learning using modified loss functions that take into account the features of images of the selected subject area. The analysis of existing loss functions is carried out, as well as the development of new functions that operate both with the coordinates of the bounding boxes and using information about the pixels of the input image. To assess the quality, a neural network segmentation model with modified loss functions is trained, and a theoretical assessment is carried out using a simulation experiment showing the convergence rate and segmentation error. As a result of the study, rapidly converging loss functions were created that improve the quality of document image segmentation using additional information about the input data.
Keywords:
document image segmentation, loss functions, loss function modifications
Citation:
A. I. Perminov, D. Yu. Turdakov, O. V. Belyaeva, “Loss functions for train document image segmentation models”, Proceedings of ISP RAS, 34:2 (2022), 89–110
Linking options:
https://www.mathnet.ru/eng/tisp680 https://www.mathnet.ru/eng/tisp/v34/i2/p89
|
Statistics & downloads: |
Abstract page: | 17 | Full-text PDF : | 10 |
|