INTELLIGENT SYSTEMS AND TECHNOLOGIES
Analysis of the usage of problem-oriented datasets in scientific research
V. V. Arlazarov (a, b)
a Federal Research Center "Computer Science and Control" of the Russian Academy of Sciences, Moscow, Russia
b Smart Engines Service LLC, Moscow
Abstract:
This paper considers the creation and use of open problem-oriented datasets to facilitate verifiable and reproducible research, based on a study of how the MIDV family of datasets, which contains images and video sequences of identity documents, has been used. We analyze published scientific works in computer vision, image processing, and computational linguistics that use these datasets. We describe the main problems tackled by the research groups and formulate general principles that could guide the creation and expansion of datasets of this class.
Keywords:
text recognition, document analysis, datasets, reproducible research, image processing.
Citation:
V. V. Arlazarov, “Analysis of the usage of problem-oriented datasets in scientific research”, Informatsionnye Tekhnologii i Vychislitel'nye Sistemy, 2022, no. 3, 10–23
Linking options:
https://www.mathnet.ru/eng/itvs772 https://www.mathnet.ru/eng/itvs/y2022/i3/p10