On using the computer linguistic models in the classification of biomedical images
E.Yu.Shchetinin, Financial University under the Government of the Russian Federation
Abstract:
Computer linguistic models have become widespread in natural language processing and have recently been actively applied to a variety of computer vision problems. In this article, computational experiments were carried out to assess the effectiveness of transformer models in classifying X-ray images of the lungs. The experiments used pre-trained transformer models of different sizes, ViT-B (16/32) and ViT-L (16/32), which were then fine-tuned on a set of lung X-ray images. Computational experiments with the convolutional neural networks VGG-16, Inception V3, ResNet50, EfficientNetV2, and DenseNet121 were also conducted. A comparative analysis of the classification results showed that the ViT-B/32 transformer model achieved the best metrics: accuracy = 97.56%, AUC = 99%.
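As an illustration of the two metrics reported in the abstract, the following minimal Python sketch computes accuracy and AUC for a binary classifier. It is not the author's evaluation code: the binary setting, the 0.5 decision threshold, the function names `accuracy` and `roc_auc`, and the toy labels and scores are all assumptions made for this example.

```python
def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label equals the true label."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def roc_auc(y_true, scores):
    """Area under the ROC curve, computed as the probability that a randomly
    chosen positive sample scores higher than a randomly chosen negative one
    (ties count as one half)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative sample")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = pathology present, 0 = normal (assumed labels).
y_true = [0, 0, 1, 1, 1, 0]
scores = [0.10, 0.40, 0.35, 0.80, 0.90, 0.20]  # assumed model probabilities

# Assumed decision threshold of 0.5 for turning scores into labels.
y_pred = [1 if s >= 0.5 else 0 for s in scores]

acc = accuracy(y_true, y_pred)
auc = roc_auc(y_true, scores)
print(f"accuracy = {acc:.4f}, AUC = {auc:.4f}")
```

Accuracy depends on the chosen threshold, whereas AUC depends only on how the scores rank positive samples against negative ones, which is why the two figures are usually reported together.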
Keywords:
transformers, deep convolutional networks, classification, lung X-ray images.
Received: 19.06.2023 Revised: 19.06.2023 Accepted: 11.09.2023
Citation:
E.Yu.Shchetinin, “On using the computer linguistic models in the classification of biomedical images”, Matem. Mod., 35:12 (2023), 18–30; Math. Models Comput. Simul., 16:2 (2024), 246–253
Linking options:
https://www.mathnet.ru/eng/mm4510
https://www.mathnet.ru/eng/mm/v35/i12/p18