Computer Research and Modeling
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Computer Research and Modeling:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Computer Research and Modeling, 2022, Volume 14, Issue 6, Pages 1221–1238
DOI: https://doi.org/10.20537/2076-7633-2022-14-6-1221-1238
(Mi crm1029)
 

NUMERICAL METHODS AND THE BASIS FOR THEIR APPLICATION

Optimal threshold selection algorithms for multi-label classification: property study

A. I. Berger, S. A. Guda

Southern Federal University, Rostov-on-Don Southern Federal University, 105/42 Bolshaya Sadovaya st., Rostov-on-Don, 344006, Russia
References:
Abstract: Multi-label classification models arise in various areas of life, which is explained by an increasing amount of information that requires prompt analysis. One of the mathematical methods for solving this problem is a plug-in approach, at the first stage of which, for each class, a certain ranking function is built, ordering all objects in some way, and at the second stage, the optimal thresholds are selected, the objects on one side of which are assigned to the current class, and on the other — to the other. Thresholds are chosen to maximize the target quality measure. The algorithms which properties are investigated in this article are devoted to the second stage of the plug-in approach which is the choice of the optimal threshold vector. This step becomes non-trivial if the $F$-measure of average precision and recall is used as the target quality assessment since it does not allow independent threshold optimization in each class. In problems of extreme multi-label classification, the number of classes can reach hundreds of thousands, so the original optimization problem is reduced to the problem of searching a fixed point of a specially introduced transformation V, defined on a unit square on the plane of average precision $P$ and recall $R$. Using this transformation, two algorithms are proposed for optimization: the $F$-measure linearization method and the method of V domain analysis. The properties of algorithms are studied when applied to multi-label classification data sets of various sizes and origin, in particular, the dependence of the error on the number of classes, on the $F$-measure parameter, and on the internal parameters of methods under study. The peculiarity of both algorithms work when used for problems with the domain of V, containing large linear boundaries, was found. In case when the optimal point is located in the vicinity of these boundaries, the errors of both methods do not decrease with an increase in the number of classes. In this case, the linearization method quite accurately determines the argument of the optimal point, while the method of V domain analysis — the polar radius.
Keywords: multi-label classification, extreme classification, $F$-measure, linearization method, domain analysis method.
Funding agency Grant number
Russian Science Foundation 20-43-01015
The authors acknowledge Russian Science Foundation grant No. 20-43-01015 for the financial support.
Received: 24.02.2022
Revised: 09.06.2022
Accepted: 08.09.2022
Document Type: Article
UDC: 519.8
Language: Russian
Citation: A. I. Berger, S. A. Guda, “Optimal threshold selection algorithms for multi-label classification: property study”, Computer Research and Modeling, 14:6 (2022), 1221–1238
Citation in format AMSBIB
\Bibitem{BerGud22}
\by A.~I.~Berger, S.~A.~Guda
\paper Optimal threshold selection algorithms for multi-label classification: property study
\jour Computer Research and Modeling
\yr 2022
\vol 14
\issue 6
\pages 1221--1238
\mathnet{http://mi.mathnet.ru/crm1029}
\crossref{https://doi.org/10.20537/2076-7633-2022-14-6-1221-1238}
Linking options:
  • https://www.mathnet.ru/eng/crm1029
  • https://www.mathnet.ru/eng/crm/v14/i6/p1221
  • Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Computer Research and Modeling
    Statistics & downloads:
    Abstract page:58
    Full-text PDF :73
    References:14
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024