|
Optimal control
On some works of Boris Teodorovich Polyak on the convergence of gradient methods and their development
S. S. Ablaevab, A. N. Beznosikovac, A. V. Gasnikovacd, D. M. Dvinskikhacd, A. V. Lobanovad, S. M. Puchinina, F. S. Stonyakinab a Moscow Institute of Physics and Technology, 141700, Dolgoprudny, Moscow oblast, Russia
b Crimean Federal University, 295007, Simferopol, Republic of Crimea, Russia
c Institute for Information Transmission Problems, Russian Academy of Sciences, 127051, Moscow, Russia
d Institute for System Programming, Russian Academy of Sciences, 109004, Moscow, Russia
Abstract:
The paper presents a review of the current state of subgradient and accelerated convex optimization methods, including the cases with the presence of noise and access to various information about the objective function (function value, gradient, stochastic gradient, higher derivatives). For nonconvex problems, the Polyak–Lojasiewicz condition is considered and a review of the main results is given. The behavior of numerical methods in the presence of a sharp minimum is considered. The aim of this review is to show the influence of the works of B.T. Polyak (1935–2023) on gradient optimization methods and their surroundings on the modern development of numerical optimization methods.
Key words:
gradient descent, gradient dominance condition (Polyak–Lojasiewicz), sharp minimum, subgradient Polyak–Shor method, early stopping condition, Polyak heavy ball method, stochastic gradient descent.
Received: 15.09.2023 Revised: 16.12.2023 Accepted: 17.11.2023
Citation:
S. S. Ablaev, A. N. Beznosikov, A. V. Gasnikov, D. M. Dvinskikh, A. V. Lobanov, S. M. Puchinin, F. S. Stonyakin, “On some works of Boris Teodorovich Polyak on the convergence of gradient methods and their development”, Zh. Vychisl. Mat. Mat. Fiz., 64:4 (2024), 587–626; Comput. Math. Math. Phys., 64:4 (2024), 635–675
Linking options:
https://www.mathnet.ru/eng/zvmmf11729 https://www.mathnet.ru/eng/zvmmf/v64/i4/p587
|
Statistics & downloads: |
Abstract page: | 4 |
|