|
This article is cited in 3 scientific papers (total in 3 papers)
MATHEMATICAL MODELING AND NUMERICAL SIMULATION
Averaged heavy-ball method
M. Yu. Danilovaa, G. Malinovskiibc a V. A. Trapeznikov Institute of Control Sciences of Russian Academy of Sciences,
65 Profsoyuznaya st., Moscow, 117997, Russia
b Moscow Institute of Physics and Technology (National Research University),
1a Kerchenskaya st., Moscow, 117303, Russia
c King Abdullah University of Science and Technology,
Thuwal 23955-6900, Kingdom of Saudi Arabia
Abstract:
First-order optimization methods are work horses in a wide range of modern applications in economics, physics, biology, machine learning, control, and other fields. Among other first-order methods accelerated and momentum ones obtain special attention because of their practical efficiency. The heavy-ball method (HB) is one of the first momentum methods. The method was proposed in 1964 and the first analysis was conducted for quadratic strongly convex functions. Since then a number of variations of HB have been proposed and analyzed. In particular, HB is known for its simplicity in implementation and its performance on nonconvex problems. However, as other momentum methods, it has nonmonotone behavior, and for optimal parameters, the method suffers from the so-called peak effect. To address this issue, in this paper, we consider an averaged version of the heavy-ball method (AHB). We show that for quadratic problems AHB has a smaller maximal deviation from the solution than HB. Moreover, for general convex and strongly convex functions, we prove non-accelerated rates of global convergence of AHB, its weighted version WAHB, and for AHB with restarts R-AHB. To the best of our knowledge, such guarantees for HB with averaging were not explicitly proven for strongly convex problems in the existing works. Finally, we conduct several numerical experiments on minimizing quadratic and nonquadratic functions to demonstrate the advantages of using averaging for HB. Moreover, we also tested one more modification of AHB called the tail-averaged heavy-ball method (TAHB). In the experiments, we observed that HB with a properly adjusted averaging scheme converges faster than HB without averaging and has smaller oscillations.
Keywords:
first-order methods, convex optimization, momentum methods, global convergence guarantees.
Received: 18.11.2021 Accepted: 13.02.2022
Citation:
M. Yu. Danilova, G. Malinovskii, “Averaged heavy-ball method”, Computer Research and Modeling, 14:2 (2022), 277–308
Linking options:
https://www.mathnet.ru/eng/crm968 https://www.mathnet.ru/eng/crm/v14/i2/p277
|
Statistics & downloads: |
Abstract page: | 125 | Full-text PDF : | 89 | References: | 27 |
|