Computer Research and Modeling
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Computer Research and Modeling:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Computer Research and Modeling, 2022, Volume 14, Issue 2, Pages 277–308
DOI: https://doi.org/10.20537/2076-7633-2022-14-2-277-308
(Mi crm968)
 

This article is cited in 3 scientific papers (total in 3 papers)

MATHEMATICAL MODELING AND NUMERICAL SIMULATION

Averaged heavy-ball method

M. Yu. Danilovaa, G. Malinovskiibc

a V. A. Trapeznikov Institute of Control Sciences of Russian Academy of Sciences, 65 Profsoyuznaya st., Moscow, 117997, Russia
b Moscow Institute of Physics and Technology (National Research University), 1a Kerchenskaya st., Moscow, 117303, Russia
c King Abdullah University of Science and Technology, Thuwal 23955-6900, Kingdom of Saudi Arabia
References:
Abstract: First-order optimization methods are work horses in a wide range of modern applications in economics, physics, biology, machine learning, control, and other fields. Among other first-order methods accelerated and momentum ones obtain special attention because of their practical efficiency. The heavy-ball method (HB) is one of the first momentum methods. The method was proposed in 1964 and the first analysis was conducted for quadratic strongly convex functions. Since then a number of variations of HB have been proposed and analyzed. In particular, HB is known for its simplicity in implementation and its performance on nonconvex problems. However, as other momentum methods, it has nonmonotone behavior, and for optimal parameters, the method suffers from the so-called peak effect. To address this issue, in this paper, we consider an averaged version of the heavy-ball method (AHB). We show that for quadratic problems AHB has a smaller maximal deviation from the solution than HB. Moreover, for general convex and strongly convex functions, we prove non-accelerated rates of global convergence of AHB, its weighted version WAHB, and for AHB with restarts R-AHB. To the best of our knowledge, such guarantees for HB with averaging were not explicitly proven for strongly convex problems in the existing works. Finally, we conduct several numerical experiments on minimizing quadratic and nonquadratic functions to demonstrate the advantages of using averaging for HB. Moreover, we also tested one more modification of AHB called the tail-averaged heavy-ball method (TAHB). In the experiments, we observed that HB with a properly adjusted averaging scheme converges faster than HB without averaging and has smaller oscillations.
Keywords: first-order methods, convex optimization, momentum methods, global convergence guarantees.
Funding agency Grant number
Russian Foundation for Basic Research 20-31-90073
Russian Science Foundation 21-71-30005
This work was supported by Russian Foundation for Basic Research (Theorems 2 and 3, project No. 20-31-90073) and by Russian Science Foundation (Theorems 5 and 4, project No. 21-71-30005)
Received: 18.11.2021
Accepted: 13.02.2022
Document Type: Article
UDC: 519.6
Language: English
Citation: M. Yu. Danilova, G. Malinovskii, “Averaged heavy-ball method”, Computer Research and Modeling, 14:2 (2022), 277–308
Citation in format AMSBIB
\Bibitem{DanMal22}
\by M.~Yu.~Danilova, G.~Malinovskii
\paper Averaged heavy-ball method
\jour Computer Research and Modeling
\yr 2022
\vol 14
\issue 2
\pages 277--308
\mathnet{http://mi.mathnet.ru/crm968}
\crossref{https://doi.org/10.20537/2076-7633-2022-14-2-277-308}
Linking options:
  • https://www.mathnet.ru/eng/crm968
  • https://www.mathnet.ru/eng/crm/v14/i2/p277
  • This publication is cited in the following 3 articles:
    Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Computer Research and Modeling
    Statistics & downloads:
    Abstract page:125
    Full-text PDF :89
    References:27
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024