M. Yu. Danilova, G. Malinovskii, “Averaged heavy-ball method”, Computer Research and Modeling, 14:2 (2022), 277

Loading [MathJax]/jax/output/SVG/config.js

Computer Research and Modeling

RUS ENG

JOURNALS PEOPLE ORGANISATIONS CONFERENCES SEMINARS VIDEO LIBRARY PACKAGE AMSBIB

JavaScript is disabled in your browser. Please switch it on to enable full functionality of the website

	General information
	Latest issue
	Archive

	Search papers
	Search references

	RSS
	Latest issue
	Current issues
	Archive issues
	What is RSS

Computer Research and Modeling:
Year:
Volume:
Issue:
Page:
	Find

Personal entry:
Login:
Password:
	Save password
	Enter
	Forgotten password?
	Register

Computer Research and Modeling, 2022, Volume 14, Issue 2, Pages 277–308
DOI: https://doi.org/10.20537/2076-7633-2022-14-2-277-308 (Mi crm968)

This article is cited in 3 scientific papers (total in 3 papers)

MATHEMATICAL MODELING AND NUMERICAL SIMULATION

Averaged heavy-ball method

M. Yu. Danilova^a, G. Malinovskii^bc

^a V. A. Trapeznikov Institute of Control Sciences of Russian Academy of Sciences, 65 Profsoyuznaya st., Moscow, 117997, Russia
^b Moscow Institute of Physics and Technology (National Research University), 1a Kerchenskaya st., Moscow, 117303, Russia
^c King Abdullah University of Science and Technology, Thuwal 23955-6900, Kingdom of Saudi Arabia

Full-text PDF (2756 kB) Citations (3)

References:

PDF

HTML

DOI: https://doi.org/10.20537/2076-7633-2022-14-2-277-308

Abstract: First-order optimization methods are work horses in a wide range of modern applications in economics, physics, biology, machine learning, control, and other fields. Among other first-order methods accelerated and momentum ones obtain special attention because of their practical efficiency. The heavy-ball method (HB) is one of the first momentum methods. The method was proposed in 1964 and the first analysis was conducted for quadratic strongly convex functions. Since then a number of variations of HB have been proposed and analyzed. In particular, HB is known for its simplicity in implementation and its performance on nonconvex problems. However, as other momentum methods, it has nonmonotone behavior, and for optimal parameters, the method suffers from the so-called peak effect. To address this issue, in this paper, we consider an averaged version of the heavy-ball method (AHB). We show that for quadratic problems AHB has a smaller maximal deviation from the solution than HB. Moreover, for general convex and strongly convex functions, we prove non-accelerated rates of global convergence of AHB, its weighted version WAHB, and for AHB with restarts R-AHB. To the best of our knowledge, such guarantees for HB with averaging were not explicitly proven for strongly convex problems in the existing works. Finally, we conduct several numerical experiments on minimizing quadratic and nonquadratic functions to demonstrate the advantages of using averaging for HB. Moreover, we also tested one more modification of AHB called the tail-averaged heavy-ball method (TAHB). In the experiments, we observed that HB with a properly adjusted averaging scheme converges faster than HB without averaging and has smaller oscillations.

Keywords: first-order methods, convex optimization, momentum methods, global convergence guarantees.

Funding agency	Grant number
Russian Foundation for Basic Research	20-31-90073
Russian Science Foundation	21-71-30005
This work was supported by Russian Foundation for Basic Research (Theorems 2 and 3, project No. 20-31-90073) and by Russian Science Foundation (Theorems 5 and 4, project No. 21-71-30005)

Received: 18.11.2021
Accepted: 13.02.2022

Document Type: Article

UDC: 519.6

Language: English

Citation: M. Yu. Danilova, G. Malinovskii, “Averaged heavy-ball method”, Computer Research and Modeling, 14:2 (2022), 277–308

Citation in format AMSBIB

\Bibitem{DanMal22}

\by M.~Yu.~Danilova, G.~Malinovskii

\paper Averaged heavy-ball method

\jour Computer Research and Modeling

\yr 2022

\vol 14

\issue 2

\pages 277--308

\mathnet{http://mi.mathnet.ru/crm968}

\crossref{https://doi.org/10.20537/2076-7633-2022-14-2-277-308}

Linking options:

https://www.mathnet.ru/eng/crm968

https://www.mathnet.ru/eng/crm/v14/i2/p277

This publication is cited in the following 3 articles:

Gerasim V. Krivovichev, Valentina Yu. Sergeeva, “Analysis of a Two-Step Gradient Method with Two Momentum Parameters for Strongly Convex Unconstrained Optimization”, Algorithms, 17:3 (2024), 126
Samantha Samuelson, Mihailo R. Jovanović, 2024 American Control Conference (ACC), 2024, 650
Hesameddin Mohammadi, Samantha Samuelson, Mihailo R. Jovanović, “Transient Growth of Accelerated Optimization Algorithms”, IEEE Trans. Automat. Contr., 68:3 (2023), 1823

Citing articles in Google Scholar: Russian citations, English citations
Related articles in Google Scholar: Russian articles, English articles

Statistics & downloads:
Abstract page:	151
Full-text PDF :	94
References:	36

Что такое QR-код?

Registration to the website

Logotypes