Related papers: Acceleration Methods

A Universal Catalyst for First-Order Optimization

We introduce a generic scheme for accelerating first-order optimization methods in the sense of Nesterov, which builds upon a new analysis of the accelerated proximal point algorithm. Our approach consists of minimizing a convex objective…

Optimization and Control · Mathematics 2015-10-27 Hongzhou Lin , Julien Mairal , Zaid Harchaoui

Catalyst Acceleration for First-order Convex Optimization: from Theory to Practice

We introduce a generic scheme for accelerating gradient-based optimization methods in the sense of Nesterov. The approach, called Catalyst, builds upon the inexact accelerated proximal point algorithm for minimizing a convex objective…

Machine Learning · Statistics 2018-06-20 Hongzhou Lin , Julien Mairal , Zaid Harchaoui

Accelerating Proximal Gradient-type Algorithms using Damped Anderson Acceleration with Restarts and Nesterov Initialization

Despite their frequent slow convergence, proximal gradient schemes are widely used in large-scale optimization tasks due to their tremendous stability, scalability, and ease of computation. In this paper, we develop and investigate a…

Computation · Statistics 2025-08-19 Nicholas C. Henderson , Ravi Varadhan

On Adapting Nesterov's Scheme to Accelerate Iterative Methods for Linear Problems

Nesterov's well-known scheme for accelerating gradient descent in convex optimization problems is adapted to accelerating stationary iterative solvers for linear systems. Compared with classical Krylov subspace acceleration methods, the…

Optimization and Control · Mathematics 2021-08-10 Tao Hong , Irad Yavneh

Nonlinear Acceleration of Momentum and Primal-Dual Algorithms

We describe convergence acceleration schemes for multistep optimization algorithms. The extrapolated solution is written as a nonlinear average of the iterates produced by the original optimization method. Our analysis does not need the…

Optimization and Control · Mathematics 2019-10-18 Raghu Bollapragada , Damien Scieur , Alexandre d'Aspremont

Adaptive Restart of the Optimized Gradient Method for Convex Optimization

First-order methods with momentum such as Nesterov's fast gradient method are very useful for convex optimization problems, but can exhibit undesirable oscillations yielding slow convergence rates for some applications. An adaptive…

Optimization and Control · Mathematics 2019-06-14 Donghwan Kim , Jeffrey A. Fessler

A Lyapunov Analysis of Momentum Methods in Optimization

Momentum methods play a significant role in optimization. Examples include Nesterov's accelerated gradient method and the conditional gradient algorithm. Several momentum methods are provably optimal under standard oracle models, and all…

Optimization and Control · Mathematics 2018-03-13 Ashia C. Wilson , Benjamin Recht , Michael I. Jordan

Understanding Nesterov's Acceleration via Proximal Point Method

The proximal point method (PPM) is a fundamental method in optimization that is often used as a building block for designing optimization algorithms. In this work, we use the PPM method to provide conceptually simple derivations along with…

Optimization and Control · Mathematics 2022-06-03 Kwangjun Ahn , Suvrit Sra

Catalyst Acceleration for Gradient-Based Non-Convex Optimization

We introduce a generic scheme to solve nonconvex optimization problems using gradient-based algorithms originally designed for minimizing convex functions. Even though these methods may originally require convexity to operate, the proposed…

Machine Learning · Statistics 2019-01-03 Courtney Paquette , Hongzhou Lin , Dmitriy Drusvyatskiy , Julien Mairal , Zaid Harchaoui

Unifying restart accelerated gradient and proximal bundle methods

This paper presents a novel restarted version of Nesterov's accelerated gradient method and establishes its optimal iteration-complexity for solving convex smooth composite optimization problems. The proposed restart accelerated gradient…

Optimization and Control · Mathematics 2025-01-09 Jiaming Liang

Delayed supermartingale convergence lemmas for stochastic approximation with Nesterov momentum

This paper focus on the convergence of stochastic approximation with Nesterov momentum. Nesterov acceleration has proven effective in machine learning for its ability to reduce computational complexity. The issue of delayed information in…

Optimization and Control · Mathematics 2024-06-11 Zhang Ming-Kun

Cubic Regularization with Momentum for Nonconvex Optimization

Momentum is a popular technique to accelerate the convergence in practical training, and its impact on convergence guarantee has been well-studied for first-order algorithms. However, such a successful acceleration technique has not yet…

Optimization and Control · Mathematics 2019-06-28 Zhe Wang , Yi Zhou , Yingbin Liang , Guanghui Lan

Nesterov's Accelerated Gradient and Momentum as approximations to Regularised Update Descent

We present a unifying framework for adapting the update direction in gradient-based iterative optimization methods. As natural special cases we re-derive classical momentum and Nesterov's accelerated gradient method, lending a new intuitive…

Machine Learning · Statistics 2016-07-12 Aleksandar Botev , Guy Lever , David Barber

Continuized Nesterov Acceleration for Non-Convex Optimization

In convex optimization, continuous-time counterparts have been a fruitful tool for analyzing momentum algorithms. Fewer such examples are available when the function to minimize is non-convex. In several cases, discrepancies arise between…

Optimization and Control · Mathematics 2026-01-07 Julien Hermant , Jean-François Aujol , Charles Dossal , Lorick Huang , Aude Rondepierre

On a Combination of Alternating Minimization and Nesterov's Momentum

Alternating minimization (AM) procedures are practically efficient in many applications for solving convex and non-convex optimization problems. On the other hand, Nesterov's accelerated gradient is theoretically optimal first-order method…

Optimization and Control · Mathematics 2021-09-16 Sergey Guminov , Pavel Dvurechensky , Nazarii Tupitsa , Alexander Gasnikov

Accelerated Gradient Methods with Gradient Restart: Global Linear Convergence

Gradient restarting has been shown to improve the numerical performance of accelerated gradient methods. This paper provides a mathematical analysis to understand these advantages. First, we establish global linear convergence guarantees…

Optimization and Control · Mathematics 2025-05-28 Chenglong Bao , Liang Chen , Jiahong Li , Zuowei Shen

On Accelerated Methods in Optimization

In convex optimization, there is an {\em acceleration} phenomenon in which we can boost the convergence rate of certain gradient-based algorithms. We can observe this phenomenon in Nesterov's accelerated gradient descent, accelerated mirror…

Optimization and Control · Mathematics 2015-09-14 Andre Wibisono , Ashia C. Wilson

From differential equation solvers to accelerated first-order methods for convex optimization

Convergence analysis of accelerated first-order methods for convex optimization problems are presented from the point of view of ordinary differential equation solvers. A new dynamical system, called Nesterov accelerated gradient flow, has…

Optimization and Control · Mathematics 2022-03-01 Hao Luo , Long Chen

Analytical Study of Momentum-Based Acceleration Methods in Paradigmatic High-Dimensional Non-Convex Problems

The optimization step in many machine learning problems rarely relies on vanilla gradient descent but it is common practice to use momentum-based accelerated methods. Despite these algorithms being widely applied to arbitrary loss…

Disordered Systems and Neural Networks · Physics 2021-10-29 Stefano Sarao Mannelli , Pierfrancesco Urbani

A Unified Scheme to Accelerate Adaptive Cubic Regularization and Gradient Methods for Convex Optimization

In this paper we propose a unified two-phase scheme for convex optimization to accelerate: (1) the adaptive cubic regularization methods with exact/inexact Hessian matrices, and (2) the adaptive gradient method, without any knowledge of the…

Optimization and Control · Mathematics 2017-12-29 Bo Jiang , Tianyi Lin , Shuzhong Zhang