English
Related papers

Related papers: Acceleration Methods

200 papers

We introduce a generic scheme for accelerating first-order optimization methods in the sense of Nesterov, which builds upon a new analysis of the accelerated proximal point algorithm. Our approach consists of minimizing a convex objective…

Optimization and Control · Mathematics 2015-10-27 Hongzhou Lin , Julien Mairal , Zaid Harchaoui

We introduce a generic scheme for accelerating gradient-based optimization methods in the sense of Nesterov. The approach, called Catalyst, builds upon the inexact accelerated proximal point algorithm for minimizing a convex objective…

Machine Learning · Statistics 2018-06-20 Hongzhou Lin , Julien Mairal , Zaid Harchaoui

Despite their frequent slow convergence, proximal gradient schemes are widely used in large-scale optimization tasks due to their tremendous stability, scalability, and ease of computation. In this paper, we develop and investigate a…

Computation · Statistics 2025-08-19 Nicholas C. Henderson , Ravi Varadhan

Nesterov's well-known scheme for accelerating gradient descent in convex optimization problems is adapted to accelerating stationary iterative solvers for linear systems. Compared with classical Krylov subspace acceleration methods, the…

Optimization and Control · Mathematics 2021-08-10 Tao Hong , Irad Yavneh

We describe convergence acceleration schemes for multistep optimization algorithms. The extrapolated solution is written as a nonlinear average of the iterates produced by the original optimization method. Our analysis does not need the…

Optimization and Control · Mathematics 2019-10-18 Raghu Bollapragada , Damien Scieur , Alexandre d'Aspremont

First-order methods with momentum such as Nesterov's fast gradient method are very useful for convex optimization problems, but can exhibit undesirable oscillations yielding slow convergence rates for some applications. An adaptive…

Optimization and Control · Mathematics 2019-06-14 Donghwan Kim , Jeffrey A. Fessler

Momentum methods play a significant role in optimization. Examples include Nesterov's accelerated gradient method and the conditional gradient algorithm. Several momentum methods are provably optimal under standard oracle models, and all…

Optimization and Control · Mathematics 2018-03-13 Ashia C. Wilson , Benjamin Recht , Michael I. Jordan

The proximal point method (PPM) is a fundamental method in optimization that is often used as a building block for designing optimization algorithms. In this work, we use the PPM method to provide conceptually simple derivations along with…

Optimization and Control · Mathematics 2022-06-03 Kwangjun Ahn , Suvrit Sra

We introduce a generic scheme to solve nonconvex optimization problems using gradient-based algorithms originally designed for minimizing convex functions. Even though these methods may originally require convexity to operate, the proposed…

Machine Learning · Statistics 2019-01-03 Courtney Paquette , Hongzhou Lin , Dmitriy Drusvyatskiy , Julien Mairal , Zaid Harchaoui

This paper presents a novel restarted version of Nesterov's accelerated gradient method and establishes its optimal iteration-complexity for solving convex smooth composite optimization problems. The proposed restart accelerated gradient…

Optimization and Control · Mathematics 2025-01-09 Jiaming Liang

This paper focus on the convergence of stochastic approximation with Nesterov momentum. Nesterov acceleration has proven effective in machine learning for its ability to reduce computational complexity. The issue of delayed information in…

Optimization and Control · Mathematics 2024-06-11 Zhang Ming-Kun

Momentum is a popular technique to accelerate the convergence in practical training, and its impact on convergence guarantee has been well-studied for first-order algorithms. However, such a successful acceleration technique has not yet…

Optimization and Control · Mathematics 2019-06-28 Zhe Wang , Yi Zhou , Yingbin Liang , Guanghui Lan

We present a unifying framework for adapting the update direction in gradient-based iterative optimization methods. As natural special cases we re-derive classical momentum and Nesterov's accelerated gradient method, lending a new intuitive…

Machine Learning · Statistics 2016-07-12 Aleksandar Botev , Guy Lever , David Barber

In convex optimization, continuous-time counterparts have been a fruitful tool for analyzing momentum algorithms. Fewer such examples are available when the function to minimize is non-convex. In several cases, discrepancies arise between…

Optimization and Control · Mathematics 2026-01-07 Julien Hermant , Jean-François Aujol , Charles Dossal , Lorick Huang , Aude Rondepierre

Alternating minimization (AM) procedures are practically efficient in many applications for solving convex and non-convex optimization problems. On the other hand, Nesterov's accelerated gradient is theoretically optimal first-order method…

Optimization and Control · Mathematics 2021-09-16 Sergey Guminov , Pavel Dvurechensky , Nazarii Tupitsa , Alexander Gasnikov

Gradient restarting has been shown to improve the numerical performance of accelerated gradient methods. This paper provides a mathematical analysis to understand these advantages. First, we establish global linear convergence guarantees…

Optimization and Control · Mathematics 2025-05-28 Chenglong Bao , Liang Chen , Jiahong Li , Zuowei Shen

In convex optimization, there is an {\em acceleration} phenomenon in which we can boost the convergence rate of certain gradient-based algorithms. We can observe this phenomenon in Nesterov's accelerated gradient descent, accelerated mirror…

Optimization and Control · Mathematics 2015-09-14 Andre Wibisono , Ashia C. Wilson

Convergence analysis of accelerated first-order methods for convex optimization problems are presented from the point of view of ordinary differential equation solvers. A new dynamical system, called Nesterov accelerated gradient flow, has…

Optimization and Control · Mathematics 2022-03-01 Hao Luo , Long Chen

The optimization step in many machine learning problems rarely relies on vanilla gradient descent but it is common practice to use momentum-based accelerated methods. Despite these algorithms being widely applied to arbitrary loss…

Disordered Systems and Neural Networks · Physics 2021-10-29 Stefano Sarao Mannelli , Pierfrancesco Urbani

In this paper we propose a unified two-phase scheme for convex optimization to accelerate: (1) the adaptive cubic regularization methods with exact/inexact Hessian matrices, and (2) the adaptive gradient method, without any knowledge of the…

Optimization and Control · Mathematics 2017-12-29 Bo Jiang , Tianyi Lin , Shuzhong Zhang
‹ Prev 1 2 3 10 Next ›