Related papers: Generalizing the optimized gradient method for smo…

On the convergence analysis of the optimized gradient method

This paper considers the problem of unconstrained minimization of smooth convex functions having Lipschitz continuous gradients with known Lipschitz constant. We recently proposed an optimized gradient method (OGM) for this problem and…

Optimization and Control · Mathematics 2019-06-14 Donghwan Kim , Jeffrey A. Fessler

Optimizing the Efficiency of First-Order Methods for Decreasing the Gradient of Smooth Convex Functions

This paper optimizes the step coefficients of first-order methods for smooth convex minimization in terms of the worst-case convergence bound (i.e., efficiency) of the decrease in the gradient norm. This work is based on the performance…

Optimization and Control · Mathematics 2020-10-28 Donghwan Kim , Jeffrey A. Fessler

Beyond Minimax Optimality: A Subgame Perfect Gradient Method

The study of convex optimization has historically been concerned with worst-case convergence rates. The development of the Optimized Gradient Method (OGM), due to \citet{drori2012PerformanceOF,Kim2016optimal}, marked a major milestone in…

Optimization and Control · Mathematics 2026-04-21 Benjamin Grimmer , Kevin Shu , Alex L. Wang

Optimization via First-Order Switching Methods: Skew-Symmetric Dynamics and Optimistic Discretization

Large-scale constrained optimization problems are at the core of many tasks in control, signal processing, and machine learning. Notably, problems with functional constraints arise when, beyond a performance{\nobreakdash-}centric goal…

Optimization and Control · Mathematics 2025-05-15 Antesh Upadhyay , Sang Bin Moon , Abolfazl Hashemi

Making Gradient Descent Optimal for Strongly Convex Stochastic Optimization

Stochastic gradient descent (SGD) is a simple and popular method to solve stochastic optimization problems which arise in machine learning. For strongly convex problems, its convergence rate was known to be O(\log(T)/T), by running SGD for…

Machine Learning · Computer Science 2015-03-19 Alexander Rakhlin , Ohad Shamir , Karthik Sridharan

Efficient First-order Methods for Convex Minimization: a Constructive Approach

We describe a novel constructive technique for devising efficient first-order methods for a wide range of large-scale convex minimization settings, including smooth, non-smooth, and strongly convex minimization. The technique builds upon a…

Optimization and Control · Mathematics 2019-06-27 Yoel Drori , Adrien B. Taylor

Optimized first-order methods for smooth convex minimization

We introduce new optimized first-order methods for smooth unconstrained convex minimization. Drori and Teboulle recently described a numerical method for computing the $N$-iteration optimal step coefficients in a class of first-order…

Optimization and Control · Mathematics 2019-06-14 Donghwan Kim , Jeffrey A. Fessler

On the Curved Geometry of Accelerated Optimization

In this work we propose a differential geometric motivation for Nesterov's accelerated gradient method (AGM) for strongly-convex problems. By considering the optimization procedure as occurring on a Riemannian manifold with a natural…

Machine Learning · Computer Science 2019-11-21 Aaron Defazio

Factor-$\sqrt{2}$ Acceleration of Accelerated Gradient Methods

The optimized gradient method (OGM) provides a factor-$\sqrt{2}$ speedup upon Nesterov's celebrated accelerated gradient method in the convex (but non-strongly convex) setup. However, this improved acceleration mechanism has not been well…

Optimization and Control · Mathematics 2021-05-25 Chanwoo Park , Jisun Park , Ernest K. Ryu

A Note on the Gradient-Evaluation Sequence in Accelerated Gradient Methods

Nesterov's accelerated gradient descent method (AGD) is a seminal deterministic first-order method known to achieve the optimal order of iteration complexity for solving convex smooth optimization problems. Two distinct sequences of…

Optimization and Control · Mathematics 2026-03-10 Yan Wu , Yipeng Zhang , Lu Liu , Yuyuan Ouyang

On Optimal Universal First-Order Methods for Minimizing Heterogeneous Sums

This work considers minimizing a sum of convex functions, each with potentially different structure ranging from nonsmooth to smooth, Lipschitz to non-Lipschitz. Nesterov's universal fast gradient method provides an optimal black-box…

Optimization and Control · Mathematics 2023-06-14 Benjamin Grimmer

An optimal gradient method for smooth strongly convex minimization

We present an optimal gradient method for smooth strongly convex optimization. The method is optimal in the sense that its worst-case bound on the distance to an optimal point exactly matches the lower bound on the oracle complexity for the…

Optimization and Control · Mathematics 2022-06-15 Adrien Taylor , Yoel Drori

Adaptive Restart of the Optimized Gradient Method for Convex Optimization

First-order methods with momentum such as Nesterov's fast gradient method are very useful for convex optimization problems, but can exhibit undesirable oscillations yielding slow convergence rates for some applications. An adaptive…

Optimization and Control · Mathematics 2019-06-14 Donghwan Kim , Jeffrey A. Fessler

Minimizing Finite Sums with the Stochastic Average Gradient

We propose the stochastic average gradient (SAG) method for optimizing the sum of a finite number of smooth convex functions. Like stochastic gradient (SG) methods, the SAG method's iteration cost is independent of the number of terms in…

Optimization and Control · Mathematics 2016-05-12 Mark Schmidt , Nicolas Le Roux , Francis Bach

High-Resolution Modeling of the Fastest First-Order Optimization Method for Strongly Convex Functions

Motivated by the fact that the gradient-based optimization algorithms can be studied from the perspective of limiting ordinary differential equations (ODEs), here we derive an ODE representation of the accelerated triple momentum (TM)…

Optimization and Control · Mathematics 2020-08-26 Boya Sun , Jemin George , Solmaz Kia

Regularized Risk Minimization by Nesterov's Accelerated Gradient Methods: Algorithmic Extensions and Empirical Studies

Nesterov's accelerated gradient methods (AGM) have been successfully applied in many machine learning areas. However, their empirical performance on training max-margin models has been inferior to existing specialized solvers. In this…

Machine Learning · Computer Science 2010-11-03 Xinhua Zhang , Ankan Saha , S. V. N. Vishwanathan

Near-Optimal Methods for Minimizing Star-Convex Functions and Beyond

In this paper, we provide near-optimal accelerated first-order methods for minimizing a broad class of smooth nonconvex functions that are strictly unimodal on all lines through a minimizer. This function class, which we call the class of…

Optimization and Control · Mathematics 2023-02-27 Oliver Hinder , Aaron Sidford , Nimit S. Sohoni

On Graduated Optimization for Stochastic Non-Convex Problems

The graduated optimization approach, also known as the continuation method, is a popular heuristic to solving non-convex problems that has received renewed interest over the last decade. Despite its popularity, very little is known in terms…

Machine Learning · Computer Science 2015-07-28 Elad Hazan , Kfir Y. Levy , Shai Shalev-Shwartz

Optimal and parameter-free gradient minimization methods for convex and nonconvex optimization

We propose novel optimal and parameter-free algorithms for computing an approximate solution with small (projected) gradient norm. Specifically, for computing an approximate solution such that the norm of its (projected) gradient does not…

Optimization and Control · Mathematics 2024-11-18 Guanghui Lan , Yuyuan Ouyang , Zhe Zhang

Accelerated Gradient Methods for Nonconvex Nonlinear and Stochastic Programming

In this paper, we generalize the well-known Nesterov's accelerated gradient (AG) method, originally designed for convex smooth optimization, to solve nonconvex and possibly stochastic optimization problems. We demonstrate that by properly…

Optimization and Control · Mathematics 2013-10-15 Saeed Ghadimi , Guanghui Lan