Related papers: Last-iterate convergence rates for min-max optimiz…

An Improved Last-Iterate Convergence Rate for Anchored Gradient Descent Ascent

We analyze the last-iterate convergence of the Anchored Gradient Descent Ascent algorithm for smooth convex-concave min-max problems. While previous work established a last-iterate rate of $\mathcal{O}(1/t^{2-2p})$ for the squared gradient…

Optimization and Control · Mathematics 2026-04-07 Anja Surina , Arun Suggala , George Tsoukalas , Anton Kovsharov , Sergey Shirobokov , Francisco J. R. Ruiz , Pushmeet Kohli , Swarat Chaudhuri

Convergence Rates of Two-Time-Scale Gradient Descent-Ascent Dynamics for Solving Nonconvex Min-Max Problems

There are much recent interests in solving noncovnex min-max optimization problems due to its broad applications in many areas including machine learning, networked resource allocations, and distributed optimization. Perhaps, the most…

Optimization and Control · Mathematics 2021-12-20 Thinh T. Doan

Revisiting the Last-Iterate Convergence of Stochastic Gradient Methods

In the past several years, the last-iterate convergence of the Stochastic Gradient Descent (SGD) algorithm has triggered people's interest due to its good performance in practice but lack of theoretical understanding. For Lipschitz convex…

Machine Learning · Computer Science 2026-03-20 Zijian Liu , Zhengyuan Zhou

Last iterate convergence in no-regret learning: constrained min-max optimization for convex-concave landscapes

In a recent series of papers it has been established that variants of Gradient Descent/Ascent and Mirror Descent exhibit last iterate convergence in convex-concave zero-sum games. Specifically, \cite{DISZ17, LiangS18} show last iterate…

Machine Learning · Computer Science 2025-09-30 Qi Lei , Sai Ganesh Nagarajan , Ioannis Panageas , Xiao Wang

Min-Max Optimization under Delays

Delays and asynchrony are inevitable in large-scale machine-learning problems where communication plays a key role. As such, several works have extensively analyzed stochastic optimization with delayed gradients. However, as far as we are…

Machine Learning · Computer Science 2023-08-28 Arman Adibi , Aritra Mitra , Hamed Hassani

Convergence Rates of Subgradient Methods for Quasi-convex Optimization Problems

Quasi-convex optimization acts a pivotal part in many fields including economics and finance; the subgradient method is an effective iterative algorithm for solving large-scale quasi-convex optimization problems. In this paper, we…

Optimization and Control · Mathematics 2019-10-25 Yaohua Hu , Jiawen Li , Carisa Kwok Wai Yu

Multi-consensus Decentralized Accelerated Gradient Descent

This paper considers the decentralized convex optimization problem, which has a wide range of applications in large-scale machine learning, sensor networks, and control theory. We propose novel algorithms that achieve optimal computation…

Machine Learning · Computer Science 2023-10-11 Haishan Ye , Luo Luo , Ziang Zhou , Tong Zhang

Convergence Analysis of the Last Iterate in Distributed Stochastic Gradient Descent with Momentum

Distributed stochastic gradient methods are widely used to preserve data privacy and ensure scalability in large-scale learning tasks. While existing theory on distributed momentum Stochastic Gradient Descent (mSGD) mainly focuses on…

Optimization and Control · Mathematics 2025-05-19 Difei Cheng , Ruinan Jin , Hong Qiao , Bo Zhang

ODE Analysis of Stochastic Gradient Methods with Optimism and Anchoring for Minimax Problems

Despite remarkable empirical success, the training dynamics of generative adversarial networks (GAN), which involves solving a minimax game using stochastic gradients, is still poorly understood. In this work, we analyze last-iterate…

Machine Learning · Computer Science 2020-10-13 Ernest K. Ryu , Kun Yuan , Wotao Yin

Improved Last-Iterate Convergence of Shuffling Gradient Methods for Nonsmooth Convex Optimization

We study the convergence of the shuffling gradient method, a popular algorithm employed to minimize the finite-sum function with regularization, in which functions are passed to apply (Proximal) Gradient Descent (GD) one by one whose order…

Optimization and Control · Mathematics 2025-05-30 Zijian Liu , Zhengyuan Zhou

On Distributed Stochastic Gradient Algorithms for Global Optimization

The paper considers the problem of network-based computation of global minima in smooth nonconvex optimization problems. It is known that distributed gradient-descent-type algorithms can achieve convergence to the set of global minima by…

Optimization and Control · Mathematics 2019-10-24 Brian Swenson , Anirudh Sridhar , H. Vincent Poor

Last-Iterate Convergence: Zero-Sum Games and Constrained Min-Max Optimization

Motivated by applications in Game Theory, Optimization, and Generative Adversarial Networks, recent work of Daskalakis et al \cite{DISZ17} and follow-up work of Liang and Stokes \cite{LiangS18} have established that a variant of the widely…

Optimization and Control · Mathematics 2025-09-30 Constantinos Daskalakis , Ioannis Panageas

Last Iterate is Slower than Averaged Iterate in Smooth Convex-Concave Saddle Point Problems

In this paper we study the smooth convex-concave saddle point problem. Specifically, we analyze the last iterate convergence properties of the Extragradient (EG) algorithm. It is well known that the ergodic (averaged) iterates of EG…

Machine Learning · Computer Science 2020-07-08 Noah Golowich , Sarath Pattathil , Constantinos Daskalakis , Asuman Ozdaglar

Last-Iterate Convergence of Saddle-Point Optimizers via High-Resolution Differential Equations

Several widely-used first-order saddle-point optimization methods yield an identical continuous-time ordinary differential equation (ODE) that is identical to that of the Gradient Descent Ascent (GDA) method when derived naively. However,…

Optimization and Control · Mathematics 2023-08-01 Tatjana Chavdarova , Michael I. Jordan , Manolis Zampetakis

On the Last-Iterate Convergence of Shuffling Gradient Methods

Shuffling gradient methods are widely used in modern machine learning tasks and include three popular implementations: Random Reshuffle (RR), Shuffle Once (SO), and Incremental Gradient (IG). Compared to the empirical success, the…

Machine Learning · Computer Science 2024-06-07 Zijian Liu , Zhengyuan Zhou

Exact convergence rate of the last iterate in subgradient methods

We study the convergence of the last iterate in subgradient methods applied to the minimization of a nonsmooth convex function with bounded subgradients. We first introduce a proof technique that generalizes the standard analysis of…

Optimization and Control · Mathematics 2023-07-24 Moslem Zamani , François Glineur

Accelerating Ill-Conditioned Low-Rank Matrix Estimation via Scaled Gradient Descent

Low-rank matrix estimation is a canonical problem that finds numerous applications in signal processing, machine learning and imaging science. A popular approach in practice is to factorize the matrix into two compact low-rank factors, and…

Machine Learning · Computer Science 2021-06-16 Tian Tong , Cong Ma , Yuejie Chi

Last-iterate convergence analysis of stochastic momentum methods for neural networks

The stochastic momentum method is a commonly used acceleration technique for solving large-scale stochastic optimization problems in artificial neural networks. Current convergence results of stochastic momentum methods under non-convex…

Optimization and Control · Mathematics 2023-01-26 Dongpo Xu , Jinlan Liu , Yinghua Lu , Jun Kong , Danilo Mandic

Stochastic Hamiltonian Gradient Methods for Smooth Games

The success of adversarial formulations in machine learning has brought renewed motivation for smooth games. In this work, we focus on the class of stochastic Hamiltonian methods and provide the first convergence guarantees for certain…

Machine Learning · Computer Science 2020-07-09 Nicolas Loizou , Hugo Berard , Alexia Jolicoeur-Martineau , Pascal Vincent , Simon Lacoste-Julien , Ioannis Mitliagkas

First-Order Algorithms for Min-Max Optimization in Geodesic Metric Spaces

From optimal transport to robust dimensionality reduction, a plethora of machine learning applications can be cast into the min-max optimization problems over Riemannian manifolds. Though many min-max algorithms have been analyzed in the…

Optimization and Control · Mathematics 2022-09-29 Michael I. Jordan , Tianyi Lin , Emmanouil-Vasileios Vlatakis-Gkaragkounis