Related papers: Last-iterate convergence rates for min-max optimiz…

Generalization to the Natural Gradient Descent

Optimization problem, which is aimed at finding the global minimal value of a given cost function, is one of the central problem in science and engineering. Various numerical methods have been proposed to solve this problem, among which the…

Optimization and Control · Mathematics 2022-10-07 Shaojun Dong , Fengyu Le , Meng Zhang , Si-Jing Tao , Chao Wang , Yong-Jian Han , Guo-Ping Guo

Learning Operators with Stochastic Gradient Descent in General Hilbert Spaces

This study investigates leveraging stochastic gradient descent (SGD) to learn operators between general Hilbert spaces. We propose weak and strong regularity conditions for the target operator to depict its intrinsic structure and…

Machine Learning · Statistics 2026-01-13 Lei Shi , Jia-Qi Yang

Dual Acceleration for Minimax Optimization: Linear Convergence Under Relaxed Assumptions

This paper addresses the bilinearly coupled minimax optimization problem: $\min_{x \in \mathbb{R}^{d_x}}\max_{y \in \mathbb{R}^{d_y}} \ f_1(x) + f_2(x) + y^{\top} Bx - g_1(y) - g_2(y)$, where $f_1$ and $g_1$ are smooth convex functions,…

Optimization and Control · Mathematics 2025-05-27 Jingwang Li , Xiao Li

Gradient Descent Averaging and Primal-dual Averaging for Strongly Convex Optimization

Averaging scheme has attracted extensive attention in deep learning as well as traditional machine learning. It achieves theoretically optimal convergence and also improves the empirical model performance. However, there is still a lack of…

Machine Learning · Computer Science 2021-01-19 Wei Tao , Wei Li , Zhisong Pan , Qing Tao

Adaptive Conditional Gradient Descent

Selecting an effective step-size is a fundamental challenge in first-order optimization, especially for problems with non-Euclidean geometries. This paper presents a novel adaptive step-size strategy for optimization algorithms that rely on…

Optimization and Control · Mathematics 2025-10-14 Abbas Khademi , Antonio Silveti-Falls

On the convergence of stochastic primal-dual hybrid gradient

In this paper, we analyze the recently proposed stochastic primal-dual hybrid gradient (SPDHG) algorithm and provide new theoretical results. In particular, we prove almost sure convergence of the iterates to a solution with convexity and…

Optimization and Control · Mathematics 2022-06-23 Ahmet Alacaoglu , Olivier Fercoq , Volkan Cevher

Convergence and Trade-Offs in Riemannian Gradient Descent and Riemannian Proximal Point

In this work, we analyze two of the most fundamental algorithms in geodesically convex optimization: Riemannian gradient descent and (possibly inexact) Riemannian proximal point. We quantify their rates of convergence and produce different…

Optimization and Control · Mathematics 2024-03-18 David Martínez-Rubio , Christophe Roux , Sebastian Pokutta

Stability and Generalization of Stochastic Gradient Methods for Minimax Problems

Many machine learning problems can be formulated as minimax problems such as Generative Adversarial Networks (GANs), AUC maximization and robust estimation, to mention but a few. A substantial amount of studies are devoted to studying the…

Machine Learning · Computer Science 2021-07-14 Yunwen Lei , Zhenhuan Yang , Tianbao Yang , Yiming Ying

Convergence of an Asynchronous Block-Coordinate Forward-Backward Algorithm for Convex Composite Optimization

In this paper, we study the convergence properties of a randomized block-coordinate descent algorithm for the minimization of a composite convex objective function, where the block-coordinates are updated asynchronously and randomly…

Optimization and Control · Mathematics 2023-04-14 Cheik Traoré , Saverio Salzo , Silvia Villa

Last-iterate Convergence Separation between Extra-gradient and Optimism in Constrained Periodic Games

Last-iterate behaviors of learning algorithms in repeated two-player zero-sum games have been extensively studied due to their wide applications in machine learning and related tasks. Typical algorithms that exhibit the last-iterate…

Machine Learning · Computer Science 2024-06-18 Yi Feng , Ping Li , Ioannis Panageas , Xiao Wang

Gradient Descent with Provably Tuned Learning-rate Schedules

Gradient-based iterative optimization methods are the workhorse of modern machine learning. They crucially rely on careful tuning of parameters like learning rate and momentum. However, one typically sets them using heuristic approaches…

Machine Learning · Computer Science 2025-12-05 Dravyansh Sharma

Efficient Dictionary Learning with Gradient Descent

Randomly initialized first-order optimization algorithms are the method of choice for solving many high-dimensional nonconvex problems in machine learning, yet general theoretical guarantees cannot rule out convergence to critical points of…

Optimization and Control · Mathematics 2018-09-28 Dar Gilboa , Sam Buchanan , John Wright

On convergence of a $q$-random coordinate constrained algorithm for non-convex problems

We propose a random coordinate descent algorithm for optimizing a non-convex objective function subject to one linear constraint and simple bounds on the variables. Although it is common use to update only two random coordinates…

Optimization and Control · Mathematics 2024-08-27 Alireza Ghaffari-Hadigheh , Lennart Sinjorgo , Renata Sotirov

Convergence property of the Quantized Distributed Gradient descent with constant stepsizes and an effective strategy for the stepsize selection

In this paper, we establish new convergence results for the quantized distributed gradient descent and suggest a novel strategy of choosing the stepsizes for the high-performance of the algorithm. Under the strongly convexity assumption on…

Optimization and Control · Mathematics 2023-07-03 Woocheol Choi , Myeong-Su Lee

Linear Convergence of Stochastic Iterative Greedy Algorithms with Sparse Constraints

Motivated by recent work on stochastic gradient descent methods, we develop two stochastic variants of greedy algorithms for possibly non-convex optimization problems with sparsity constraints. We prove linear convergence in expectation to…

Numerical Analysis · Mathematics 2014-07-02 Nam Nguyen , Deanna Needell , Tina Woolf

Continuous-time Lower Bounds for Gradient-based Algorithms

This article derives lower bounds on the convergence rate of continuous-time gradient-based optimization algorithms. The algorithms are subjected to a time-normalization constraint that avoids a reparametrization of time in order to make…

Optimization and Control · Mathematics 2020-08-04 Michael Muehlebach , Michael I. Jordan

The Convergence of Stochastic Gradient Descent in Asynchronous Shared Memory

Stochastic Gradient Descent (SGD) is a fundamental algorithm in machine learning, representing the optimization backbone for training several classic models, from regression to neural networks. Given the recent practical focus on…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-06-25 Dan Alistarh , Christopher De Sa , Nikola Konstantinov

General Proximal Incremental Aggregated Gradient Algorithms: Better and Novel Results under General Scheme

The incremental aggregated gradient algorithm is popular in network optimization and machine learning research. However, the current convergence results require the objective function to be strongly convex. And the existing convergence…

Optimization and Control · Mathematics 2019-10-14 Tao Sun , Yuejiao Sun , Dongsheng Li , Qing Liao

A Single-Loop Smoothed Gradient Descent-Ascent Algorithm for Nonconvex-Concave Min-Max Problems

Nonconvex-concave min-max problem arises in many machine learning applications including minimizing a pointwise maximum of a set of nonconvex functions and robust adversarial training of neural networks. A popular approach to solve this…

Optimization and Control · Mathematics 2025-03-21 Jiawei Zhang , Peijun Xiao , Ruoyu Sun , Zhi-Quan Luo

Gradient Descent on Infinitely Wide Neural Networks: Global Convergence and Generalization

Many supervised machine learning methods are naturally cast as optimization problems. For prediction models which are linear in their parameters, this often leads to convex problems for which many mathematical guarantees exist. Models which…

Machine Learning · Computer Science 2021-10-18 Francis Bach , Lenaïc Chizat