Related papers: Last-iterate convergence rates for min-max optimiz…

Exact Coordinate Descent for High-Dimensional Regularized Huber Regression

We develop an exact coordinate descent algorithm for high-dimensional regularized Huber regression. In contrast to composite gradient descent methods, our algorithm fully exploits the advantages of coordinate descent when the underlying…

Methodology · Statistics 2025-10-16 Younghoon Kim , Po-Ling Loh , Sumanta Basu

Stochastic Gradient Descent in the Viewpoint of Graduated Optimization

Stochastic gradient descent (SGD) method is popular for solving non-convex optimization problems in machine learning. This work investigates SGD from a viewpoint of graduated optimization, which is a widely applied approach for non-convex…

Optimization and Control · Mathematics 2023-08-15 Da Li , Jingjing Wu , Qingrun Zhang

Exponentially Converging Distributed Gradient Descent with Intermittent Communication via Hybrid Methods

We present a hybrid systems framework for multi-agent optimization in which agents execute computations in continuous time and communicate in discrete time. The optimization algorithm is a hybrid version of parallelized coordinate descent.…

Optimization and Control · Mathematics 2021-10-04 Katherine Hendrickson , Dawn Hustig-Schultz , Matthew Hale , Ricardo G. Sanfelice

Global Convergence and Variance-Reduced Optimization for a Class of Nonconvex-Nonconcave Minimax Problems

Nonconvex minimax problems appear frequently in emerging machine learning applications, such as generative adversarial networks and adversarial learning. Simple algorithms such as the gradient descent ascent (GDA) are the common practice…

Optimization and Control · Mathematics 2020-02-25 Junchi Yang , Negar Kiyavash , Niao He

On the Convergence of Stochastic Gradient Descent with Low-Rank Projections for Convex Low-Rank Matrix Problems

We revisit the use of Stochastic Gradient Descent (SGD) for solving convex optimization problems that serve as highly popular convex relaxations for many important low-rank matrix recovery problems such as \textit{matrix completion},…

Machine Learning · Computer Science 2020-06-16 Dan Garber

Bias of Homotopic Gradient Descent for the Hinge Loss

Gradient descent is a simple and widely used optimization method for machine learning. For homogeneous linear classifiers applied to separable data, gradient descent has been shown to converge to the maximal margin (or equivalently, the…

Machine Learning · Statistics 2019-07-30 Denali Molitor , Deanna Needell , Rachel Ward

Stochastic Gradient Descent for Constrained Optimization based on Adaptive Relaxed Barrier Functions

This paper presents a novel stochastic gradient descent algorithm for constrained optimization. The proposed algorithm randomly samples constraints and components of the finite sum objective function and relies on a relaxed logarithmic…

Optimization and Control · Mathematics 2025-05-13 Naum Dimitrieski , Jing Cao , Christian Ebenbauer

Gradient-Based Non-Linear Inverse Learning

We study statistical inverse learning in the context of nonlinear inverse problems under random design. Specifically, we address a class of nonlinear problems by employing gradient descent (GD) and stochastic gradient descent (SGD) with…

Machine Learning · Statistics 2024-12-24 Abhishake , Nicole Mücke , Tapio Helin

On fast convergence rates for generalized conditional gradient methods with backtracking stepsize

A generalized conditional gradient method for minimizing the sum of two convex functions, one of them differentiable, is presented. This iterative method relies on two main ingredients: First, the minimization of a partially linearized…

Optimization and Control · Mathematics 2021-10-01 Karl Kunisch , Daniel Walter

Beyond Convexity -- Contraction and Global Convergence of Gradient Descent

This paper considers the analysis of continuous time gradient-based optimization algorithms through the lens of nonlinear contraction theory. It demonstrates that in the case of a time-invariant objective, most elementary results on…

Optimization and Control · Mathematics 2022-12-23 Patrick M. Wensing , Jean-Jacques E. Slotine

Higher-order tensor methods for minimizing difference of convex functions

Higher-order tensor methods were recently proposed for minimizing smooth convex and nonconvex functions. Higher-order algorithms accelerate the convergence of the classical first-order methods thanks to the higher-order derivatives used in…

Optimization and Control · Mathematics 2024-01-11 Ion Necoara

An Algebraically Converging Stochastic Gradient Descent Algorithm for Global Optimization

We propose a new gradient descent algorithm with added stochastic terms for finding the global optimizers of nonconvex optimization problems. A key component in the algorithm is the adaptive tuning of the randomness based on the value of…

Optimization and Control · Mathematics 2025-06-16 Björn Engquist , Kui Ren , Yunan Yang

Distributed Gradient Methods for Nonconvex Optimization: Local and Global Convergence Guarantees

The article discusses distributed gradient-descent algorithms for computing local and global minima in nonconvex optimization. For local optimization, we focus on distributed stochastic gradient descent (D-SGD)--a simple network-based…

Optimization and Control · Mathematics 2020-09-17 Brian Swenson , Soummya Kar , H. Vincent Poor , José M. F. Moura , Aaron Jaech

Accelerated Gradient Methods for Networked Optimization

We develop multi-step gradient methods for network-constrained optimization of strongly convex functions with Lipschitz-continuous gradients. Given the topology of the underlying network and bounds on the Hessian of the objective function,…

Optimization and Control · Mathematics 2015-06-12 Euhanna Ghadimi , Iman Shames , Mikael Johansson

On the Almost Sure Convergence of Stochastic Gradient Descent in Non-Convex Problems

This paper analyzes the trajectories of stochastic gradient descent (SGD) to help understand the algorithm's convergence properties in non-convex problems. We first show that the sequence of iterates generated by SGD remains bounded and…

Optimization and Control · Mathematics 2020-06-22 Panayotis Mertikopoulos , Nadav Hallak , Ali Kavis , Volkan Cevher

Global Convergence of Non-Convex Gradient Descent for Computing Matrix Squareroot

While there has been a significant amount of work studying gradient descent techniques for non-convex optimization problems over the last few years, all existing results establish either local convergence with good rates or global…

Numerical Analysis · Mathematics 2017-03-10 Prateek Jain , Chi Jin , Sham M. Kakade , Praneeth Netrapalli

Surpassing Gradient Descent Provably: A Cyclic Incremental Method with Linear Convergence Rate

Recently, there has been growing interest in developing optimization methods for solving large-scale machine learning problems. Most of these problems boil down to the problem of minimizing an average of a finite set of smooth and strongly…

Optimization and Control · Mathematics 2018-02-09 Aryan Mokhtari , Mert Gürbüzbalaban , Alejandro Ribeiro

Stochastic Subspace Descent

We present two stochastic descent algorithms that apply to unconstrained optimization and are particularly efficient when the objective function is slow to evaluate and gradients are not easily obtained, as in some PDE-constrained…

Optimization and Control · Mathematics 2019-04-30 David Kozak , Stephen Becker , Alireza Doostan , Luis Tenorio

Tackling benign nonconvexity with smoothing and stochastic gradients

Non-convex optimization problems are ubiquitous in machine learning, especially in Deep Learning. While such complex problems can often be successfully optimized in practice by using stochastic gradient descent (SGD), theoretical analysis…

Machine Learning · Computer Science 2022-02-21 Harsh Vardhan , Sebastian U. Stich

Primal-Dual Damping algorithms for optimization

We propose an unconstrained optimization method based on the well-known primal-dual hybrid gradient (PDHG) algorithm. We first formulate the optimality condition of the unconstrained optimization problem as a saddle point problem. We then…

Optimization and Control · Mathematics 2024-08-29 X. Zuo , S. Osher , W. Li