Related papers: Adaptive Proximal Gradient Method for Convex Optim…

On the Convergence and Complexity of Proximal Gradient and Accelerated Proximal Gradient Methods under Adaptive Gradient Estimation

In this paper, we propose a proximal gradient method and an accelerated proximal gradient method for solving composite optimization problems, where the objective function is the sum of a smooth and a convex, possibly nonsmooth, function. We…

Optimization and Control · Mathematics 2025-07-22 Raghu Bollapragada, Shagun Gupta

An adaptive framework for first-order gradient methods

Gradient methods are widely used in optimization problems. In practice, while the smoothness parameter can be estimated utilizing techniques such as backtracking, estimating the strong convexity parameter remains a challenge; moreover, even…

Optimization and Control · Mathematics 2026-02-17 Xiaozhe Hu, Sara Pollock, Zhongqin Xue, Yunrong Zhu

Universal Adaptive Proximal Gradient Methods via Gradient Mapping Accumulation

We propose an adaptive proximal gradient method for minimizing the sum of two functions, where one is a simple convex function, and the other belongs to one of the three classes: nonconvex smooth, convex nonsmooth, or convex smooth. The key…

Optimization and Control · Mathematics 2026-05-08 Zimeng Wang, Alp Yurtsever

First-Order Methods for Convex Optimization

First-order methods for solving convex optimization problems have been at the forefront of mathematical optimization in the last 20 years. The rapid development of this important class of algorithms is motivated by the success stories…

Optimization and Control · Mathematics 2021-01-07 Pavel Dvurechensky, Mathias Staudigl, Shimrit Shtern

Adaptive Gradient Descent without Descent

We present a strikingly simple proof that two rules are sufficient to automate gradient descent: 1) don't increase the stepsize too fast and 2) don't overstep the local curvature. No need for functional values, no line search, no…

Optimization and Control · Mathematics 2020-08-18 Yura Malitsky, Konstantin Mishchenko

Convergence Rates of Inexact Proximal-Gradient Methods for Convex Optimization

We consider the problem of optimizing the sum of a smooth convex function and a non-smooth convex function using proximal-gradient methods, where an error is present in the calculation of the gradient of the smooth term or in the proximity…

Machine Learning · Computer Science 2011-12-02 Mark Schmidt, Nicolas Le Roux, Francis Bach

Inexact proximal $\epsilon$-subgradient methods for composite convex optimization problems

We present two approximate versions of the proximal subgradient method for minimizing the sum of two convex functions (not necessarily differentiable). The algorithms involve, at each iteration, inexact evaluations of the proximal operator…

Optimization and Control · Mathematics 2019-07-12 Reinier Díaz Millán, Majela Pentón Machado

Adaptive Gradient Descent for Convex and Non-Convex Stochastic Optimization

In this paper we propose several adaptive gradient methods for stochastic optimization. Unlike AdaGrad-type methods, our algorithms are based on Armijo-type line search and they simultaneously adapt to the unknown Lipschitz constant of…

Optimization and Control · Mathematics 2020-06-15 Darina Dvinskikh, Aleksandr Ogaltsov, Alexander Gasnikov, Pavel Dvurechensky, Alexander Tyurin, Vladimir Spokoiny

Proximal gradient-type method with generalized distance and convergence analysis without global descent lemma

We consider solving nonconvex composite optimization problems in which the sum of a smooth function and a nonsmooth function is minimized. Many convergence analyses of proximal gradient-type methods rely on a global descent property…

Optimization and Control · Mathematics 2026-04-09 Shotaro Yagishita, Masaru Ito

Adaptive Accelerated Gradient Descent Methods for Convex Optimization

This work proposes A$^2$GD, a novel adaptive accelerated gradient descent method for convex and composite optimization. Smoothness and convexity constants are updated via Lyapunov analysis. Inspired by stability analysis in ODE solvers, the…

Optimization and Control · Mathematics 2026-02-10 Zeyi Xu, Long Chen

Efficient Inexact Proximal Gradient Algorithm for Nonconvex Problems

The proximal gradient algorithm has been popularly used for convex optimization. Recently, it has also been extended for nonconvex problems, and the current state-of-the-art is the nonmonotone accelerated proximal gradient algorithm.…

Optimization and Control · Mathematics 2017-05-24 Quanming Yao, James T. Kwok, Fei Gao, Wei Chen, Tie-Yan Liu

Adaptive Restart of the Optimized Gradient Method for Convex Optimization

First-order methods with momentum such as Nesterov's fast gradient method are very useful for convex optimization problems, but can exhibit undesirable oscillations yielding slow convergence rates for some applications. An adaptive…

Optimization and Control · Mathematics 2019-06-14 Donghwan Kim, Jeffrey A. Fessler

Gradient Descent Methods for Regularized Optimization

Regularization is a widely recognized technique in mathematical optimization. It can be used to smooth out objective functions, refine the feasible solution set, or prevent overfitting in machine learning models. Due to its simplicity and…

Optimization and Control · Mathematics 2024-12-31 Filip Nikolovski, Irena Stojkovska, Katerina Hadzi-Velkova Saneva, Zoran Hadzi-Velkov

Optimal Adaptive and Accelerated Stochastic Gradient Descent

Stochastic gradient descent (SGD) methods are the most powerful optimization tools in training machine learning and deep learning models. Moreover, acceleration (a.k.a. momentum) methods and diagonal scaling (a.k.a. adaptive…

Machine Learning · Statistics 2018-10-02 Qi Deng, Yi Cheng, Guanghui Lan

A Proximal Stochastic Gradient Method with Adaptive Step Size and Variance Reduction for Convex Composite Optimization

In this paper, we propose a proximal stochastic gradient algorithm (PSGA) for solving composite optimization problems by incorporating variance reduction techniques and an adaptive step-size strategy. In the PSGA method, the objective…

Optimization and Control · Mathematics 2026-04-06 Changjie Fang, Hao Yang, Shenglan Chen

Improved Last-Iterate Convergence of Shuffling Gradient Methods for Nonsmooth Convex Optimization

We study the convergence of the shuffling gradient method, a popular algorithm employed to minimize the finite-sum function with regularization, in which functions are passed to apply (Proximal) Gradient Descent (GD) one by one whose order…

Optimization and Control · Mathematics 2025-05-30 Zijian Liu, Zhengyuan Zhou

Adaptive First-and Zeroth-order Methods for Weakly Convex Stochastic Optimization Problems

In this paper, we design and analyze a new family of adaptive subgradient methods for solving an important class of weakly convex (possibly nonsmooth) stochastic optimization problems. Adaptive methods that use exponential moving averages…

Optimization and Control · Mathematics 2020-05-26 Parvin Nazari, Davoud Ataee Tarzanagh, George Michailidis

Optimizing the Efficiency of First-Order Methods for Decreasing the Gradient of Smooth Convex Functions

This paper optimizes the step coefficients of first-order methods for smooth convex minimization in terms of the worst-case convergence bound (i.e., efficiency) of the decrease in the gradient norm. This work is based on the performance…

Optimization and Control · Mathematics 2020-10-28 Donghwan Kim, Jeffrey A. Fessler

Random coordinate descent methods for nonseparable composite optimization

In this paper we consider large-scale composite optimization problems having the objective function formed as a sum of two terms (possibly nonconvex), one has (block) coordinate-wise Lipschitz continuous gradient and the other is…

Optimization and Control · Mathematics 2024-01-10 Flavia Chorobura, Ion Necoara

Adaptive proximal algorithms for convex optimization under local Lipschitz continuity of the gradient

Backtracking linesearch is the de facto approach for minimizing continuously differentiable functions with locally Lipschitz gradient. In recent years, it has been shown that in the convex setting it is possible to avoid linesearch…

Optimization and Control · Mathematics 2024-03-14 Puya Latafat, Andreas Themelis, Lorenzo Stella, Panagiotis Patrinos