Related papers: Accelerated Parameter-Free Stochastic Optimization

Accelerated Distance-adaptive Methods for H\"{o}lder Smooth and Convex Optimization

This paper introduces new parameter-free first-order methods for convex optimization problems in which the objective function exhibits H\"{o}lder smoothness. Inspired by the recently proposed distance-over-gradient (DOG) technique, we…

Optimization and Control · Mathematics 2025-10-28 Yijin Ren , Haifeng Xu , Qi Deng

UniXGrad: A Universal, Adaptive Algorithm with Optimal Guarantees for Constrained Optimization

We propose a novel adaptive, accelerated algorithm for the stochastic constrained convex optimization setting. Our method, which is inspired by the Mirror-Prox method, \emph{simultaneously} achieves the optimal rates for smooth/non-smooth…

Optimization and Control · Mathematics 2019-10-31 Ali Kavis , Kfir Y. Levy , Francis Bach , Volkan Cevher

A Universally Optimal Multistage Accelerated Stochastic Gradient Method

We study the problem of minimizing a strongly convex, smooth function when we have noisy estimates of its gradient. We propose a novel multistage accelerated algorithm that is universally optimal in the sense that it achieves the optimal…

Optimization and Control · Mathematics 2019-10-29 Necdet Serhat Aybat , Alireza Fallah , Mert Gurbuzbalaban , Asuman Ozdaglar

An Accelerated Method for Derivative-Free Smooth Stochastic Convex Optimization

We consider an unconstrained problem of minimizing a smooth convex function which is only available through noisy observations of its values, the noise consisting of two parts. Similar to stochastic optimization problems, the first part is…

Optimization and Control · Mathematics 2020-09-22 Eduard Gorbunov , Pavel Dvurechensky , Alexander Gasnikov

High Probability Convergence of Stochastic Gradient Methods

In this work, we describe a generic approach to show convergence with high probability for both stochastic convex and non-convex optimization with sub-Gaussian noise. In previous works for convex optimization, either the convergence is only…

Optimization and Control · Mathematics 2023-03-01 Zijian Liu , Ta Duy Nguyen , Thien Hang Nguyen , Alina Ene , Huy Lê Nguyen

Stochastic smoothing accelerated gradient method for general constrained nonsmooth convex composite optimization

We propose a novel stochastic smoothing accelerated gradient (SSAG) method for general constrained nonsmooth convex composite optimization, and analyze the convergence rates. The SSAG method allows various smoothing techniques, and can deal…

Optimization and Control · Mathematics 2026-02-03 Ruyu Wang , Chao Zhang

$\mu^2$-SGD: Stable Stochastic Optimization via a Double Momentum Mechanism

We consider stochastic convex optimization problems where the objective is an expectation over smooth functions. For this setting we suggest a novel gradient estimate that combines two recent mechanism that are related to notion of…

Machine Learning · Computer Science 2025-03-06 Tehila Dahan , Kfir Y. Levy

Dealing with unbounded gradients in stochastic saddle-point optimization

We study the performance of stochastic first-order methods for finding saddle points of convex-concave functions. A notorious challenge faced by such methods is that the gradients can grow arbitrarily large during optimization, which may…

Machine Learning · Computer Science 2024-06-10 Gergely Neu , Nneka Okolo

A Stochastic Gradient Method with Biased Estimation for Faster Nonconvex Optimization

A number of optimization approaches have been proposed for optimizing nonconvex objectives (e.g. deep learning models), such as batch gradient descent, stochastic gradient descent and stochastic variance reduced gradient descent. Theory…

Machine Learning · Computer Science 2019-05-15 Jia Bi , Steve R. Gunn

A Zeroth-order Proximal Stochastic Gradient Method for Weakly Convex Stochastic Optimization

In this paper we analyze a zeroth-order proximal stochastic gradient method suitable for the minimization of weakly convex stochastic optimization problems. We consider nonsmooth and nonlinear stochastic composite problems, for which…

Optimization and Control · Mathematics 2025-04-21 Spyridon Pougkakiotis , Dionysios S. Kalogerias

Stochastic Auto-conditioned Fast Gradient Methods with Optimal Rates

Achieving optimal rates for stochastic composite convex optimization without prior knowledge of problem parameters remains a central challenge. In the deterministic setting, the auto-conditioned fast gradient method has recently been…

Optimization and Control · Mathematics 2026-04-15 Yao Ji , Guanghui Lan

Optimized convergence of stochastic gradient descent by weighted averaging

Under mild assumptions stochastic gradient methods asymptotically achieve an optimal rate of convergence if the arithmetic mean of all iterates is returned as an approximate optimal solution. However, in the absence of stochastic noise, the…

Optimization and Control · Mathematics 2022-10-06 Melinda Hagedorn , Florian Jarre

DoWG Unleashed: An Efficient Universal Parameter-Free Gradient Descent Method

This paper proposes a new easy-to-implement parameter-free gradient-based optimizer: DoWG (Distance over Weighted Gradients). We prove that DoWG is efficient -- matching the convergence rate of optimally tuned gradient descent in convex…

Machine Learning · Computer Science 2024-01-31 Ahmed Khaled , Konstantin Mishchenko , Chi Jin

Tuning-Free Stochastic Optimization

Large-scale machine learning problems make the cost of hyperparameter tuning ever more prohibitive. This creates a need for algorithms that can tune themselves on-the-fly. We formalize the notion of "tuning-free" algorithms that can match…

Optimization and Control · Mathematics 2024-03-20 Ahmed Khaled , Chi Jin

DoG is SGD's Best Friend: A Parameter-Free Dynamic Step Size Schedule

We propose a tuning-free dynamic SGD step size formula, which we call Distance over Gradients (DoG). The DoG step sizes depend on simple empirical quantities (distance from the initial point and norms of gradients) and have no ``learning…

Machine Learning · Computer Science 2023-07-18 Maor Ivgi , Oliver Hinder , Yair Carmon

Stochastic Conditional Gradient Methods: From Convex Minimization to Submodular Maximization

This paper considers stochastic optimization problems for a large class of objective functions, including convex and continuous submodular. Stochastic proximal gradient methods have been widely used to solve such problems; however, their…

Optimization and Control · Mathematics 2018-11-13 Aryan Mokhtari , Hamed Hassani , Amin Karbasi

Stochastic Proximal Methods for Non-Smooth Non-Convex Constrained Sparse Optimization

This paper focuses on stochastic proximal gradient methods for optimizing a smooth non-convex loss function with a non-smooth non-convex regularizer and convex constraints. To the best of our knowledge we present the first non-asymptotic…

Optimization and Control · Mathematics 2019-05-27 Michael R. Metel , Akiko Takeda

The Sample Complexity of Parameter-Free Stochastic Convex Optimization

We study the sample complexity of stochastic convex optimization when problem parameters, e.g., the distance to optimality, are unknown. We pursue two strategies. First, we develop a reliable model selection method that avoids overfitting…

Machine Learning · Computer Science 2025-06-16 Jared Lawrence , Ari Kalinsky , Hannah Bradfield , Yair Carmon , Oliver Hinder

Inexact-Proximal Accelerated Gradient Method for Stochastic Nonconvex Constrained Optimization Problems

Stochastic nonconvex optimization problems with nonlinear constraints have a broad range of applications in intelligent transportation, cyber-security, and smart grids. In this paper, first, we propose an inexact-proximal accelerated…

Optimization and Control · Mathematics 2021-07-08 Morteza Boroun , Afrooz Jalilzadeh

A Simple Proximal Stochastic Gradient Method for Nonsmooth Nonconvex Optimization

We analyze stochastic gradient algorithms for optimizing nonconvex, nonsmooth finite-sum problems. In particular, the objective function is given by the summation of a differentiable (possibly nonconvex) component, together with a possibly…

Optimization and Control · Mathematics 2018-12-04 Zhize Li , Jian Li