Related papers: Nesterov-aided Stochastic Gradient Methods using L…

Variance-reduced first-order methods for deterministically constrained stochastic nonconvex optimization with strong convergence guarantees

In this paper, we study a class of deterministically constrained stochastic optimization problems. Existing methods typically aim to find an $\epsilon$-stochastic stationary point, where the expected violations of both constraints and…

Optimization and Control · Mathematics 2025-09-03 Zhaosong Lu , Sanyou Mei , Yifeng Xiao

Estimate Sequences for Variance-Reduced Stochastic Composite Optimization

In this paper, we propose a unified view of gradient-based algorithms for stochastic convex composite optimization by extending the concept of estimate sequence introduced by Nesterov. This point of view covers the stochastic gradient…

Machine Learning · Statistics 2019-05-08 Andrei Kulunchakov , Julien Mairal

Multilevel Stochastic Gradient Descent for Optimal Control Under Uncertainty

We present a multilevel stochastic gradient descent method for the optimal control of systems governed by partial differential equations under uncertain input data. The gradient descent method used to find the optimal control leverages a…

Optimization and Control · Mathematics 2025-06-04 Niklas Baumgarten , David Schneiderhan

Adaptive Restart of the Optimized Gradient Method for Convex Optimization

First-order methods with momentum such as Nesterov's fast gradient method are very useful for convex optimization problems, but can exhibit undesirable oscillations yielding slow convergence rates for some applications. An adaptive…

Optimization and Control · Mathematics 2019-06-14 Donghwan Kim , Jeffrey A. Fessler

Adaptive Single-Loop Methods for Stochastic Minimax Optimization on Riemannian Manifolds

Stochastic minimax optimization on Riemannian manifolds has recently attracted significant attention due to its broad range of applications, such as robust training of neural networks and robust maximum likelihood estimation. Existing…

Optimization and Control · Mathematics 2026-02-11 Hongye Wang , Chang He , Bo Jiang

Fast Bayesian experimental design: Laplace-based importance sampling for the expected information gain

In calculating expected information gain in optimal Bayesian experimental design, the computation of the inner loop in the classical double-loop Monte Carlo requires a large number of samples and suffers from underflow if the number of…

Numerical Analysis · Mathematics 2018-04-04 Joakim Beck , Ben Mansour Dia , Luis FR Espath , Quan Long , Raul Tempone

Biased Stochastic First-Order Methods for Conditional Stochastic Optimization and Applications in Meta Learning

Conditional stochastic optimization covers a variety of applications ranging from invariant learning and causal inference to meta-learning. However, constructing unbiased gradient estimators for such problems is challenging due to the…

Optimization and Control · Mathematics 2024-06-04 Yifan Hu , Siqi Zhang , Xin Chen , Niao He

Approximate Bayesian Inference for Structural Equation Models using Integrated Nested Laplace Approximations

Markov chain Monte Carlo (MCMC) methods remain the mainstay of Bayesian estimation of structural equation models (SEM), though they often incur a high computational cost. We present a bespoke approximate Bayesian approach to SEM, drawing on…

Methodology · Statistics 2026-05-20 Haziq Jamil , Håvard Rue

A geometric alternative to Nesterov's accelerated gradient descent

We propose a new method for unconstrained optimization of a smooth and strongly convex function, which attains the optimal rate of convergence of Nesterov's accelerated gradient descent. The new algorithm has a simple geometric…

Optimization and Control · Mathematics 2015-06-30 Sébastien Bubeck , Yin Tat Lee , Mohit Singh

Multi-level Monte-Carlo Gradient Methods for Stochastic Optimization with Biased Oracles

We consider stochastic optimization when one only has access to biased stochastic oracles of the objective and the gradient, and obtaining stochastic gradients with low biases comes at high costs. This setting captures various optimization…

Optimization and Control · Mathematics 2024-08-22 Yifan Hu , Jie Wang , Xin Chen , Niao He

Decentralized Stochastic Gradient Descent Ascent for Finite-Sum Minimax Problems

Minimax optimization problems have attracted significant attention in recent years due to their widespread application in numerous machine learning models. To solve the minimax problem, a wide variety of stochastic optimization methods have…

Machine Learning · Computer Science 2024-06-12 Hongchang Gao

Fast Margin Maximization via Dual Acceleration

We present and analyze a momentum-based gradient method for training linear classifiers with an exponentially-tailed loss (e.g., the exponential or logistic loss), which maximizes the classification margin on separable data at a rate of…

Machine Learning · Computer Science 2021-08-24 Ziwei Ji , Nathan Srebro , Matus Telgarsky

Stochastic Bias-Reduced Gradient Methods

We develop a new primitive for stochastic optimization: a low-bias, low-cost estimator of the minimizer $x_\star$ of any Lipschitz strongly-convex function. In particular, we use a multilevel Monte-Carlo approach due to Blanchet and Glynn…

Optimization and Control · Mathematics 2021-10-29 Hilal Asi , Yair Carmon , Arun Jambulapati , Yujia Jin , Aaron Sidford

Accelerated First-Order Methods: Differential Equations and Lyapunov Functions

We develop a theory of accelerated first-order optimization from the viewpoint of differential equations and Lyapunov functions. Building upon the previous work of many researchers, we consider differential equations which model the…

Optimization and Control · Mathematics 2021-04-02 Jonathan W. Siegel

Approximating Hessian matrices using Bayesian inference: a new approach for quasi-Newton methods in stochastic optimization

Using quasi-Newton methods in stochastic optimization is not a trivial task given the difficulty of extracting curvature information from the noisy gradients. Moreover, pre-conditioning noisy gradient observations tend to amplify the noise.…

Optimization and Control · Mathematics 2024-04-02 Andre Carlon , Luis Espath , Raul Tempone

A Variational Perspective on Accelerated Methods in Optimization

Accelerated gradient methods play a central role in optimization, achieving optimal rates in many settings. While many generalizations and extensions of Nesterov's original acceleration method have been proposed, it is not yet clear what is…

Optimization and Control · Mathematics 2022-06-08 Andre Wibisono , Ashia C. Wilson , Michael I. Jordan

Randomized Subspace Nesterov Accelerated Gradient

Randomized-subspace methods reduce the cost of first-order optimization by using only low-dimensional projected-gradient information, a feature that is attractive in forward-mode automatic differentiation and communication-limited settings.…

Optimization and Control · Mathematics 2026-05-04 Gaku Omiya , Pierre-Louis Poirion , Akiko Takeda

A stochastic gradient method for a class of nonlinear PDE-constrained optimal control problems under uncertainty

The study of optimal control problems under uncertainty plays an important role in scientific numerical simulations. This class of optimization problems is strongly utilized in engineering, biology and finance. In this paper, a stochastic…

Optimization and Control · Mathematics 2023-04-06 Caroline Geiersbach , Teresa Scarinci

On the Convergence and Complexity of the Stochastic Central Finite-Difference Based Gradient Estimation Methods

This paper presents an algorithmic framework for solving unconstrained stochastic optimization problems using only stochastic function evaluations. We employ central finite-difference based gradient estimation methods to approximate the…

Optimization and Control · Mathematics 2025-01-14 Raghu Bollapragada , Cem Karamanli

Stochastic Gradient Descent in the Viewpoint of Graduated Optimization

Stochastic gradient descent (SGD) method is popular for solving non-convex optimization problems in machine learning. This work investigates SGD from a viewpoint of graduated optimization, which is a widely applied approach for non-convex…

Optimization and Control · Mathematics 2023-08-15 Da Li , Jingjing Wu , Qingrun Zhang