Related papers: Error analysis for stochastic gradient optimizatio…

Backward error analysis and the qualitative behaviour of stochastic optimization algorithms: Application to stochastic coordinate descent

Stochastic optimization methods have been hugely successful in making large-scale optimization problems feasible when computing the full gradient is computationally prohibitive. Using the theory of modified equations for numerical…

Optimization and Control · Mathematics 2023-09-06 Stefano Di Giovacchino , Desmond J. Higham , Konstantinos Zygalakis

Stochastic modified equations and adaptive stochastic gradient algorithms

We develop the method of stochastic modified equations (SME), in which stochastic gradient algorithms are approximated in the weak sense by continuous-time stochastic differential equations. We exploit the continuous formulation together…

Machine Learning · Computer Science 2017-06-21 Qianxiao Li , Cheng Tai , Weinan E

General convergence analysis of stochastic first order methods for composite optimization

In this paper we consider stochastic composite convex optimization problems with the objective function satisfying a stochastic bounded gradient condition, with or without a quadratic functional growth property. These models include the…

Optimization and Control · Mathematics 2020-03-10 Ion Necoara

Inexact Sequential Quadratic Optimization for Minimizing a Stochastic Objective Function Subject to Deterministic Nonlinear Equality Constraints

An algorithm is proposed, analyzed, and tested experimentally for solving stochastic optimization problems in which the decision variables are constrained to satisfy equations defined by deterministic, smooth, and nonlinear functions. It is…

Optimization and Control · Mathematics 2021-07-09 Frank E. Curtis , Daniel P. Robinson , Baoyu Zhou

Stochastic subgradient for composite optimization with functional constraints

In this paper we consider convex optimization problems with stochastic composite objective function subject to (possibly) infinite intersection of constraints. The objective function is expressed in terms of expectation operator over a sum…

Optimization and Control · Mathematics 2024-12-03 Ion Necoara , Nitesh Kumar Singh

Weak error analysis for stochastic gradient descent optimization algorithms

Stochastic gradient descent (SGD) type optimization schemes are fundamental ingredients in a large number of machine learning based algorithms. In particular, SGD type optimization schemes are frequently employed in applications involving…

Numerical Analysis · Mathematics 2020-07-22 Aritz Bercher , Lukas Gonon , Arnulf Jentzen , Diyora Salimova

Stochastic Optimization with Variance Reduction for Infinite Datasets with Finite-Sum Structure

Stochastic optimization algorithms with variance reduction have proven successful for minimizing large finite sums of functions. Unfortunately, these techniques are unable to deal with stochastic perturbations of input data, induced for…

Machine Learning · Statistics 2017-11-16 Alberto Bietti , Julien Mairal

On Stochastic Gradient and Subgradient Methods with Adaptive Steplength Sequences

The performance of standard stochastic approximation implementations can vary significantly based on the choice of the steplength sequence, and in general, little guidance is provided about good choices. Motivated by this gap, in the first…

Optimization and Control · Mathematics 2015-03-19 Farzad Yousefian , Angelia Nedić , Uday V. Shanbhag

Stochastic Gradient Descent for Constrained Optimization based on Adaptive Relaxed Barrier Functions

This paper presents a novel stochastic gradient descent algorithm for constrained optimization. The proposed algorithm randomly samples constraints and components of the finite sum objective function and relies on a relaxed logarithmic…

Optimization and Control · Mathematics 2025-05-13 Naum Dimitrieski , Jing Cao , Christian Ebenbauer

Sequential Quadratic Optimization for Stochastic Optimization with Deterministic Nonlinear Inequality and Equality Constraints

A sequential quadratic optimization algorithm for minimizing an objective function defined by an expectation subject to nonlinear inequality and equality constraints is proposed, analyzed, and tested. The context of interest is when it is…

Optimization and Control · Mathematics 2023-03-01 Frank E. Curtis , Daniel P. Robinson , Baoyu Zhou

A Stochastic Gradient Method with Mesh Refinement for PDE Constrained Optimization under Uncertainty

Models incorporating uncertain inputs, such as random forces or material parameters, have been of increasing interest in PDE-constrained optimization. In this paper, we focus on the efficient numerical minimization of a convex and smooth…

Optimization and Control · Mathematics 2021-06-18 Caroline Geiersbach , Winnifried Wollner

Stochastic Modified Equations and Dynamics of Stochastic Gradient Algorithms I: Mathematical Foundations

We develop the mathematical foundations of the stochastic modified equations (SME) framework for analyzing the dynamics of stochastic gradient algorithms, where the latter is approximated by a class of stochastic differential equations with…

Machine Learning · Computer Science 2018-11-06 Qianxiao Li , Cheng Tai , Weinan E

Uniform-in-Time Weak Error Analysis for Stochastic Gradient Descent Algorithms via Diffusion Approximation

Diffusion approximation provides weak approximation for stochastic gradient descent algorithms in a finite time horizon. In this paper, we introduce new tools motivated by the backward error analysis of numerical stochastic differential…

Machine Learning · Computer Science 2019-09-05 Yuanyuan Feng , Tingran Gao , Lei Li , Jian-Guo Liu , Yulong Lu

On the Convergence and Complexity of the Stochastic Central Finite-Difference Based Gradient Estimation Methods

This paper presents an algorithmic framework for solving unconstrained stochastic optimization problems using only stochastic function evaluations. We employ central finite-difference based gradient estimation methods to approximate the…

Optimization and Control · Mathematics 2025-01-14 Raghu Bollapragada , Cem Karamanli

Distributed Variable Sample-size Stochastic Optimization with Fixed Step-sizes

The paper considers distributed stochastic optimization over randomly switching networks, where agents collaboratively minimize the average of all agents' local expectation-valued convex cost functions. Due to the stochasticity in gradient…

Optimization and Control · Mathematics 2022-04-07 Jinlong Lei , Peng Yi , Jie Chen , Yiguang Hong

Multi-Level Stochastic Gradient Methods for Nested Composition Optimization

Stochastic gradient methods are scalable for solving large-scale optimization problems that involve empirical expectations of loss functions. Existing results mainly apply to optimization problems where the objectives are one- or two-level…

Optimization and Control · Mathematics 2018-01-15 Shuoguang Yang , Mengdi Wang , Ethan X. Fang

Stochastic Subspace Descent

We present two stochastic descent algorithms that apply to unconstrained optimization and are particularly efficient when the objective function is slow to evaluate and gradients are not easily obtained, as in some PDE-constrained…

Optimization and Control · Mathematics 2019-04-30 David Kozak , Stephen Becker , Alireza Doostan , Luis Tenorio

The estimation error of general first order methods

Modern large-scale statistical models require to estimate thousands to millions of parameters. This is often accomplished by iterative algorithms such as gradient descent, projected gradient descent or their accelerated versions. What are…

Machine Learning · Statistics 2020-03-04 Michael Celentano , Andrea Montanari , Yuchen Wu

A Proximal Stochastic Gradient Method with Progressive Variance Reduction

We consider the problem of minimizing the sum of two convex functions: one is the average of a large number of smooth component functions, and the other is a general convex function that admits a simple proximal mapping. We assume the whole…

Optimization and Control · Mathematics 2014-03-20 Lin Xiao , Tong Zhang

A Doubly Stochastically Perturbed Algorithm for Linearly Constrained Bilevel Optimization

In this work, we develop analysis and algorithms for a class of (stochastic) bilevel optimization problems whose lower-level (LL) problem is strongly convex and linearly constrained. Most existing approaches for solving such problems rely…

Optimization and Control · Mathematics 2025-04-08 Prashant Khanduri , Ioannis Tsaknakis , Yihua Zhang , Sijia Liu , Mingyi Hong