Related papers: Constrained and Preconditioned Stochastic Gradient…

Stochastic Proximal Gradient Algorithms for Penalized Mixed Models

Motivated by penalized likelihood maximization in complex models, we study optimization problems where neither the function to optimize nor its gradient have an explicit expression, but its gradient can be approximated by a Monte Carlo…

Computation · Statistics 2017-09-28 Gersende Fort, Edouard Ollier, Adeline Samson

A Projected Stochastic Gradient Method for Finite-Sum Problems with Linear Equality Constraints

A stochastic gradient method for finite-sum minimization subject to deterministic linear constraints is proposed and analyzed. The procedure presented adapts the projected gradient method on convex set to the use of both a stochastic…

Optimization and Control · Mathematics 2026-05-19 Natasa Krklec Jerinkic, Benedetta Morini, Mahsa Yousefi

Faster One-Sample Stochastic Conditional Gradient Method for Composite Convex Minimization

We propose a stochastic conditional gradient method (CGM) for minimizing convex finite-sum objectives formed as a sum of smooth and non-smooth terms. Existing CGM variants for this template either suffer from slow convergence rates, or…

Machine Learning · Computer Science 2022-04-19 Gideon Dresdner, Maria-Luiza Vladarean, Gunnar Rätsch, Francesco Locatello, Volkan Cevher, Alp Yurtsever

Constrained Stochastic Spectral Preconditioning Converges for Nonconvex Objectives

In this work, we develop proximal preconditioned gradient methods with a focus on spectral gradient methods providing a proximal extension to the Muon and Scion optimizers. We introduce a family of stochastic algorithms that can handle a…

Optimization and Control · Mathematics 2026-05-13 Konstantinos Oikonomidis, Jan Quan, Kimon Antonakopoulos, Antonio Silveti-Falls, Volkan Cevher, Panagiotis Patrinos

How to optimize preconditioners for the conjugate gradient method: a stochastic approach

The conjugate gradient method (CG) is typically used with a preconditioner which improves efficiency and robustness of the method. Many preconditioners include parameters and a proper choice of a preconditioner and its parameters is often…

Numerical Analysis · Mathematics 2019-06-04 Alexandr Katrutsa, Mike Botchev, George Ovchinnikov, Ivan Oseledets

Preconditioned Stochastic Gradient Descent

Stochastic gradient descent (SGD) is still the workhorse for many practical problems. However, it converges slowly and can be difficult to tune. It is possible to precondition SGD to accelerate its convergence remarkably. But many attempts…

Machine Learning · Statistics 2017-02-23 Xi-Lin Li

A Stochastic Conjugate Gradient Method for Approximation of Functions

A stochastic conjugate gradient method for approximation of a function is proposed. The proposed method avoids computing and storing the covariance matrix in the normal equations for the least squares solution. In addition, the method…

Numerical Analysis · Mathematics 2013-02-11 Hong Jiang, Paul Wilford

On the Convergence of Constrained Gradient Method

The constrained gradient method (CGM) has recently been proposed to solve convex optimization and monotone variational inequality (VI) problems with general functional constraints. While existing literature has established convergence…

Optimization and Control · Mathematics 2025-11-24 Danqing Zhou, Hongmei Chen, Shiqian Ma, Junfeng Yang

Online Statistical Inference for Parameters Estimation with Linear-Equality Constraints

Stochastic gradient descent (SGD) and projected stochastic gradient descent (PSGD) are scalable algorithms to compute model parameters in unconstrained and constrained optimization problems. In comparison with SGD, PSGD forces its iterative…

Machine Learning · Statistics 2022-03-24 Ruiqi Liu, Mingao Yuan, Zuofeng Shang

Convergence Properties of Stochastic Hypergradients

Bilevel optimization problems are receiving increasing attention in machine learning as they provide a natural framework for hyperparameter optimization and meta-learning. A key step to tackle these problems is the efficient computation of…

Machine Learning · Statistics 2025-05-20 Riccardo Grazzi, Massimiliano Pontil, Saverio Salzo

A Proximal Stochastic Gradient Method with Adaptive Step Size and Variance Reduction for Convex Composite Optimization

In this paper, we propose a proximal stochastic gradient algorithm (PSGA) for solving composite optimization problems by incorporating variance reduction techniques and an adaptive step-size strategy. In the PSGA method, the objective…

Optimization and Control · Mathematics 2026-04-06 Changjie Fang, Hao Yang, Shenglan Chen

First and Second Order Approximations to Stochastic Gradient Descent Methods with Momentum Terms

Stochastic Gradient Descent (SGD) methods see many uses in optimization problems. Modifications to the algorithm, such as momentum-based SGD methods have been known to produce better results in certain cases. Much of this, however, is due…

Machine Learning · Computer Science 2025-04-22 Eric Lu

Stochastic Gradient Descent for Constrained Optimization based on Adaptive Relaxed Barrier Functions

This paper presents a novel stochastic gradient descent algorithm for constrained optimization. The proposed algorithm randomly samples constraints and components of the finite sum objective function and relies on a relaxed logarithmic…

Optimization and Control · Mathematics 2025-05-13 Naum Dimitrieski, Jing Cao, Christian Ebenbauer

On the Adaptivity of Stochastic Gradient-Based Optimization

Stochastic-gradient-based optimization has been a core enabling methodology in applications to large-scale problems in machine learning and related areas. Despite the progress, the gap between theory and practice remains significant, with…

Optimization and Control · Mathematics 2021-01-01 Lihua Lei, Michael I. Jordan

Sampling and Update Frequencies in Proximal Variance-Reduced Stochastic Gradient Methods

Variance-reduced stochastic gradient methods have gained popularity in recent times. Several variants exist with different strategies for the storing and sampling of gradients and this work concerns the interactions between these two…

Optimization and Control · Mathematics 2022-10-19 Martin Morin, Pontus Giselsson

Constrained and Composite Optimization via Adaptive Sampling Methods

The motivation for this paper stems from the desire to develop an adaptive sampling method for solving constrained optimization problems in which the objective function is stochastic and the constraints are deterministic. The method…

Optimization and Control · Mathematics 2021-01-01 Yuchen Xie, Raghu Bollapragada, Richard Byrd, Jorge Nocedal

Tail bounds for stochastic approximation

Stochastic-approximation gradient methods are attractive for large-scale convex optimization because they offer inexpensive iterations. They are especially popular in data-fitting and machine-learning applications where the data arrives in…

Optimization and Control · Mathematics 2014-01-09 Michael P. Friedlander, Gabriel Goh

A Light Touch for Heavily Constrained SGD

Minimizing empirical risk subject to a set of constraints can be a useful strategy for learning restricted classes of functions, such as monotonic functions, submodular functions, classifiers that guarantee a certain class label for some…

Machine Learning · Computer Science 2016-10-26 Andrew Cotter, Maya Gupta, Jan Pfeifer

Asymptotic Analysis of Conditioned Stochastic Gradient Descent

In this paper, we investigate a general class of stochastic gradient descent (SGD) algorithms, called Conditioned SGD, based on a preconditioning of the gradient direction. Using a discrete-time approach with martingale tools, we establish…

Statistics Theory · Mathematics 2023-10-17 Rémi Leluc, François Portier

A Stochastic Gradient Method with Mesh Refinement for PDE Constrained Optimization under Uncertainty

Models incorporating uncertain inputs, such as random forces or material parameters, have been of increasing interest in PDE-constrained optimization. In this paper, we focus on the efficient numerical minimization of a convex and smooth…

Optimization and Control · Mathematics 2021-06-18 Caroline Geiersbach, Winnifried Wollner