Related papers: Fast and Correct Gradient-Based Optimisation for P…

A Smoothing Stochastic Gradient Method for Composite Optimization

We consider the unconstrained optimization problem whose objective function is composed of a smooth and a non-smooth conponents where the smooth component is the expectation a random function. This type of problem arises in some interesting…

Optimization and Control · Mathematics 2011-07-01 Qihang Lin , Xi Chen , Javier Pena

Diagonalisation SGD: Fast & Convergent SGD for Non-Differentiable Models via Reparameterisation and Smoothing

It is well-known that the reparameterisation gradient estimator, which exhibits low variance in practice, is biased for non-differentiable models. This may compromise correctness of gradient-based optimisation methods such as stochastic…

Machine Learning · Computer Science 2024-02-21 Dominik Wagner , Basim Khajwal , C. -H. Luke Ong

Randomized Smoothing for Stochastic Optimization

We analyze convergence rates of stochastic optimization procedures for non-smooth convex optimization problems. By combining randomized smoothing techniques with accelerated gradient methods, we obtain convergence rates of stochastic…

Optimization and Control · Mathematics 2012-04-10 John C. Duchi , Peter L. Bartlett , Martin J. Wainwright

Stochastic Conditional Gradient Methods: From Convex Minimization to Submodular Maximization

This paper considers stochastic optimization problems for a large class of objective functions, including convex and continuous submodular. Stochastic proximal gradient methods have been widely used to solve such problems; however, their…

Optimization and Control · Mathematics 2018-11-13 Aryan Mokhtari , Hamed Hassani , Amin Karbasi

Making Gradient Descent Optimal for Strongly Convex Stochastic Optimization

Stochastic gradient descent (SGD) is a simple and popular method to solve stochastic optimization problems which arise in machine learning. For strongly convex problems, its convergence rate was known to be O(\log(T)/T), by running SGD for…

Machine Learning · Computer Science 2015-03-19 Alexander Rakhlin , Ohad Shamir , Karthik Sridharan

Reparameterization Gradients through Acceptance-Rejection Sampling Algorithms

Variational inference using the reparameterization trick has enabled large-scale approximate Bayesian inference in complex probabilistic models, leveraging stochastic optimization to sidestep intractable expectations. The reparameterization…

Machine Learning · Statistics 2020-02-13 Christian A. Naesseth , Francisco J. R. Ruiz , Scott W. Linderman , David M. Blei

Importance Sampled Stochastic Optimization for Variational Inference

Variational inference approximates the posterior distribution of a probabilistic model with a parameterized density by maximizing a lower bound for the model evidence. Modern solutions fit a flexible approximation with stochastic gradient…

Machine Learning · Statistics 2017-07-13 Joseph Sakaya , Arto Klami

Gradient Estimation and Variance Reduction in Stochastic and Deterministic Models

It seems that in the current age, computers, computation, and data have an increasingly important role to play in scientific research and discovery. This is reflected in part by the rise of machine learning and artificial intelligence,…

Machine Learning · Computer Science 2024-05-15 Ronan Keane

A Stochastic Gradient Method with Mesh Refinement for PDE Constrained Optimization under Uncertainty

Models incorporating uncertain inputs, such as random forces or material parameters, have been of increasing interest in PDE-constrained optimization. In this paper, we focus on the efficient numerical minimization of a convex and smooth…

Optimization and Control · Mathematics 2021-06-18 Caroline Geiersbach , Winnifried Wollner

Non-smooth stochastic gradient descent using smoothing functions

In this paper, we address stochastic optimization problems involving a composition of a non-smooth outer function and a smooth inner function, a formulation frequently encountered in machine learning and operations research. To deal with…

Optimization and Control · Mathematics 2026-05-15 Tommaso Giovannelli , Jingfu Tan , Luis Nunes Vicente

Probabilistic Best Subset Selection via Gradient-Based Optimization

In high-dimensional statistics, variable selection recovers the latent sparse patterns from all possible covariate combinations. This paper proposes a novel optimization method to solve the exact L0-regularized regression problem, which is…

Methodology · Statistics 2022-06-02 Mingzhang Yin , Nhat Ho , Bowei Yan , Xiaoning Qian , Mingyuan Zhou

Efficient Stochastic Gradient Descent for Learning with Distributionally Robust Optimization

Distributionally robust optimization (DRO) problems are increasingly seen as a viable method to train machine learning models for improved model generalization. These min-max formulations, however, are more difficult to solve. We therefore…

Machine Learning · Statistics 2020-11-03 Soumyadip Ghosh , Mark Squillante , Ebisa Wollega

A Zeroth-order Proximal Stochastic Gradient Method for Weakly Convex Stochastic Optimization

In this paper we analyze a zeroth-order proximal stochastic gradient method suitable for the minimization of weakly convex stochastic optimization problems. We consider nonsmooth and nonlinear stochastic composite problems, for which…

Optimization and Control · Mathematics 2025-04-21 Spyridon Pougkakiotis , Dionysios S. Kalogerias

A Continuous-time Stochastic Gradient Descent Method for Continuous Data

Optimization problems with continuous data appear in, e.g., robust machine learning, functional data analysis, and variational inference. Here, the target function is given as an integral over a family of (continuously) indexed target…

Machine Learning · Computer Science 2023-11-01 Kexin Jin , Jonas Latz , Chenguang Liu , Carola-Bibiane Schönlieb

On the Regularizing Property of Stochastic Gradient Descent

Stochastic gradient descent is one of the most successful approaches for solving large-scale problems, especially in machine learning and statistics. At each iteration, it employs an unbiased estimator of the full gradient computed from one…

Numerical Analysis · Mathematics 2018-12-05 Bangti Jin , Xiliang Lu

Reparameterization Gradient for Non-differentiable Models

We present a new algorithm for stochastic variational inference that targets at models with non-differentiable densities. One of the key challenges in stochastic variational inference is to come up with a low-variance estimator of the…

Machine Learning · Computer Science 2018-10-26 Wonyeol Lee , Hangyeol Yu , Hongseok Yang

A Proximal Stochastic Gradient Method with Progressive Variance Reduction

We consider the problem of minimizing the sum of two convex functions: one is the average of a large number of smooth component functions, and the other is a general convex function that admits a simple proximal mapping. We assume the whole…

Optimization and Control · Mathematics 2014-03-20 Lin Xiao , Tong Zhang

Probabilistic Smoothing with Ratio-Monotone Transforms for Global Optimization

Probabilistic smoothing is a standard tool for global optimization, but existing methods rely on Gaussian kernels and specific transforms, often resulting in strong hyperparameter sensitivity and limited robustness. We propose a general…

Machine Learning · Computer Science 2026-05-27 Kukyoung Jang , Taehyun Cho , Junrui Zhang , Ping Xu , Kyungjae Lee

The Practicality of Stochastic Optimization in Imaging Inverse Problems

In this work we investigate the practicality of stochastic gradient descent and recently introduced variants with variance-reduction techniques in imaging inverse problems. Such algorithms have been shown in the machine learning literature…

Optimization and Control · Mathematics 2021-01-26 Junqi Tang , Karen Egiazarian , Mohammad Golbabaee , Mike Davies

An Algebraically Converging Stochastic Gradient Descent Algorithm for Global Optimization

We propose a new gradient descent algorithm with added stochastic terms for finding the global optimizers of nonconvex optimization problems. A key component in the algorithm is the adaptive tuning of the randomness based on the value of…

Optimization and Control · Mathematics 2025-06-16 Björn Engquist , Kui Ren , Yunan Yang