Related papers: An Adaptive Sampling Algorithm for Level-set Appro…

On Stochastic Gradient and Subgradient Methods with Adaptive Steplength Sequences

The performance of standard stochastic approximation implementations can vary significantly based on the choice of the steplength sequence, and in general, little guidance is provided about good choices. Motivated by this gap, in the first…

Optimization and Control · Mathematics 2015-03-19 Farzad Yousefian , Angelia Nedić , Uday V. Shanbhag

A Gradient Sampling Algorithm for Noisy Nonsmooth Optimization

An algorithm is proposed, analyzed, and tested for minimizing locally Lipschitz objective functions that may be nonconvex and/or nonsmooth. The algorithm, which is built upon the gradient-sampling methodology, is designed specifically for…

Optimization and Control · Mathematics 2026-04-02 Albert S. Berahas , Frank E. Curtis , Lara Zebiane

A Stochastic Gradient Descent Method for Globally Minimizing Nearly Convex Functions

This paper proposes a stochastic gradient descent method with an adaptive Gaussian noise term for the global minimization of nearly convex functions, which are nonconvex and possess multiple strict local minimizers. The noise term,…

Optimization and Control · Mathematics 2025-08-05 Chenglong Bao , Liang Chen , Weizhi Shao

Hybrid least squares for learning functions from highly noisy data

Motivated by the need for efficient estimation of conditional expectations, we consider a least-squares function approximation problem with heavily polluted data. Existing methods that are effective in the small-noise regime are suboptimal…

Machine Learning · Statistics 2026-05-26 Ben Adcock , Bernhard Hientzsch , Akil Narayan , Yiming Xu

A fully adaptive multilevel stochastic collocation strategy for solving elliptic PDEs with random data

We propose and analyse a fully adaptive strategy for solving elliptic PDEs with random data in this work. A hierarchical sequence of adaptive mesh refinements for the spatial approximation is combined with adaptive anisotropic sparse…

Numerical Analysis · Mathematics 2020-08-26 Jens Lang , Robert Scheichl , David Silvester

Adaptive Sampling Distributed Stochastic Variance Reduced Gradient for Heterogeneous Distributed Datasets

We study distributed optimization algorithms for minimizing the average of \emph{heterogeneous} functions distributed across several machines with a focus on communication efficiency. In such settings, naively using the classical stochastic…

Machine Learning · Computer Science 2020-11-18 Ilqar Ramazanli , Han Nguyen , Hai Pham , Sashank J. Reddi , Barnabas Poczos

Multilevel Sparse Grid Methods for Elliptic Partial Differential Equations with Random Coefficients

Stochastic sampling methods are arguably the most direct and least intrusive means of incorporating parametric uncertainty into numerical simulations of partial differential equations with random inputs. However, to achieve an overall error…

Numerical Analysis · Mathematics 2014-04-09 Hans-Werner van Wyk

Universal Adaptive Proximal Gradient Methods via Gradient Mapping Accumulation

We propose an adaptive proximal gradient method for minimizing the sum of two functions, where one is a simple convex function, and the other belongs to one of the three classes: nonconvex smooth, convex nonsmooth, or convex smooth. The key…

Optimization and Control · Mathematics 2026-05-08 Zimeng Wang , Alp Yurtsever

Adaptive Algorithms with Sharp Convergence Rates for Stochastic Hierarchical Optimization

Hierarchical optimization refers to problems with interdependent decision variables and objectives, such as minimax and bilevel formulations. While various algorithms have been proposed, existing methods and analyses lack adaptivity in…

Machine Learning · Computer Science 2025-10-27 Xiaochuan Gong , Jie Hao , Mingrui Liu

A Stochastic Subgradient Method for Nonsmooth Nonconvex Multi-Level Composition Optimization

We propose a single time-scale stochastic subgradient method for constrained optimization of a composition of several nonsmooth and nonconvex functions. The functions are assumed to be locally Lipschitz and differentiable in a generalized…

Optimization and Control · Mathematics 2020-12-22 Andrzej Ruszczynski

Near-optimal delta-convex estimation of Lipschitz functions

This paper presents a tractable algorithm for estimating an unknown Lipschitz function from noisy observations and establishes an upper bound on its convergence rate. The approach extends max-affine methods from convex shape-restricted…

Machine Learning · Statistics 2025-11-20 Gábor Balázs

Value Function Approximation in Noisy Environments Using Locally Smoothed Regularized Approximate Linear Programs

Recently, Petrik et al. demonstrated that L1Regularized Approximate Linear Programming (RALP) could produce value functions and policies which compared favorably to established linear value function approximation techniques like LSPI.…

Machine Learning · Computer Science 2012-10-19 Gavin Taylor , Ron Parr

Error estimation and adaptivity for stochastic collocation finite elements Part I: single-level approximation

A general adaptive refinement strategy for solving linear elliptic partial differential equation with random data is proposed and analysed herein. The adaptive strategy extends the a posteriori error estimation framework introduced by…

Numerical Analysis · Mathematics 2022-08-23 Alex Bespalov , David Silvester , Feng Xu

Stochastic Localization Methods for Convex Discrete Optimization via Simulation

We develop and analyze a set of new sequential simulation-optimization algorithms for large-scale multi-dimensional discrete optimization via simulation problems with a convexity structure. The "large-scale" notion refers to that the…

Optimization and Control · Mathematics 2022-01-20 Haixiang Zhang , Zeyu Zheng , Javad Lavaei

Error estimation and adaptivity for stochastic collocation finite elements Part II: multilevel approximation

A multilevel adaptive refinement strategy for solving linear elliptic partial differential equations with random data is recalled in this work. The strategy extends the a posteriori error estimation framework introduced by Guignard and…

Numerical Analysis · Mathematics 2022-02-21 Alex Bespalov , David J. Silvester

A Gradient Sampling Algorithm for Stratified Maps with Applications to Topological Data Analysis

We introduce a novel gradient descent algorithm extending the well-known Gradient Sampling methodology to the class of stratifiably smooth objective functions, which are defined as locally Lipschitz functions that are smooth on some regular…

Computational Geometry · Computer Science 2021-09-06 Jacob Leygonie , Mathieu Carrière , Théo Lacombe , Steve Oudot

Convergence Properties of Stochastic Hypergradients

Bilevel optimization problems are receiving increasing attention in machine learning as they provide a natural framework for hyperparameter optimization and meta-learning. A key step to tackle these problems is the efficient computation of…

Machine Learning · Statistics 2025-05-20 Riccardo Grazzi , Massimiliano Pontil , Saverio Salzo

Hinge-Proximal Stochastic Gradient Methods for Convex Optimization with Functional Constraints

This paper considers stochastic convex optimization problems with smooth functional constraints arising in constrained estimation and robust signal recovery. We operate in the high-dimensional and highly-constrained setting, where oracle…

Optimization and Control · Mathematics 2025-12-16 Vaibhav Rajoriya , Prateek Priyaranjan Pradhan , Ketan Rajawat

An Adaptive Stochastic Gradient Method with Non-negative Gauss-Newton Stepsizes

We consider the problem of minimizing the average of a large number of smooth but possibly non-convex functions. In the context of most machine learning applications, each loss function is non-negative and thus can be expressed as the…

Optimization and Control · Mathematics 2024-07-08 Antonio Orvieto , Lin Xiao

A Proximal Stochastic Gradient Method with Progressive Variance Reduction

We consider the problem of minimizing the sum of two convex functions: one is the average of a large number of smooth component functions, and the other is a general convex function that admits a simple proximal mapping. We assume the whole…

Optimization and Control · Mathematics 2014-03-20 Lin Xiao , Tong Zhang