Related papers: Nesterov-aided Stochastic Gradient Methods using L…

A posteriori stochastic correction of reduced models in delayed acceptance MCMC, with application to multiphase subsurface inverse problems

Sample-based Bayesian inference provides a route to uncertainty quantification in the geosciences, and inverse problems in general, though is very computationally demanding in the naive form that requires simulating an accurate computer…

Computation · Statistics 2019-04-12 Tiangang Cui, Colin Fox, Michael J O'Sullivan

Nesterov Acceleration for Equality-Constrained Convex Optimization via Continuously Differentiable Penalty Functions

We propose a framework to use Nesterov's accelerated method for constrained convex optimization problems. Our approach consists of first reformulating the original problem as an unconstrained optimization problem using a continuously…

Optimization and Control · Mathematics 2021-03-12 Priyank Srivastava, Jorge Cortes

Accelerated Gradient Methods for Nonconvex Nonlinear and Stochastic Programming

In this paper, we generalize the well-known Nesterov's accelerated gradient (AG) method, originally designed for convex smooth optimization, to solve nonconvex and possibly stochastic optimization problems. We demonstrate that by properly…

Optimization and Control · Mathematics 2013-10-15 Saeed Ghadimi, Guanghui Lan

On Accelerated Methods in Optimization

In convex optimization, there is an acceleration phenomenon in which we can boost the convergence rate of certain gradient-based algorithms. We can observe this phenomenon in Nesterov's accelerated gradient descent, accelerated mirror…

Optimization and Control · Mathematics 2015-09-14 Andre Wibisono, Ashia C. Wilson

Ordered SGD: A New Stochastic Optimization Framework for Empirical Risk Minimization

We propose a new stochastic optimization framework for empirical risk minimization problems such as those that arise in machine learning. The traditional approaches, such as (mini-batch) stochastic gradient descent (SGD), utilize an…

Machine Learning · Statistics 2020-02-04 Kenji Kawaguchi, Haihao Lu

Escaping Saddle Points for Zeroth-order Nonconvex Optimization using Estimated Gradient Descent

Gradient descent and its variants are widely used in machine learning. However, oracle access of gradient may not be available in many applications, limiting the direct use of gradient descent. This paper proposes a method of estimating…

Optimization and Control · Mathematics 2019-10-07 Qinbo Bai, Mridul Agarwal, Vaneet Aggarwal

Online Learning Rate Adaptation with Hypergradient Descent

We introduce a general method for improving the convergence rate of gradient-based optimizers that is easy to implement and works well in practice. We demonstrate the effectiveness of the method in a range of optimization problems by…

Machine Learning · Computer Science 2018-08-23 Atilim Gunes Baydin, Robert Cornish, David Martinez Rubio, Mark Schmidt, Frank Wood

Convergence of First-Order Methods for Constrained Nonconvex Optimization with Dependent Data

We focus on analyzing the classical stochastic projected gradient methods under a general dependent data sampling scheme for constrained smooth nonconvex optimization. We show the worst-case rate of convergence $\tilde{O}(t^{-1/4})$ and…

Optimization and Control · Mathematics 2023-06-26 Ahmet Alacaoglu, Hanbaek Lyu

A Two-Time-Scale Stochastic Optimization Framework with Applications in Control and Reinforcement Learning

We study a new two-time-scale stochastic gradient method for solving optimization problems, where the gradients are computed with the aid of an auxiliary variable under samples generated by time-varying MDPs controlled by the underlying…

Optimization and Control · Mathematics 2024-08-27 Sihan Zeng, Thinh T. Doan, Justin Romberg

Randomised Splitting Methods and Stochastic Gradient Descent

We explore an explicit link between stochastic gradient descent using common batching strategies and splitting methods for ordinary differential equations. From this perspective, we introduce a new minibatching strategy (called Symmetric…

Optimization and Control · Mathematics 2025-04-08 Luke Shaw, Peter A. Whalley

On the Stochastic (Variance-Reduced) Proximal Gradient Method for Regularized Expected Reward Optimization

We consider a regularized expected reward optimization problem in the non-oblivious setting that covers many existing problems in reinforcement learning (RL). In order to solve such an optimization problem, we apply and analyze the…

Machine Learning · Computer Science 2024-08-21 Ling Liang, Haizhao Yang

A Framework for Nonlinearly-Constrained Gradient-Enhanced Local Bayesian Optimization with Comparisons to Quasi-Newton Optimizers

Bayesian optimization is a popular and versatile approach that is well suited to solve challenging optimization problems. Their popularity comes from their effective minimization of expensive function evaluations, their capability to…

Optimization and Control · Mathematics 2026-05-14 André L. Marchildon, David W. Zingg

Monte Carlo Algorithms for Optimal Stopping and Statistical Learning

We extend the Longstaff-Schwartz algorithm for approximately solving optimal stopping problems on high-dimensional state spaces. We reformulate the optimal stopping problem for Markov processes in discrete time as a generalized statistical…

Probability · Mathematics 2007-05-23 Daniel Egloff

BFE and AdaBFE: A New Approach in Learning Rate Automation for Stochastic Optimization

In this paper, a new gradient-based optimization approach by automatically adjusting the learning rate is proposed. This approach can be applied to design non-adaptive learning rate and adaptive learning rate. Firstly, I will introduce the…

Machine Learning · Computer Science 2022-07-07 Xin Cao

Linear Convergence of Accelerated Stochastic Gradient Descent for Nonconvex Nonsmooth Optimization

In this paper, we study the stochastic gradient descent (SGD) method for the nonconvex nonsmooth optimization, and propose an accelerated SGD method by combining the variance reduction technique with Nesterov's extrapolation technique.…

Optimization and Control · Mathematics 2019-02-18 Feihu Huang, Songcan Chen

Stochastic Marginal Likelihood Gradients using Neural Tangent Kernels

Selecting hyperparameters in deep learning greatly impacts its effectiveness but requires manual effort and expertise. Recent works show that Bayesian model selection with Laplace approximations can allow to optimize such hyperparameters…

Machine Learning · Statistics 2023-06-08 Alexander Immer, Tycho F. A. van der Ouderaa, Mark van der Wilk, Gunnar Rätsch, Bernhard Schölkopf

MAGMA: Multi-level accelerated gradient mirror descent algorithm for large-scale convex composite minimization

Composite convex optimization models arise in several applications, and are especially prevalent in inverse problems with a sparsity inducing norm and in general convex optimization with simple constraints. The most widely used algorithms…

Optimization and Control · Mathematics 2016-07-15 Vahan Hovhannisyan, Panos Parpas, Stefanos Zafeiriou

Nesterov's Accelerated Gradient Method for Nonlinear Ill-Posed Problems with a Locally Convex Residual Functional

In this paper, we consider Nesterov's Accelerated Gradient method for solving Nonlinear Inverse and Ill-Posed Problems. Known to be a fast gradient-based iterative method for solving well-posed convex optimization problems, this method also…

Numerical Analysis · Mathematics 2020-01-13 Simon Hubmer, Ronny Ramlau

A Near-Optimal Algorithm for Stochastic Bilevel Optimization via Double-Momentum

This paper proposes a new algorithm, the Single-timescale Double-momentum Stochastic Approximation (SUSTAIN), for tackling stochastic unconstrained bilevel…

Optimization and Control · Mathematics 2021-06-16 Prashant Khanduri, Siliang Zeng, Mingyi Hong, Hoi-To Wai, Zhaoran Wang, Zhuoran Yang

First Order Methods with Markovian Noise: from Acceleration to Variational Inequalities

This paper delves into stochastic optimization problems that involve Markovian noise. We present a unified approach for the theoretical analysis of first-order gradient methods for stochastic optimization and variational inequalities. Our…

Optimization and Control · Mathematics 2024-04-02 Aleksandr Beznosikov, Sergey Samsonov, Marina Sheshukova, Alexander Gasnikov, Alexey Naumov, Eric Moulines