Related papers: Nesterov-aided Stochastic Gradient Methods using L…

Balancing Rates and Variance via Adaptive Batch-Size for Stochastic Optimization Problems

Stochastic gradient descent is a canonical tool for addressing stochastic optimization problems, and forms the bedrock of modern machine learning and statistics. In this work, we seek to balance the fact that attenuating step-size is…

Signal Processing · Electrical Eng. & Systems 2020-07-10 Zhan Gao , Alec Koppel , Alejandro Ribeiro

Maximum Likelihood Estimation for Stochastic Differential Equations Using Sequential Kriging-Based Optimization

Stochastic Differential Equations (SDEs) are used as statistical models in many disciplines. However, intractable likelihood functions for SDEs make inference challenging, and we need to resort to simulation-based techniques to estimate and…

Methodology · Statistics 2014-08-12 Grant Schneider , Peter F. Craigmile , Radu Herbei

The Practicality of Stochastic Optimization in Imaging Inverse Problems

In this work we investigate the practicality of stochastic gradient descent and recently introduced variants with variance-reduction techniques in imaging inverse problems. Such algorithms have been shown in the machine learning literature…

Optimization and Control · Mathematics 2021-01-26 Junqi Tang , Karen Egiazarian , Mohammad Golbabaee , Mike Davies

Stochastic Particle Gradient Descent for Infinite Ensembles

The superior performance of ensemble methods with infinite models are well known. Most of these methods are based on optimization problems in infinite-dimensional spaces with some regularization, for instance, boosting methods and convex…

Machine Learning · Statistics 2017-12-18 Atsushi Nitanda , Taiji Suzuki

Laplace-based strategies for Bayesian optimal experimental design with nuisance uncertainty

Finding the optimal design of experiments in the Bayesian setting typically requires estimation and optimization of the expected information gain functional. This functional consists of one outer and one inner integral, separated by the…

Numerical Analysis · Mathematics 2024-11-26 Arved Bartuska , Luis Espath , Raúl Tempone

A Stochastic Quasi-Newton Method with Nesterov's Accelerated Gradient

Incorporating second order curvature information in gradient based methods have shown to improve convergence drastically despite its computational intensity. In this paper, we propose a stochastic (online) quasi-Newton method with…

Machine Learning · Computer Science 2020-10-16 S. Indrapriyadarsini , Shahrzad Mahboubi , Hiroshi Ninomiya , Hideki Asai

Robust Optimization of PDEs with Random Coefficients Using a Multilevel Monte Carlo Method

This paper addresses optimization problems constrained by partial differential equations with uncertain coefficients. In particular, the robust control problem and the average control problem are considered for a tracking type cost…

Optimization and Control · Mathematics 2017-11-08 Andreas Van Barel , Stefan Vandewalle

A Unified Stochastic Gradient Approach to Designing Bayesian-Optimal Experiments

We introduce a fully stochastic gradient based approach to Bayesian optimal experimental design (BOED). Our approach utilizes variational lower bounds on the expected information gain (EIG) of an experiment that can be simultaneously…

Machine Learning · Statistics 2020-02-28 Adam Foster , Martin Jankowiak , Matthew O'Meara , Yee Whye Teh , Tom Rainforth

New Penalized Stochastic Gradient Methods for Linearly Constrained Strongly Convex Optimization

For minimizing a strongly convex objective function subject to linear inequality constraints, we consider a penalty approach that allows one to utilize stochastic methods for problems with a large number of constraints and/or objective…

Optimization and Control · Mathematics 2022-02-16 Meng Li , Paul Grigas , Alper Atamturk

Stochastic Nested Variance Reduction for Nonconvex Optimization

We study finite-sum nonconvex optimization problems, where the objective function is an average of $n$ nonconvex functions. We propose a new stochastic gradient descent algorithm based on nested variance reduction. Compared with…

Machine Learning · Computer Science 2020-10-20 Dongruo Zhou , Pan Xu , Quanquan Gu

A subsampling approach for Bayesian model selection

It is common practice to use Laplace approximations to compute marginal likelihoods in Bayesian versions of generalised linear models (GLM). Marginal likelihoods combined with model priors are then used in different search algorithms to…

Methodology · Statistics 2022-02-01 Jon Lachmann , Geir Storvik , Florian Frommlet , Aliaksadr Hubin

A stochastic subspace approach to gradient-free optimization in high dimensions

We present a stochastic descent algorithm for unconstrained optimization that is particularly efficient when the objective function is slow to evaluate and gradients are not easily obtained, as in some PDE-constrained optimization and…

Optimization and Control · Mathematics 2024-07-08 David Kozak , Stephen Becker , Alireza Doostan , Luis Tenorio

Constructing unbiased gradient estimators with finite variance for conditional stochastic optimization

We study stochastic gradient descent for solving conditional stochastic optimization problems, in which an objective to be minimized is given by a parametric nested expectation with an outer expectation taken with respect to one random…

Numerical Analysis · Mathematics 2023-04-28 Takashi Goda , Wataru Kitade

Tuning the Scheduling of Distributed Stochastic Gradient Descent with Bayesian Optimization

We present an optimizer which uses Bayesian optimization to tune the system parameters of distributed stochastic gradient descent (SGD). Given a specific context, our goal is to quickly find efficient configurations which appropriately…

Machine Learning · Statistics 2016-12-04 Valentin Dalibard , Michael Schaarschmidt , Eiko Yoneki

Understanding the Acceleration Phenomenon via High-Resolution Differential Equations

Gradient-based optimization algorithms can be studied from the perspective of limiting ordinary differential equations (ODEs). Motivated by the fact that existing ODEs do not distinguish between two fundamentally different…

Optimization and Control · Mathematics 2018-11-05 Bin Shi , Simon S. Du , Michael I. Jordan , Weijie J. Su

Restarted Nonconvex Accelerated Gradient Descent: No More Polylogarithmic Factor in the $O(\epsilon^{-7/4})$ Complexity

This paper studies accelerated gradient methods for nonconvex optimization with Lipschitz continuous gradient and Hessian. We propose two simple accelerated gradient methods, restarted accelerated gradient descent (AGD) and restarted heavy…

Optimization and Control · Mathematics 2023-04-27 Huan Li , Zhouchen Lin

Learning to Accelerate by the Methods of Step-size Planning

Gradient descent is slow to converge for ill-conditioned problems and non-convex problems. An important technique for acceleration is step-size adaptation. The first part of this paper contains a detailed review of step-size adaptation…

Machine Learning · Computer Science 2022-05-27 Hengshuai Yao

Variance reduction for Random Coordinate Descent-Langevin Monte Carlo

Sampling from a log-concave distribution function is one core problem that has wide applications in Bayesian statistics and machine learning. While most gradient free methods have slow convergence rate, the Langevin Monte Carlo (LMC) that…

Machine Learning · Statistics 2020-10-23 Zhiyan Ding , Qin Li

Bayesian polynomial neural networks and polynomial neural ordinary differential equations

Symbolic regression with polynomial neural networks and polynomial neural ordinary differential equations (ODEs) are two recent and powerful approaches for equation recovery of many science and engineering problems. However, these methods…

Machine Learning · Computer Science 2023-08-28 Colby Fronk , Jaewoong Yun , Prashant Singh , Linda Petzold

An Extended Simplified Laplace strategy for Approximate Bayesian inference of Latent Gaussian Models using R-INLA

Various computational challenges arise when applying Bayesian inference approaches to complex hierarchical models. Sampling-based inference methods, such as Markov Chain Monte Carlo strategies, are renowned for providing accurate results…

Methodology · Statistics 2022-03-29 Cristian Chiuchiolo , Janet van Niekerk , Håvard Rue