Related papers: Nesterov-aided Stochastic Gradient Methods using L…

Gradient-Based Markov Chain Monte Carlo for MIMO Detection

Accurately detecting symbols transmitted over multiple-input multiple-output (MIMO) wireless channels is crucial in realizing the benefits of MIMO techniques. However, optimal MIMO detection is associated with a complexity that grows…

Signal Processing · Electrical Eng. & Systems 2024-10-28 Xingyu Zhou, Le Liang, Jing Zhang, Chao-Kai Wen, Shi Jin

Bayesian Optimization Meets Laplace Approximation for Robotic Introspection

In robotics, deep learning (DL) methods are used more and more widely, but their general inability to provide reliable confidence estimates will ultimately lead to fragile and unreliable systems. This impedes the potential deployments of DL…

Robotics · Computer Science 2020-11-02 Matthias Humt, Jongseok Lee, Rudolph Triebel

Faster Convergence of a Randomized Coordinate Descent Method for Linearly Constrained Optimization Problems

The problem of minimizing a separable convex function under linearly coupled constraints arises from various application domains such as economic systems, distributed control, and network flow. The main challenge for solving this problem is…

Optimization and Control · Mathematics 2017-09-05 Qin Fan, Min Xu, Yiming Ying

Stochastic gradient algorithms from ODE splitting perspective

We present a different view on stochastic optimization, which goes back to the splitting schemes for approximate solutions of ODE. In this work, we provide a connection between stochastic gradient descent approach and first-order splitting…

Machine Learning · Statistics 2020-04-21 Daniil Merkulov, Ivan Oseledets

Stochastic Conditional Gradient Methods: From Convex Minimization to Submodular Maximization

This paper considers stochastic optimization problems for a large class of objective functions, including convex and continuous submodular. Stochastic proximal gradient methods have been widely used to solve such problems; however, their…

Optimization and Control · Mathematics 2018-11-13 Aryan Mokhtari, Hamed Hassani, Amin Karbasi

On Markov Chain Gradient Descent

Stochastic gradient methods are the workhorse (algorithms) of large-scale optimization problems in machine learning, signal processing, and other computational sciences and engineering. This paper studies Markov chain gradient descent, a…

Optimization and Control · Mathematics 2018-09-13 Tao Sun, Yuejiao Sun, Wotao Yin

Nesterov Acceleration of Alternating Least Squares for Canonical Tensor Decomposition: Momentum Step Size Selection and Restart Mechanisms

We present Nesterov-type acceleration techniques for Alternating Least Squares (ALS) methods applied to canonical tensor decomposition. While Nesterov acceleration turns gradient descent into an optimal first-order method for convex…

Optimization and Control · Mathematics 2019-12-03 Drew Mitchell, Nan Ye, Hans De Sterck

Large Stepsizes Accelerate Gradient Descent for Regularized Logistic Regression

We study gradient descent (GD) with a constant stepsize for $\ell_2$-regularized logistic regression with linearly separable data. Classical theory suggests small stepsizes to ensure monotonic reduction of the optimization objective,…

Machine Learning · Statistics 2025-11-04 Jingfeng Wu, Pierre Marion, Peter Bartlett

Optimal Experimental Design Using A Consistent Bayesian Approach

We consider the utilization of a computational model to guide the optimal acquisition of experimental data to inform the stochastic description of model input parameters. Our formulation is based on the recently developed consistent…

Computation · Statistics 2021-05-04 Scott N. Walsh, Tim M. Wildey, John D. Jakeman

A Stochastic Gradient Method with Mesh Refinement for PDE Constrained Optimization under Uncertainty

Models incorporating uncertain inputs, such as random forces or material parameters, have been of increasing interest in PDE-constrained optimization. In this paper, we focus on the efficient numerical minimization of a convex and smooth…

Optimization and Control · Mathematics 2021-06-18 Caroline Geiersbach, Winnifried Wollner

A Single Time-Scale Stochastic Approximation Method for Nested Stochastic Optimization

We study constrained nested stochastic optimization problems in which the objective function is a composition of two smooth functions whose exact values and derivatives are not available. We propose a single time-scale stochastic…

Optimization and Control · Mathematics 2019-09-09 Saeed Ghadimi, Andrzej Ruszczyński, Mengdi Wang

Gradient Estimation and Variance Reduction in Stochastic and Deterministic Models

It seems that in the current age, computers, computation, and data have an increasingly important role to play in scientific research and discovery. This is reflected in part by the rise of machine learning and artificial intelligence,…

Machine Learning · Computer Science 2024-05-15 Ronan Keane

A Stochastic Approach to Bi-Level Optimization for Hyperparameter Optimization and Meta Learning

We tackle the general differentiable meta learning problem that is ubiquitous in modern deep learning, including hyperparameter optimization, loss function learning, few-shot learning, invariance learning and more. These problems are often…

Machine Learning · Computer Science 2024-10-15 Minyoung Kim, Timothy M. Hospedales

An accelerated gradient method with adaptive restart for convex multiobjective optimization problems

In this work, based on the continuous time approach, we propose an accelerated gradient method with adaptive residual restart for convex multiobjective optimization problems. For the first, we derive rigorously the continuous limit of the…

Optimization and Control · Mathematics 2025-02-06 Hao Luo, Liping Tang, Xinmin Yang

Fast Distributed Gradient Methods

We study distributed optimization problems when $N$ nodes minimize the sum of their individual costs subject to a common vector variable. The costs are convex, have Lipschitz continuous gradient (with constant $L$), and bounded gradient. We…

Information Theory · Computer Science 2014-04-15 Dusan Jakovetic, Joao Xavier, Jose M. F. Moura

The Laplace approximation accuracy in high dimensions: a refined analysis and new skew adjustment

In Bayesian inference, making deductions about a parameter of interest requires one to sample from or compute an integral against a posterior distribution. A popular method to make these computations cheaper in high-dimensional settings is…

Statistics Theory · Mathematics 2024-06-10 Anya Katsevich

Bolstering Stochastic Gradient Descent with Model Building

Stochastic gradient descent method and its variants constitute the core optimization algorithms that achieve good convergence rates for solving machine learning problems. These rates are obtained especially when these algorithms are…

Machine Learning · Computer Science 2024-03-14 S. Ilker Birbil, Ozgur Martin, Gonenc Onay, Figen Oztoprak

Stochastic Analysis of an Adaptive Cubic Regularisation Method under Inexact Gradient Evaluations and Dynamic Hessian Accuracy

We here adapt an extended version of the adaptive cubic regularisation method with dynamic inexact Hessian information for nonconvex optimisation in [3] to the stochastic optimisation setting. While exact function evaluations are still…

Numerical Analysis · Mathematics 2020-09-15 Stefania Bellavia, Gianmarco Gurioli

Shuffling Momentum Gradient Algorithm for Convex Optimization

The Stochastic Gradient Descent method (SGD) and its stochastic variants have become methods of choice for solving finite-sum optimization problems arising from machine learning and data science thanks to their ability to handle large-scale…

Optimization and Control · Mathematics 2024-03-06 Trang H. Tran, Quoc Tran-Dinh, Lam M. Nguyen

Variational Linearized Laplace Approximation for Bayesian Deep Learning

The Linearized Laplace Approximation (LLA) has been recently used to perform uncertainty estimation on the predictions of pre-trained deep neural networks (DNNs). However, its widespread application is hindered by significant computational…

Machine Learning · Statistics 2024-05-24 Luis A. Ortega, Simón Rodríguez Santana, Daniel Hernández-Lobato