Related papers: RES: Regularized Stochastic BFGS Algorithm

Global Convergence of Online Limited Memory BFGS

Global convergence of an online (stochastic) limited memory version of the Broyden-Fletcher- Goldfarb-Shanno (BFGS) quasi-Newton method for solving optimization problems with stochastic objectives that arise in large scale machine learning…

Optimization and Control · Mathematics 2014-09-09 Aryan Mokhtari , Alejandro Ribeiro

Deep Reinforcement Learning via L-BFGS Optimization

Reinforcement Learning (RL) algorithms allow artificial agents to improve their action selections so as to increase rewarding experiences in their environments. Deep Reinforcement Learning algorithms require solving a nonconvex and…

Machine Learning · Computer Science 2019-04-18 Jacob Rafati , Roummel F. Marcia

On stochastic and deterministic quasi-Newton methods for non-Strongly convex optimization: Asymptotic convergence and rate analysis

Motivated by applications arising from large scale optimization and machine learning, we consider stochastic quasi-Newton (SQN) methods for solving unconstrained convex optimization problems. The convergence analysis of the SQN methods,…

Optimization and Control · Mathematics 2019-10-02 Farzad Yousefian , Angelia Nedić , Uday Shanbhag

Quasi-Newton Optimization Methods For Deep Learning Applications

Deep learning algorithms often require solving a highly non-linear and nonconvex unconstrained optimization problem. Methods for solving optimization problems in large-scale machine learning, such as deep learning and deep reinforcement…

Machine Learning · Computer Science 2019-09-06 Jacob Rafati , Roummel F. Marcia

A robust BFGS algorithm for unconstrained nonlinear optimization problems

In this paper, a modified BFGS algorithm is proposed. The modified BFGS matrix estimates a modified Hessian matrix which is a convex combination of an identity matrix for the steepest descent algorithm and a Hessian matrix for the Newton…

Optimization and Control · Mathematics 2025-11-14 Yaguang Yang

Comparing BFGS and OGR for Second-Order Optimization

Estimating the Hessian matrix, especially for neural network training, is a challenging problem due to high dimensionality and cost. In this work, we compare the classical Sherman-Morrison update used in the popular BFGS method…

Machine Learning · Computer Science 2025-12-09 Adrian Przybysz , Mikołaj Kołek , Franciszek Sobota , Jarek Duda

Stochastic Block BFGS: Squeezing More Curvature out of Data

We propose a novel limited-memory stochastic block BFGS update for incorporating enriched curvature information in stochastic approximation methods. In our method, the estimate of the inverse Hessian matrix that is maintained by it, is…

Optimization and Control · Mathematics 2016-04-01 Robert M. Gower , Donald Goldfarb , Peter Richtárik

On the efficiency of Stochastic Quasi-Newton Methods for Deep Learning

While first-order methods are popular for solving optimization problems that arise in large-scale deep learning problems, they come with some acute deficiencies. To diminish such shortcomings, there has been recent interest in applying…

Machine Learning · Computer Science 2023-10-05 Mahsa Yousefi , Angeles Martinez

Stochastic Subspace Descent

We present two stochastic descent algorithms that apply to unconstrained optimization and are particularly efficient when the objective function is slow to evaluate and gradients are not easily obtained, as in some PDE-constrained…

Optimization and Control · Mathematics 2019-04-30 David Kozak , Stephen Becker , Alireza Doostan , Luis Tenorio

Efficient Stochastic BFGS methods Inspired by Bayesian Principles

Quasi-Newton methods are ubiquitous in deterministic local search due to their efficiency and low computational cost. This class of methods uses the history of gradient evaluations to approximate second-order derivatives. However, only…

Optimization and Control · Mathematics 2025-11-24 André Carlon , Luis Espath , Raúl Tempone

Non-asymptotic Global Convergence Rates of BFGS with Exact Line Search

In this paper, we explore the non-asymptotic global convergence rates of the Broyden-Fletcher-Goldfarb-Shanno (BFGS) method implemented with exact line search. Notably, due to Dixon's equivalence result, our findings are also applicable to…

Optimization and Control · Mathematics 2025-07-16 Qiujiang Jin , Ruichen Jiang , Aryan Mokhtari

qNBO: quasi-Newton Meets Bilevel Optimization

Bilevel optimization, addressing challenges in hierarchical learning tasks, has gained significant interest in machine learning. The practical implementation of the gradient descent method to bilevel optimization encounters computational…

Machine Learning · Computer Science 2025-02-04 Sheng Fang , Yong-Jin Liu , Wei Yao , Chengming Yu , Jin Zhang

Stochastic Nested Variance Reduction for Nonconvex Optimization

We study finite-sum nonconvex optimization problems, where the objective function is an average of $n$ nonconvex functions. We propose a new stochastic gradient descent algorithm based on nested variance reduction. Compared with…

Machine Learning · Computer Science 2020-10-20 Dongruo Zhou , Pan Xu , Quanquan Gu

A Linearly-Convergent Stochastic L-BFGS Algorithm

We propose a new stochastic L-BFGS algorithm and prove a linear convergence rate for strongly convex and smooth functions. Our algorithm draws heavily from a recent stochastic variant of L-BFGS proposed in Byrd et al. (2014) as well as a…

Optimization and Control · Mathematics 2016-04-15 Philipp Moritz , Robert Nishihara , Michael I. Jordan

A Quasi-Newton Method for Large Scale Support Vector Machines

This paper adapts a recently developed regularized stochastic version of the Broyden, Fletcher, Goldfarb, and Shanno (BFGS) quasi-Newton method for the solution of support vector machine classification problems. The proposed method is shown…

Machine Learning · Computer Science 2014-02-21 Aryan Mokhtari , Alejandro Ribeiro

Hessian Initialization Strategies for L-BFGS Solving Non-linear Inverse Problems

L-BFGS is the state-of-the-art optimization method for many large scale inverse problems. It has a small memory footprint and achieves superlinear convergence. The method approximates Hessian based on an initial approximation and an update…

Numerical Analysis · Mathematics 2021-03-19 Hari Om Aggrawal , Jan Modersitzki

A Stochastic Quasi-Newton Method for Large-Scale Nonconvex Optimization with Applications

This paper proposes a novel stochastic version of damped and regularized BFGS method for addressing the above problems.

Numerical Analysis · Mathematics 2019-12-11 H. Chen , H. C. Wu , S. C. Chan , W. H. Lam

A structured L-BFGS method with diagonal scaling and its application to image registration

We devise an L-BFGS method for optimization problems in which the objective is the sum of two functions, where the Hessian of the first function is computationally unavailable while the Hessian of the second function has a computationally…

Optimization and Control · Mathematics 2024-09-10 Florian Mannel , Hari Om Aggrawal

Stochastic quasi-Newton methods for non-strongly convex problems: convergence and rate analysis

Motivated by applications in optimization and machine learning, we consider stochastic quasi-Newton (SQN) methods for solving stochastic optimization problems. In the literature, the convergence analysis of these algorithms relies on strong…

Optimization and Control · Mathematics 2016-03-16 Farzad Yousefian , Angelia Nedić , Uday V. Shanbha

A Retrospective Approximation Approach for Smooth Stochastic Optimization

Stochastic Gradient (SG) is the defacto iterative technique to solve stochastic optimization (SO) problems with a smooth (non-convex) objective $f$ and a stochastic first-order oracle. SG's attractiveness is due in part to its simplicity of…

Optimization and Control · Mathematics 2024-03-08 David Newton , Raghu Bollapragada , Raghu Pasupathy , Nung Kwan Yip