Related papers: QN Optimization with Hessian Sample

Advancing Multi-Secant Quasi-Newton Methods for General Convex Functions

Quasi-Newton (QN) methods provide an efficient alternative to second-order methods for minimizing smooth unconstrained problems. While QN methods generally compose a Hessian estimate based on one secant interpolation per iteration,…

Optimization and Control · Mathematics 2025-04-11 Mokhwa Lee, Yifan Sun

Fast large-scale optimization by unifying stochastic gradient and quasi-Newton methods

We present an algorithm for minimizing a sum of functions that combines the computational efficiency of stochastic gradient descent (SGD) with the second order curvature information leveraged by quasi-Newton methods. We unify these…

Machine Learning · Computer Science 2014-12-02 Jascha Sohl-Dickstein, Ben Poole, Surya Ganguli

Block-diagonal Hessian-free Optimization for Training Neural Networks

Second-order methods for neural network optimization have several advantages over methods based on first-order gradient descent, including better scaling to large mini-batch sizes and fewer updates needed for convergence. But they are…

Machine Learning · Computer Science 2017-12-21 Huishuai Zhang, Caiming Xiong, James Bradbury, Richard Socher

Soft quasi-Newton: Guaranteed positive definiteness by relaxing the secant constraint

We propose a novel algorithm, termed soft quasi-Newton (soft QN), for optimization in the presence of bounded noise. Traditional quasi-Newton algorithms are vulnerable to such perturbations. To develop a more robust quasi-Newton method, we…

Optimization and Control · Mathematics 2024-03-06 Erik Berglund, Jiaojiao Zhang, Mikael Johansson

Symmetric Rank-One Quasi-Newton Methods for Deep Learning Using Cubic Regularization

Stochastic gradient descent and other first-order variants, such as Adam and AdaGrad, are commonly used in the field of deep learning due to their computational efficiency and low-storage memory requirements. However, these methods do not…

Optimization and Control · Mathematics 2025-02-19 Aditya Ranganath, Mukesh Singhal, Roummel Marcia

Quasi-Newton Optimization Methods For Deep Learning Applications

Deep learning algorithms often require solving a highly non-linear and nonconvex unconstrained optimization problem. Methods for solving optimization problems in large-scale machine learning, such as deep learning and deep reinforcement…

Machine Learning · Computer Science 2019-09-06 Jacob Rafati, Roummel F. Marcia

Sub-Sampled Newton Methods I: Globally Convergent Algorithms

Large scale optimization problems are ubiquitous in machine learning and data analysis and there is a plethora of algorithms for solving such problems. Many of these algorithms employ sub-sampling, as a way to either speed up the…

Optimization and Control · Mathematics 2016-02-29 Farbod Roosta-Khorasani, Michael W. Mahoney

Sub-Sampled Newton Methods II: Local Convergence Rates

Many data-fitting applications require the solution of an optimization problem involving a sum of a large number of functions of a high-dimensional parameter. Here, we consider the problem of minimizing a sum of $n$ functions over a convex…

Optimization and Control · Mathematics 2016-02-29 Farbod Roosta-Khorasani, Michael W. Mahoney

Distributed Hessian-Free Optimization for Deep Neural Network

Training deep neural networks is a high-dimensional and highly non-convex optimization problem. The stochastic gradient descent (SGD) algorithm and its variations are the current state-of-the-art solvers for this task. However, due to…

Machine Learning · Computer Science 2017-01-17 Xi He, Dheevatsa Mudigere, Mikhail Smelyanskiy, Martin Takáč

Accelerating Adaptive Cubic Regularization of Newton's Method via Random Sampling

In this paper, we consider an unconstrained optimization model where the objective is a sum of a large number of possibly nonconvex functions, though overall the objective is assumed to be smooth and convex. Our bid to solving such model…

Optimization and Control · Mathematics 2022-03-15 Xi Chen, Bo Jiang, Tianyi Lin, Shuzhong Zhang

Stochastic Block BFGS: Squeezing More Curvature out of Data

We propose a novel limited-memory stochastic block BFGS update for incorporating enriched curvature information in stochastic approximation methods. In our method, the estimate of the inverse Hessian matrix that is maintained by it, is…

Optimization and Control · Mathematics 2016-04-01 Robert M. Gower, Donald Goldfarb, Peter Richtárik

A Stochastic Quasi-Newton Method for Large-Scale Optimization

The question of how to incorporate curvature information in stochastic approximation methods is challenging. The direct application of classical quasi-Newton updating techniques for deterministic optimization leads to noisy curvature…

Optimization and Control · Mathematics 2015-02-19 R. H. Byrd, S. L. Hansen, J. Nocedal, Y. Singer

Enhance Curvature Information by Structured Stochastic Quasi-Newton Methods

In this paper, we consider stochastic second-order methods for minimizing a finite summation of nonconvex functions. One important key is to find an ingenious but cheap scheme to incorporate local curvature information. Since the true…

Optimization and Control · Mathematics 2021-03-26 Minghan Yang, Dong Xu, Hongyu Chen, Zaiwen Wen, Mengyun Chen

Error-Tolerant Quantum State Discrimination: Optimization and Quantum Circuit Synthesis

We develop error-tolerant quantum state discrimination (QSD) strategies that maintain reliable performance under moderate noise. Two complementary approaches are proposed: CrossQSD, which generalizes unambiguous discrimination with tunable…

Quantum Physics · Physics 2026-05-19 Chien-Kai Ma, Bo-Hung Chen, Tian-Fu Chen, Dah-Wei Chiou, Jie-Hong Roland Jiang

Fast Unconstrained Optimization via Hessian Averaging and Adaptive Gradient Sampling Methods

We consider minimizing finite-sum and expectation objective functions via Hessian-averaging based subsampled Newton methods. These methods allow for gradient inexactness and have fixed per-iteration Hessian approximation costs. The recent…

Optimization and Control · Mathematics 2024-08-15 Thomas O'Leary-Roseberry, Raghu Bollapragada

BFGS-ADMM for Large-Scale Distributed Optimization

We consider a class of distributed optimization problems where the objective function consists of a sum of strongly convex and smooth functions and a (possibly nonsmooth) convex regularizer. A multi-agent network is assumed, where each agent…

Optimization and Control · Mathematics 2021-10-01 Yichuan Li, Yonghai Gong, Nikolaos M. Freris, Petros Voulgaris, Dusan Stipanovic

Quasi-Quadratic Gradient: A New Direction for Accelerating the BFGS Method in Quasi-Newton Optimization

In this paper, we introduce the Quasi-Quadratic Gradient (QQG), a novel search direction designed to accelerate the BFGS method within the quasi-Newton framework. By defining the QQG as the product of the inverse Hessian approximation and…

Optimization and Control · Mathematics 2026-04-28 John Chiang

Active Sampling Sample-based Quantum Diagonalization from Finite-Shot Measurements

Near-term quantum devices provide only finite-shot measurements and prepare imperfect, contaminated states. This motivates algorithms that convert samples into reliable low-energy estimates without full tomography or exhaustive…

Quantum Physics · Physics 2026-05-12 Rinka Miura

Convergence Analysis of Block Coordinate Algorithms with Determinantal Sampling

We analyze the convergence rate of the randomized Newton-like method introduced by Qu et al. (2016) for smooth and convex objectives, which uses random coordinate blocks of a Hessian-over-approximation matrix $\mathbf{M}$ instead of the true…

Numerical Analysis · Mathematics 2020-02-13 Mojmír Mutný, Michał Dereziński, Andreas Krause

Nys-Newton: Nyström-Approximated Curvature for Stochastic Optimization

Second-order optimization methods are among the most widely used optimization approaches for convex optimization problems, and have recently been used to optimize non-convex optimization problems such as deep learning models. The widely…

Optimization and Control · Mathematics 2022-02-01 Dinesh Singh, Hardik Tankaria, Makoto Yamada