Related papers: Sample Efficient Stochastic Variance-Reduced Cubic…

Stochastic Recursive Variance-Reduced Cubic Regularization Methods

Stochastic Variance-Reduced Cubic regularization (SVRC) algorithms have received increasing attention due to its improved gradient/Hessian complexities (i.e., number of queries to stochastic gradient/Hessian oracles) to find local minima…

Optimization and Control · Mathematics 2019-10-14 Dongruo Zhou , Quanquan Gu

Stochastic Variance-Reduced Cubic Regularization for Nonconvex Optimization

Cubic regularization (CR) is an optimization method with emerging popularity due to its capability to escape saddle points and converge to second-order stationary solutions for nonconvex optimization. However, CR encounters a high sample…

Optimization and Control · Mathematics 2018-10-10 Zhe Wang , Yi Zhou , Yingbin Liang , Guanghui Lan

Stochastic Variance-Reduced Cubic Regularized Newton Method

We propose a stochastic variance-reduced cubic regularized Newton method for non-convex optimization. At the core of our algorithm is a novel semi-stochastic gradient along with a semi-stochastic Hessian, which are specifically designed for…

Machine Learning · Computer Science 2018-02-14 Dongruo Zhou , Pan Xu , Quanquan Gu

Stochastic Cubic Regularization for Fast Nonconvex Optimization

This paper proposes a stochastic variant of a classic algorithm---the cubic-regularized Newton method [Nesterov and Polyak 2006]. The proposed algorithm efficiently escapes saddle points and finds approximate local minima for general…

Machine Learning · Computer Science 2017-12-07 Nilesh Tripuraneni , Mitchell Stern , Chi Jin , Jeffrey Regier , Michael I. Jordan

Adaptive Stochastic Variance Reduction for Subsampled Newton Method with Cubic Regularization

The cubic regularized Newton method of Nesterov and Polyak has become increasingly popular for non-convex optimization because of its capability of finding an approximate local solution with second-order guarantee. Several recent works…

Optimization and Control · Mathematics 2018-11-29 Junyu Zhang , Lin Xiao , Shuzhong Zhang

Accelerating Adaptive Cubic Regularization of Newton's Method via Random Sampling

In this paper, we consider an unconstrained optimization model where the objective is a sum of a large number of possibly nonconvex functions, though overall the objective is assumed to be smooth and convex. Our bid to solving such model…

Optimization and Control · Mathematics 2022-03-15 Xi Chen , Bo Jiang , Tianyi Lin , Shuzhong Zhang

A Stochastic Trust Region Method for Non-convex Minimization

We target the problem of finding a local minimum in non-convex finite-sum minimization. Towards this goal, we first prove that the trust region method with inexact gradient and Hessian estimation can achieve a convergence rate of order…

Optimization and Control · Mathematics 2019-03-06 Zebang Shen , Pan Zhou , Cong Fang , Alejandro Ribeiro

Sub-sampled Cubic Regularization for Non-convex Optimization

We consider the minimization of non-convex functions that typically arise in machine learning. Specifically, we focus our attention on a variant of trust region methods known as cubic regularization. This approach is particularly attractive…

Machine Learning · Computer Science 2017-07-04 Jonas Moritz Kohler , Aurelien Lucchi

Stochastic Variance Reduction for Nonconvex Optimization

We study nonconvex finite-sum problems and analyze stochastic variance reduced gradient (SVRG) methods for them. SVRG and related methods have recently surged into prominence for convex optimization given their edge over stochastic gradient…

Optimization and Control · Mathematics 2016-04-06 Sashank J. Reddi , Ahmed Hefny , Suvrit Sra , Barnabas Poczos , Alex Smola

Improved SVRG for quadratic functions

We analyse an iterative algorithm to minimize quadratic functions whose Hessian matrix $H$ is the expectation of a random symmetric $d\times d$ matrix. The algorithm is a variant of the stochastic variance reduced gradient (SVRG). In…

Machine Learning · Computer Science 2021-06-16 Nabil Kahale

Stochastic Optimization for Non-convex Problem with Inexact Hessian Matrix, Gradient, and Function

Trust-region (TR) and adaptive regularization using cubics (ARC) have proven to have some very appealing theoretical properties for non-convex optimization by concurrently computing function value, gradient, and Hessian matrix to obtain the…

Machine Learning · Computer Science 2023-10-19 Liu Liu , Xuanqing Liu , Cho-Jui Hsieh , Dacheng Tao

Stochastic Nested Variance Reduction for Nonconvex Optimization

We study finite-sum nonconvex optimization problems, where the objective function is an average of $n$ nonconvex functions. We propose a new stochastic gradient descent algorithm based on nested variance reduction. Compared with…

Machine Learning · Computer Science 2020-10-20 Dongruo Zhou , Pan Xu , Quanquan Gu

Finding Local Minima via Stochastic Nested Variance Reduction

We propose two algorithms that can find local minima faster than the state-of-the-art algorithms in both finite-sum and general stochastic nonconvex optimization. At the core of the proposed algorithms is $\text{One-epoch-SNVRG}^+$ using…

Machine Learning · Computer Science 2018-06-25 Dongruo Zhou , Pan Xu , Quanquan Gu

Cubic regularization methods with second-order complexity guarantee based on a new subproblem reformulation

The cubic regularization (CR) algorithm has attracted a lot of attentions in the literature in recent years. We propose a new reformulation of the cubic regularization subproblem. The reformulation is an unconstrained convex problem that…

Optimization and Control · Mathematics 2021-12-20 Rujun Jiang , Zhishuo Zhou , Zirui Zhou

Stochastic Analysis of an Adaptive Cubic Regularisation Method under Inexact Gradient Evaluations and Dynamic Hessian Accuracy

We here adapt an extended version of the adaptive cubic regularisation method with dynamic inexact Hessian information for nonconvex optimisation in [3] to the stochastic optimisation setting. While exact function evaluations are still…

Numerical Analysis · Mathematics 2020-09-15 Stefania Bellavia , Gianmarco Gurioli

Newton-Type Methods for Non-Convex Optimization Under Inexact Hessian Information

We consider variants of trust-region and cubic regularization methods for non-convex optimization, in which the Hessian matrix is approximated. Under mild conditions on the inexact Hessian, and using approximate solution of the…

Optimization and Control · Mathematics 2019-05-15 Peng Xu , Fred Roosta , Michael W. Mahoney

Stochastic Second-order Methods for Non-convex Optimization with Inexact Hessian and Gradient

Trust region and cubic regularization methods have demonstrated good performance in small scale non-convex optimization, showing the ability to escape from saddle points. Each iteration of these methods involves computation of gradient,…

Optimization and Control · Mathematics 2018-09-27 Liu Liu , Xuanqing Liu , Cho-Jui Hsieh , Dacheng Tao

Stochastic Variance-Reduced Newton: Accelerating Finite-Sum Minimization with Large Batches

Stochastic variance reduction has proven effective at accelerating first-order algorithms for solving convex finite-sum optimization tasks such as empirical risk minimization. Incorporating second-order information has proven helpful in…

Optimization and Control · Mathematics 2025-04-30 Michał Dereziński

Adaptive Cubic Regularization Methods with Dynamic Inexact Hessian Information and Applications to Finite-Sum Minimization

We consider the Adaptive Regularization with Cubics approach for solving nonconvex optimization problems and propose a new variant based on inexact Hessian information chosen dynamically. The theoretical analysis of the proposed procedure…

Optimization and Control · Mathematics 2019-12-04 Stefania Bellavia , Gianmarco Gurioli , Benedetta Morini

Cubic regularized subspace Newton for non-convex optimization

This paper addresses the optimization problem of minimizing non-convex continuous functions, which is relevant in the context of high-dimensional machine learning applications characterized by over-parametrization. We analyze a randomized…

Machine Learning · Computer Science 2025-02-28 Jim Zhao , Aurelien Lucchi , Nikita Doikov