Related papers: Explicit Second-Order Min-Max Optimization: Practi…

Second-Order Min-Max Optimization with Lazy Hessians

This paper studies second-order methods for convex-concave minimax optimization. Monteiro and Svaiter (2012) proposed a method to solve the problem with an optimal iteration complexity of $\mathcal{O}(\epsilon^{-3/2})$ to find an…

Optimization and Control · Mathematics 2025-04-16 Lesi Chen , Chengchang Liu , Jingzhao Zhang

Identifying and attacking the saddle point problem in high-dimensional non-convex optimization

A central challenge to many fields of science and engineering involves minimizing non-convex error functions over continuous, high dimensional spaces. Gradient descent or quasi-Newton methods are almost ubiquitously used to perform such…

Machine Learning · Computer Science 2014-06-11 Yann Dauphin , Razvan Pascanu , Caglar Gulcehre , Kyunghyun Cho , Surya Ganguli , Yoshua Bengio

Second-Order Methods with Cubic Regularization Under Inexact Information

In this paper, we generalize (accelerated) Newton's method with cubic regularization under inexact second-order information for (strongly) convex optimization problems. Under mild assumptions, we provide global rate of convergence of these…

Optimization and Control · Mathematics 2017-10-17 Saeed Ghadimi , Han Liu , Tong Zhang

A Generic Approach for Escaping Saddle points

A central challenge to using first-order methods for optimizing nonconvex problems is the presence of saddle points. First-order methods often get stuck at saddle points, greatly deteriorating their performance. Typically, to escape from…

Machine Learning · Computer Science 2017-09-06 Sashank J Reddi , Manzil Zaheer , Suvrit Sra , Barnabas Poczos , Francis Bach , Ruslan Salakhutdinov , Alexander J Smola

Primal-Dual Sequential Subspace Optimization for Saddle-point Problems

We introduce a new sequential subspace optimization method for large-scale saddle-point problems. It solves iteratively a sequence of auxiliary saddle-point problems in low-dimensional subspaces, spanned by directions derived from…

Optimization and Control · Mathematics 2020-08-24 Yoni Choukroun , Michael Zibulevsky , Pavel Kisilev

On Solving Minimization and Min-Max Problems by First-Order Methods with Relative Error in Gradients

First-order methods for minimization and saddle point (min-max) problems are widely used for solving large-scale problems, in particular arising in machine learning. The majority of works obtain favorable complexity guarantees of such…

Optimization and Control · Mathematics 2025-12-10 Artem Vasin , Valery Krivchenko , Dmitry Kovalev , Fedyor Stonyakin , Nazarii Tupitsa , Pavel Dvurechensky , Mohammad Alkousa , Nikita Kornilov , Alexander Gasnikov

Inexact Successive Quadratic Approximation for Regularized Optimization

Successive quadratic approximations, or second-order proximal methods, are useful for minimizing functions that are a sum of a smooth part and a convex, possibly nonsmooth part that promotes regularization. Most analyses of iteration…

Optimization and Control · Mathematics 2019-01-25 Ching-pei Lee , Stephen J. Wright

Zeroth-Order Algorithms for Smooth Saddle-Point Problems

Saddle-point problems have recently gained increased attention from the machine learning community, mainly due to applications in training Generative Adversarial Networks using stochastic gradients. At the same time, in some applications…

Optimization and Control · Mathematics 2021-09-07 Abdurakhmon Sadiev , Aleksandr Beznosikov , Pavel Dvurechensky , Alexander Gasnikov

Extra-Newton: A First Approach to Noise-Adaptive Accelerated Second-Order Methods

This work proposes a universal and adaptive second-order method for minimizing second-order smooth, convex functions. Our algorithm achieves $O(\sigma / \sqrt{T})$ convergence when the oracle feedback is stochastic with variance $\sigma^2$,…

Optimization and Control · Mathematics 2022-12-13 Kimon Antonakopoulos , Ali Kavis , Volkan Cevher

Faster Riemannian Newton-type Optimization by Subsampling and Cubic Regularization

This work is on constrained large-scale non-convex optimization where the constraint set implies a manifold structure. Solving such problems is important in a multitude of fundamental machine learning tasks. Recent advances on Riemannian…

Machine Learning · Computer Science 2023-02-23 Yian Deng , Tingting Mu

On the saddle point problem for non-convex optimization

A central challenge to many fields of science and engineering involves minimizing non-convex error functions over continuous, high dimensional spaces. Gradient descent or quasi-Newton methods are almost ubiquitously used to perform such…

Machine Learning · Computer Science 2014-05-29 Razvan Pascanu , Yann N. Dauphin , Surya Ganguli , Yoshua Bengio

Escaping saddle points in zeroth-order optimization: the power of two-point estimators

Two-point zeroth order methods are important in many applications of zeroth-order optimization, such as robotics, wind farms, power systems, online optimization, and adversarial robustness to black-box attacks in deep neural networks, where…

Optimization and Control · Mathematics 2023-05-10 Zhaolin Ren , Yujie Tang , Na Li

An Approach for Non-Convex Uniformly Concave Structured Saddle Point Problem

Recently, saddle point problems have received much attention due to their powerful modeling capability for a lot of problems from diverse domains. Applications of these problems occur in many applied areas, such as robust optimization,…

Optimization and Control · Mathematics 2022-02-15 Mohammad Alkousa , Alexander Gasnikov , Pavel Dvurechensky , Abdurakhmon Sadiev , Lama Razouk

Beyond first-order methods for non-convex non-concave min-max optimization

We propose a study of structured non-convex non-concave min-max problems which goes beyond standard first-order approaches. Inspired by the tight understanding established in recent works [Adil et al., 2022, Lin and Jordan, 2022b], we…

Optimization and Control · Mathematics 2023-04-18 Abhijeet Vyas , Brian Bullins

Second-Order Optimization for Non-Convex Machine Learning: An Empirical Study

While first-order optimization methods such as stochastic gradient descent (SGD) are popular in machine learning (ML), they come with well-known deficiencies, including relatively-slow convergence, sensitivity to the settings of…

Optimization and Control · Mathematics 2018-02-19 Peng Xu , Farbod Roosta-Khorasani , Michael W. Mahoney

Solving Convex-Concave Problems with $\tilde{\mathcal{O}}(\epsilon^{-4/7})$ Second-Order Oracle Complexity

Previous algorithms can solve convex-concave minimax problems $\min_{x \in \mathcal{X}} \max_{y \in \mathcal{Y}} f(x,y)$ with $\mathcal{O}(\epsilon^{-2/3})$ second-order oracle calls using Newton-type methods. This result has been…

Optimization and Control · Mathematics 2025-06-11 Lesi Chen , Chengchang Liu , Luo Luo , Jingzhao Zhang

A Newton-Based Method for Nonconvex Optimization with Fast Evasion of Saddle Points

Machine learning problems such as neural network training, tensor decomposition, and matrix factorization, require local minimization of a nonconvex function. This local minimization is challenged by the presence of saddle points, of which…

Optimization and Control · Mathematics 2018-07-23 Santiago Paternain , Aryan Mokhtari , Alejandro Ribeiro

Second-Order Optimization via Quiescence

Second-order optimization methods exhibit fast convergence to critical points, however, in nonconvex optimization, these methods often require restrictive step-sizes to ensure a monotonically decreasing objective function. In the presence…

Optimization and Control · Mathematics 2024-10-11 Aayushya Agarwal , Larry Pileggi , Ronald Rohrer

Cubic regularized subspace Newton for non-convex optimization

This paper addresses the optimization problem of minimizing non-convex continuous functions, which is relevant in the context of high-dimensional machine learning applications characterized by over-parametrization. We analyze a randomized…

Machine Learning · Computer Science 2025-02-28 Jim Zhao , Aurelien Lucchi , Nikita Doikov

Yet another fast variant of Newton's method for nonconvex optimization

A class of second-order algorithms is proposed for minimizing smooth nonconvex functions that alternates between regularized Newton and negative curvature steps in an iteration-dependent subspace. In most cases, the Hessian matrix is…

Optimization and Control · Mathematics 2023-08-22 Serge Gratton , Sadok Jerad , Philippe L. Toint