Related papers: Using second-order information in gradient samplin…

Gradient-Normalized Smoothness for Optimization with Approximate Hessians

In this work, we develop new optimization algorithms that use approximate second-order information combined with the gradient regularization technique to achieve fast global convergence rates for both convex and non-convex objectives. The…

Optimization and Control · Mathematics 2025-06-17 Andrei Semenov, Martin Jaggi, Nikita Doikov

Second-Order Information in Non-Convex Stochastic Optimization: Power and Limitations

We design an algorithm which finds an $\epsilon$-approximate stationary point (with $\|\nabla F(x)\|\le \epsilon$) using $O(\epsilon^{-3})$ stochastic gradient and Hessian-vector products, matching guarantees that were previously available…

Machine Learning · Computer Science 2020-06-25 Yossi Arjevani, Yair Carmon, John C. Duchi, Dylan J. Foster, Ayush Sekhari, Karthik Sridharan

Gradient sampling algorithm for subsmooth functions

This paper considers non-smooth optimization problems where we seek to minimize the pointwise maximum of a continuously parameterized family of functions. Since the objective function is given as the solution to a maximization problem,…

Optimization and Control · Mathematics 2026-01-12 Dimitris Boskos, Jorge Cortés, Sonia Martínez

Extra-Newton: A First Approach to Noise-Adaptive Accelerated Second-Order Methods

This work proposes a universal and adaptive second-order method for minimizing second-order smooth, convex functions. Our algorithm achieves $O(\sigma / \sqrt{T})$ convergence when the oracle feedback is stochastic with variance $\sigma^2$,…

Optimization and Control · Mathematics 2022-12-13 Kimon Antonakopoulos, Ali Kavis, Volkan Cevher

Zeroth-Order Non-smooth Non-convex Optimization via Gaussian Smoothing

This paper addresses stochastic optimization of Lipschitz-continuous, nonsmooth and nonconvex objectives over compact convex sets, where only noisy function evaluations are available. While gradient-free methods have been developed for…

Optimization and Control · Mathematics 2025-08-26 Anik Kumar Paul, Shalabh Bhatnagar

A Stochastic Semismooth Newton Method for Nonsmooth Nonconvex Optimization

In this work, we present a globalized stochastic semismooth Newton method for solving stochastic optimization problems involving smooth nonconvex and nonsmooth convex terms in the objective function. We assume that only noisy gradient and…

Optimization and Control · Mathematics 2018-03-12 Andre Milzarek, Xiantao Xiao, Shicong Cen, Zaiwen Wen, Michael Ulbrich

A non-monotone trust-region method with noisy oracles and additional sampling

In this work, we introduce a novel stochastic second-order method, within the framework of a non-monotone trust-region approach, for solving the unconstrained, nonlinear, and non-convex optimization problems arising in the training of deep…

Optimization and Control · Mathematics 2024-01-18 Natasa Krejic, Natasa Krklec Jerinkic, Angeles Martinez, Mahsa Yousefi

Second-Order Subdifferential Optimality Conditions in Nonsmooth Optimization

The paper is devoted to deriving novel second-order necessary and sufficient optimality conditions for local minimizers in rather general classes of nonsmooth unconstrained and constrained optimization problems in finite-dimensional spaces.…

Optimization and Control · Mathematics 2025-01-07 Pham Duy Khanh, Vu Vinh Huy Khoa, Boris S. Mordukhovich, Vo Thanh Phat

Second order semi-smooth Proximal Newton methods in Hilbert spaces

We develop a globalized Proximal Newton method for composite and possibly non-convex minimization problems in Hilbert spaces. Additionally, we impose less restrictive assumptions on the composite objective functional considering…

Optimization and Control · Mathematics 2021-11-02 Bastian Pötzl, Anton Schiela, Patrick Jaap

Faster Differentially Private Convex Optimization via Second-Order Methods

Differentially private (stochastic) gradient descent is the workhorse of DP private machine learning in both the convex and non-convex settings. Without privacy constraints, second-order methods, like Newton's method, converge faster than…

Machine Learning · Computer Science 2023-05-23 Arun Ganesh, Mahdi Haghifam, Thomas Steinke, Abhradeep Thakurta

A Stochastic Extra-Step Quasi-Newton Method for Nonsmooth Nonconvex Optimization

In this paper, a novel stochastic extra-step quasi-Newton method is developed to solve a class of nonsmooth nonconvex composite optimization problems. We assume that the gradient of the smooth part of the objective function can only be…

Optimization and Control · Mathematics 2019-10-22 Minghan Yang, Andre Milzarek, Zaiwen Wen, Tong Zhang

A new method based on the bundle idea and gradient sampling technique for minimizing nonsmooth convex functions

In this paper, we combine the positive aspects of the Gradient Sampling (GS) and bundle methods, as the most efficient methods in nonsmooth optimization, to develop a robust method for solving unconstrained nonsmooth convex optimization…

Optimization and Control · Mathematics 2019-11-26 M. Maleknia, M. Shamsi

Explicit Second-Order Min-Max Optimization: Practical Algorithms and Complexity Analysis

We propose and analyze several inexact regularized Newton-type methods for finding a global saddle point of convex-concave unconstrained min-max optimization problems. Compared to first-order methods, our understanding of second-order…

Optimization and Control · Mathematics 2026-05-27 Tianyi Lin, Panayotis Mertikopoulos, Michael I. Jordan

A second order primal-dual method for nonsmooth convex composite optimization

We develop a second order primal-dual method for optimization problems in which the objective function is given by the sum of a strongly convex twice differentiable term and a possibly nondifferentiable convex regularizer. After introducing…

Optimization and Control · Mathematics 2020-08-31 Neil K. Dhingra, Sei Zhen Khong, Mihailo R. Jovanović

Low-Order Explicit Hessian Imitation Method for Large-Scale Supervised Machine Learning

An algorithm is proposed for solving optimization problems arising in neural network training for supervised learning. The unique feature of the algorithm is the use of an auxiliary loss, in addition to the original loss employed for model…

Optimization and Control · Mathematics 2026-05-11 Yunlang Zhu, Lingjun Guo, Zahra Khatti, Xiaoyi Qu, Chia-Yuan Wu, Lara Zebiane, Frank E. Curtis

A Chain Rule for Strict Twice Epi-Differentiability and its Applications

The presence of second-order smoothness for objective functions of optimization problems can provide valuable information about their stability properties and help us design efficient numerical algorithms for solving these problems. Such…

Optimization and Control · Mathematics 2023-08-04 N. T. V. Hang, M. E. Sarabi

Optimal Stochastic Non-smooth Non-convex Optimization through Online-to-Non-convex Conversion

We present new algorithms for optimizing non-smooth, non-convex stochastic objectives based on a novel analysis technique. This improves the current best-known complexity for finding a $(\delta,\epsilon)$-stationary point from…

Machine Learning · Computer Science 2025-08-08 Ashok Cutkosky, Harsh Mehta, Francesco Orabona

Zeroth-Order Nonconvex Nonsmooth Optimization with Heavy-Tailed Noise

This paper considers the nonconvex nonsmooth problem in which the objective function is Lipschitz continuous. We focus on the stochastic setting where the algorithm can access stochastic function value evaluations with heavy-tailed noise,…

Machine Learning · Computer Science 2026-05-26 Zhuanghua Liu, Luo Luo

The bilinear Hessian for large scale optimization

Second order information is useful in many ways in smooth optimization problems, including for the design of step size rules and descent directions, or the analysis of the local properties of the objective functional. However, the…

Optimization and Control · Mathematics 2025-02-06 Marcus Carlsson, Viktor Nikitin, Erik Troedsson, Herwig Wendt

A Subsampling Line-Search Method with Second-Order Results

In many contemporary optimization problems such as those arising in machine learning, it can be computationally challenging or even infeasible to evaluate an entire function or its derivatives. This motivates the use of stochastic…

Optimization and Control · Mathematics 2021-07-01 El-houcine Bergou, Youssef Diouane, Vladimir Kunc, Vyacheslav Kungurtsev, Clément W. Royer