Related papers: Stochastic Steffensen method

General convergence analysis of stochastic first order methods for composite optimization

In this paper we consider stochastic composite convex optimization problems with the objective function satisfying a stochastic bounded gradient condition, with or without a quadratic functional growth property. These models include the…

Optimization and Control · Mathematics 2020-03-10 Ion Necoara

Adaptive First- and Second-Order Algorithms for Large-Scale Machine Learning

In this paper, we consider both first- and second-order techniques to address continuous optimization problems arising in machine learning. In the first-order case, we propose a framework of transition from deterministic or…

Machine Learning · Computer Science 2021-11-30 Sanae Lotfi , Tiphaine Bonniot de Ruisselet , Dominique Orban , Andrea Lodi

Stochastic Newton and Cubic Newton Methods with Simple Local Linear-Quadratic Rates

We present two new remarkably simple stochastic second-order methods for minimizing the average of a very large number of sufficiently smooth and strongly convex functions. The first is a stochastic variant of Newton's method (SN), and the…

Machine Learning · Computer Science 2019-12-04 Dmitry Kovalev , Konstantin Mishchenko , Peter Richtárik

Step size adaptation in first-order method for stochastic strongly convex programming

We propose a first-order method for stochastic strongly convex optimization that attains $O(1/n)$ rate of convergence, analysis show that the proposed method is simple, easily to implement, and in worst case, asymptotically four times…

Optimization and Control · Mathematics 2011-10-14 Peng Cheng

Random directions stochastic approximation with deterministic perturbations

We introduce deterministic perturbation schemes for the recently proposed random directions stochastic approximation (RDSA) [17], and propose new first-order and second-order algorithms. In the latter case, these are the first second-order…

Optimization and Control · Mathematics 2019-03-29 Prashanth L A , Shalabh Bhatnagar , Nirav Bhavsar , Michael Fu , Steven I. Marcus

Lower Bounds for Finding Stationary Points II: First-Order Methods

We establish lower bounds on the complexity of finding $\epsilon$-stationary points of smooth, non-convex high-dimensional functions using first-order methods. We prove that deterministic first-order methods, even applied to arbitrarily…

Optimization and Control · Mathematics 2017-11-03 Yair Carmon , John C. Duchi , Oliver Hinder , Aaron Sidford

Stochastic Analysis of an Adaptive Cubic Regularisation Method under Inexact Gradient Evaluations and Dynamic Hessian Accuracy

We here adapt an extended version of the adaptive cubic regularisation method with dynamic inexact Hessian information for nonconvex optimisation in [3] to the stochastic optimisation setting. While exact function evaluations are still…

Numerical Analysis · Mathematics 2020-09-15 Stefania Bellavia , Gianmarco Gurioli

First-order methods for stochastic and finite-sum convex optimization with deterministic constraints

In this paper, we study a class of stochastic and finite-sum convex optimization problems with deterministic constraints. Existing methods typically aim to find an $\epsilon$-$expectedly\ feasible\ stochastic\ optimal$ solution, in which…

Optimization and Control · Mathematics 2025-06-26 Zhaosong Lu , Yifeng Xiao

Efficiently Escaping Saddle Points under Generalized Smoothness via Self-Bounding Regularity

We study the optimization of non-convex functions that are not necessarily smooth (gradient and/or Hessian are Lipschitz) using first order methods. Smoothness is a restrictive assumption in machine learning in both theory and practice,…

Optimization and Control · Mathematics 2025-06-27 Daniel Yiming Cao , August Y. Chen , Karthik Sridharan , Benjamin Tang

First-order methods almost always avoid saddle points: the case of vanishing step-sizes

In a series of papers \cite{LSJR16, PP17, LPP}, it was established that some of the most commonly used first order methods almost surely (under random initializations) and with step-size being small enough, avoid strict saddle points, as…

Optimization and Control · Mathematics 2025-09-30 Ioannis Panageas , Georgios Piliouras , Xiao Wang

On Constraints in First-Order Optimization: A View from Non-Smooth Dynamical Systems

We introduce a class of first-order methods for smooth constrained optimization that are based on an analogy to non-smooth dynamical systems. Two distinctive features of our approach are that (i) projections or optimizations over the entire…

Optimization and Control · Mathematics 2025-04-15 Michael Muehlebach , Michael I. Jordan

Stochastic Variance-Reduced Newton: Accelerating Finite-Sum Minimization with Large Batches

Stochastic variance reduction has proven effective at accelerating first-order algorithms for solving convex finite-sum optimization tasks such as empirical risk minimization. Incorporating second-order information has proven helpful in…

Optimization and Control · Mathematics 2025-04-30 Michał Dereziński

Layer-wise Adaptive Step-Sizes for Stochastic First-Order Methods for Deep Learning

We propose a new per-layer adaptive step-size procedure for stochastic first-order optimization methods for minimizing empirical loss functions in deep learning, eliminating the need for the user to tune the learning rate (LR). The proposed…

Machine Learning · Computer Science 2023-07-07 Achraf Bahamou , Donald Goldfarb

The Speed-Robustness Trade-Off for First-Order Methods with Additive Gradient Noise

We study the trade-off between convergence rate and sensitivity to stochastic additive gradient noise for first-order optimization methods. Ordinary Gradient Descent (GD) can be made fast-and-sensitive or slow-and-robust by increasing or…

Optimization and Control · Mathematics 2025-11-07 Bryan Van Scoy , Laurent Lessard

Sequential Quadratic Optimization for Solving Expectation Equality Constrained Stochastic Optimization Problems

A sequential quadratic programming method is designed for solving general smooth nonlinear stochastic optimization problems subject to expectation equality constraints. We consider the setting where the objective and constraint function…

Optimization and Control · Mathematics 2026-03-17 Haoming Shen , Yang Zeng , Baoyu Zhou

Bridging Constraints and Stochasticity: A Fully First-Order Method for Stochastic Bilevel Optimization with Linear Constraints

This work provides the first finite-time convergence guarantees for linearly constrained stochastic bilevel optimization using only first-order methods, requiring solely gradient information without any Hessian computations or second-order…

Optimization and Control · Mathematics 2025-11-18 Cac Phan , Kai Wang

Faster Differentially Private Convex Optimization via Second-Order Methods

Differentially private (stochastic) gradient descent is the workhorse of DP private machine learning in both the convex and non-convex settings. Without privacy constraints, second-order methods, like Newton's method, converge faster than…

Machine Learning · Computer Science 2023-05-23 Arun Ganesh , Mahdi Haghifam , Thomas Steinke , Abhradeep Thakurta

Adaptive Multilevel Newton: A Quadratically Convergent Optimization Method

Newton's method may exhibit slower convergence than vanilla Gradient Descent in its initial phase on strongly convex problems. Classical Newton-type multilevel methods mitigate this but, like Gradient Descent, achieve only linear…

Optimization and Control · Mathematics 2026-02-25 Nick Tsipinakis , Panos Parpas , Matthias Voigt

Adaptive Multilevel Newton: A Quadratically Convergent Optimization Method

Newton's method may exhibit slower convergence than vanilla Gradient Descent in its initial phase on strongly convex problems. Classical Newton-type multilevel methods mitigate this but, like Gradient Descent, achieve only linear…

Optimization and Control · Mathematics 2026-03-05 Nick Tsipinakis , Panagiotis Tigkas , Panos Parpas

Stochastic first-order methods: non-asymptotic and computer-aided analyses via potential functions

We provide a novel computer-assisted technique for systematically analyzing first-order methods for optimization. In contrast with previous works, the approach is particularly suited for handling sublinear convergence rates and stochastic…

Optimization and Control · Mathematics 2021-12-22 Adrien Taylor , Francis Bach