Related papers: OFFO minimization algorithms for second-order opti…

Complexity of a Class of First-Order Objective-Function-Free Optimization Algorithms

A parametric class of trust-region algorithms for unconstrained nonconvex optimization is considered where the value of the objective function is never computed. The class contains a deterministic version of the first-order Adagrad method…

Optimization and Control · Mathematics 2023-06-08 S. Gratton, S. Jerad, Ph. L. Toint

An objective-function-free algorithm for general smooth constrained optimization

A new algorithm for smooth constrained optimization is proposed that never computes the value of the problem's objective function and that handles both equality and inequality constraints. The algorithm uses an adaptive switching strategy…

Optimization and Control · Mathematics 2026-02-13 S. Bellavia, S. Gratton, B. Morini, Ph. L. Toint

Objective-Function Free Multi-Objective Optimization: Rate of Convergence and Performance of an Adagrad-like algorithm

We propose an Adagrad-like algorithm for multi-objective unconstrained optimization that relies on the computation of a common descent direction only. Unlike classical local algorithms for multi-objective optimization, our approach does not…

Optimization and Control · Mathematics 2026-02-06 Marianna De Santis, Gabriele Eichfelder, Margherita Porcelli

An objective-function-free algorithm for nonconvex stochastic optimization with deterministic equality and inequality constraints

An algorithm is proposed for solving optimization problems with stochastic objective and deterministic equality and inequality constraints. This algorithm is objective-function-free in the sense that it only uses the objective's gradient…

Optimization and Control · Mathematics 2026-04-01 S. Gratton, Ph. L. Toint

Recursive Bound-Constrained AdaGrad with Applications to Multilevel and Domain Decomposition Minimization

Two OFFO (Objective-Function Free Optimization) noise tolerant algorithms are presented that handle bound constraints, inexact gradients and use second-order information when available. The first is a multi-level method exploiting a…

Optimization and Control · Mathematics 2025-07-16 Serge Gratton, Alena Kopaničáková, Philippe Toint

Complexity of Adagrad and other first-order methods for nonconvex optimization problems with bounds constraints

A parametric class of trust-region algorithms for constrained nonconvex optimization is analyzed, where the objective function is never computed. By defining appropriate first-order stationarity criteria, we are able to extend the Adagrad…

Optimization and Control · Mathematics 2024-11-04 Serge Gratton, Sadok Jerad, Philippe L. Toint

Zeroth-Order Regularized Optimization (ZORO): Approximately Sparse Gradients and Adaptive Sampling

We consider the problem of minimizing a high-dimensional objective function, which may include a regularization term, using (possibly noisy) evaluations of the function. Such optimization is also called derivative-free, zeroth-order, or…

Optimization and Control · Mathematics 2023-03-20 HanQin Cai, Daniel Mckenzie, Wotao Yin, Zhenliang Zhang

A Simple First-Order Algorithm for Full-Rank Equality Constrained Optimization

A very simple first-order algorithm is proposed for solving nonlinear optimization problems with deterministic nonlinear equality constraints. This algorithm adaptively selects steps in the plane tangent to the constraints or steps that…

Optimization and Control · Mathematics 2026-03-11 Serge Gratton, Philippe L. Toint

Extra-Newton: A First Approach to Noise-Adaptive Accelerated Second-Order Methods

This work proposes a universal and adaptive second-order method for minimizing second-order smooth, convex functions. Our algorithm achieves $O(\sigma / \sqrt{T})$ convergence when the oracle feedback is stochastic with variance $\sigma^2$,…

Optimization and Control · Mathematics 2022-12-13 Kimon Antonakopoulos, Ali Kavis, Volkan Cevher

A Stochastic Objective-Function-Free Adaptive Regularization Method with Optimal Complexity

A fully stochastic second-order adaptive-regularization method for unconstrained nonconvex optimization is presented which never computes the objective-function value, but yet achieves the optimal $\mathcal{O}(\epsilon^{-3/2})$ complexity…

Optimization and Control · Mathematics 2025-01-22 Serge Gratton, Sadok Jerad, Philippe L. Toint

Gradientless Descent: High-Dimensional Zeroth-Order Optimization

Zeroth-order optimization is the process of minimizing an objective $f(x)$, given oracle access to evaluations at adaptively chosen inputs $x$. In this paper, we present two simple yet powerful GradientLess Descent (GLD) algorithms that do…

Machine Learning · Computer Science 2020-05-20 Daniel Golovin, John Karro, Greg Kochanski, Chansoo Lee, Xingyou Song, Qiuyi Zhang

An optimally fast objective-function-free minimization algorithm using random subspaces

An algorithm for unconstrained non-convex optimization is described, which does not evaluate the objective function and in which minimization is carried out, at each iteration, within a randomly selected subspace. It is shown that this…

Optimization and Control · Mathematics 2025-01-31 S. Bellavia, S. Gratton, B. Morini, Ph. L. Toint

Efficiently avoiding saddle points with zero order methods: No gradients required

We consider the case of derivative-free algorithms for non-convex optimization, also known as zero order algorithms, that use only function evaluations rather than gradients. For a wide variety of gradient approximators based on finite…

Optimization and Control · Mathematics 2019-10-30 Lampros Flokas, Emmanouil-Vasileios Vlatakis-Gkaragkounis, Georgios Piliouras

On Graduated Optimization for Stochastic Non-Convex Problems

The graduated optimization approach, also known as the continuation method, is a popular heuristic to solving non-convex problems that has received renewed interest over the last decade. Despite its popularity, very little is known in terms…

Machine Learning · Computer Science 2015-07-28 Elad Hazan, Kfir Y. Levy, Shai Shalev-Shwartz

Complexity and performance for two classes of noise-tolerant first-order algorithms

Two classes of algorithms for optimization in the presence of noise are presented, that do not require the evaluation of the objective function. The first generalizes the well-known Adagrad method. Its complexity is then analyzed as a…

Optimization and Control · Mathematics 2025-01-30 S. Gratton, S. Jerad, Ph. L. Toint

Scalable Second Order Optimization for Deep Learning

Optimization in machine learning, both theoretical and applied, is presently dominated by first-order gradient methods such as stochastic gradient descent. Second-order optimization methods, that involve second derivatives and/or second…

Machine Learning · Computer Science 2021-03-08 Rohan Anil, Vineet Gupta, Tomer Koren, Kevin Regan, Yoram Singer

Convergence properties of an Objective-Function-Free Optimization regularization algorithm, including an $\mathcal{O}(\epsilon^{-3/2})$ complexity bound

An adaptive regularization algorithm for unconstrained nonconvex optimization is presented in which the objective function is never evaluated, but only derivatives are used. This algorithm belongs to the class of adaptive regularization…

Optimization and Control · Mathematics 2022-05-05 S. Gratton, S. Jerad, Ph. L. Toint

Non-Uniform Smoothness for Gradient Descent

The analysis of gradient descent-type methods typically relies on the Lipschitz continuity of the objective gradient. This generally requires an expensive hyperparameter tuning process to appropriately calibrate a stepsize for a given…

Optimization and Control · Mathematics 2023-11-16 Albert S. Berahas, Lindon Roberts, Fred Roosta

Fast Stochastic Second-Order Adagrad for Nonconvex Bound-Constrained Optimization

ADAGB2, a generalization of the Adagrad algorithm for stochastic optimization is introduced, which is also applicable to bound-constrained problems and capable of using second-order information when available. It is shown that, given…

Optimization and Control · Mathematics 2025-05-13 S. Bellavia, S. Gratton, B. Morini, Ph. L. Toint

Zeroth-order Asynchronous Doubly Stochastic Algorithm with Variance Reduction

Zeroth-order (derivative-free) optimization attracts a lot of attention in machine learning, because explicit gradient calculations may be computationally expensive or infeasible. To handle large scale problems both in volume and dimension,…

Machine Learning · Computer Science 2016-12-06 Bin Gu, Zhouyuan Huo, Heng Huang