Related papers: Limits on Gradient Compression for Stochastic Opti…

Gradient Descent is Pareto-Optimal in the Oracle Complexity and Memory Tradeoff for Feasibility Problems

In this paper we provide oracle complexity lower bounds for finding a point in a given set using a memory-constrained algorithm that has access to a separation oracle. We assume that the set is contained within the unit $d$-dimensional ball…

Optimization and Control · Mathematics 2024-04-11 Moise Blanchard

Lower Bounds for Non-Convex Stochastic Optimization

We lower bound the complexity of finding $\epsilon$-stationary points (with gradient norm at most $\epsilon$) using stochastic first-order methods. In a well-studied model where algorithms access smooth, potentially non-convex functions…

Optimization and Control · Mathematics 2022-03-01 Yossi Arjevani , Yair Carmon , John C. Duchi , Dylan J. Foster , Nathan Srebro , Blake Woodworth

Tight Bounds for Sorting Under Partial Information

Sorting has a natural generalization where the input consists of: (1) a ground set $X$ of size $n$, (2) a partial oracle $O_P$ specifying some fixed partial order $P$ on $X$ and (3) a linear oracle $O_L$ specifying a linear order $L$ that…

Data Structures and Algorithms · Computer Science 2024-08-01 Ivor van der Hoog , Daniel Rutschmann

Fast Stochastic Second-Order Adagrad for Nonconvex Bound-Constrained Optimization

ADAGB2, a generalization of the Adagrad algorithm for stochastic optimization is introduced, which is also applicable to bound-constrained problems and capable of using second-order information when available. It is shown that, given…

Optimization and Control · Mathematics 2025-05-13 S. Bellavia , S. Gratton , B. Morini , Ph. L. Toint

On the Gradient Complexity of Private Optimization with Private Oracles

We study the running time, in terms of first order oracle queries, of differentially private empirical/population risk minimization of Lipschitz convex losses. We first consider the setting where the loss is non-smooth and the optimizer…

Machine Learning · Computer Science 2025-11-19 Michael Menart , Aleksandar Nikolov

High-accuracy log-concave sampling with stochastic queries

We show that high-accuracy guarantees for log-concave sampling -- that is, iteration and query complexities which scale as $\mathrm{poly}\log(1/\delta)$, where $\delta$ is the desired target accuracy -- are achievable using stochastic…

Statistics Theory · Mathematics 2026-05-18 Fan Chen , Sinho Chewi , Constantinos Daskalakis , Alexander Rakhlin

Complexity Lower Bounds for Nonconvex-Strongly-Concave Min-Max Optimization

We provide a first-order oracle complexity lower bound for finding stationary points of min-max optimization problems where the objective function is smooth, nonconvex in the minimization variable, and strongly concave in the maximization…

Optimization and Control · Mathematics 2021-04-20 Haochuan Li , Yi Tian , Jingzhao Zhang , Ali Jadbabaie

Harder, Better, Faster, Stronger Convergence Rates for Least-Squares Regression

We consider the optimization of a quadratic objective function whose gradients are only accessible through a stochastic oracle that returns the gradient at any given point plus a zero-mean finite variance random error. We present the first…

Optimization and Control · Mathematics 2016-02-25 Aymeric Dieuleveut , Nicolas Flammarion , Francis Bach

Lower Complexity Bounds for Nonconvex-Strongly-Convex Bilevel Optimization with First-Order Oracles

Although upper bound guarantees for bilevel optimization have been widely studied, progress on lower bounds has been limited due to the complexity of the bilevel structure. In this work, we focus on the smooth nonconvex-strongly-convex…

Machine Learning · Computer Science 2025-11-27 Kaiyi Ji

Optimal Guarantees for Algorithmic Reproducibility and Gradient Complexity in Convex Optimization

Algorithmic reproducibility measures the deviation in outputs of machine learning algorithms upon minor changes in the training process. Previous work suggests that first-order methods would need to trade-off convergence rate (gradient…

Machine Learning · Computer Science 2024-01-11 Liang Zhang , Junchi Yang , Amin Karbasi , Niao He

Acceleration with a Ball Optimization Oracle

Consider an oracle which takes a point $x$ and returns the minimizer of a convex function $f$ in an $\ell_2$ ball of radius $r$ around $x$. It is straightforward to show that roughly $r^{-1}\log\frac{1}{\epsilon}$ calls to the oracle…

Optimization and Control · Mathematics 2020-03-19 Yair Carmon , Arun Jambulapati , Qijia Jiang , Yujia Jin , Yin Tat Lee , Aaron Sidford , Kevin Tian

The First Optimal Acceleration of High-Order Methods in Smooth Convex Optimization

In this paper, we study the fundamental open question of finding the optimal high-order algorithm for solving smooth convex minimization problems. Arjevani et al. (2019) established the lower bound $\Omega\left(\epsilon^{-2/(3p+1)}\right)$…

Optimization and Control · Mathematics 2022-05-20 Dmitry Kovalev , Alexander Gasnikov

Constrained, Global Optimization of Functions with Lipschitz Continuous Gradients

We present two first-order, sequential optimization algorithms to solve constrained optimization problems. We consider a black-box setting with a priori unknown, non-convex objective and constraint functions that have Lipschitz continuous…

Optimization and Control · Mathematics 2020-11-19 Abraham P. Vinod , Arie Israel , Ufuk Topcu

The gradient complexity of linear regression

We investigate the computational complexity of several basic linear algebra primitives, including largest eigenvector computation and linear regression, in the computational model that allows access to the data via a matrix-vector product…

Machine Learning · Computer Science 2021-05-25 Mark Braverman , Elad Hazan , Max Simchowitz , Blake Woodworth

Information-Theoretic Lower Bounds for Zero-Order Stochastic Gradient Estimation

In this paper we analyze the necessary number of samples to estimate the gradient of any multidimensional smooth (possibly non-convex) function in a zero-order stochastic oracle model. In this model, an estimator has access to noisy values…

Machine Learning · Computer Science 2021-07-07 Abdulrahman Alabdulkareem , Jean Honorio

Second-Order Information in Non-Convex Stochastic Optimization: Power and Limitations

We design an algorithm which finds an $\epsilon$-approximate stationary point (with $\|\nabla F(x)\|\le \epsilon$) using $O(\epsilon^{-3})$ stochastic gradient and Hessian-vector products, matching guarantees that were previously available…

Machine Learning · Computer Science 2020-06-25 Yossi Arjevani , Yair Carmon , John C. Duchi , Dylan J. Foster , Ayush Sekhari , Karthik Sridharan

Near-Optimal Lower Bounds For Convex Optimization For All Orders of Smoothness

We study the complexity of optimizing highly smooth convex functions. For a positive integer $p$, we want to find an $\epsilon$-approximate minimum of a convex function $f$, given oracle access to the function and its first $p$ derivatives,…

Optimization and Control · Mathematics 2021-12-06 Ankit Garg , Robin Kothari , Praneeth Netrapalli , Suhail Sherif

New Aspects of Black Box Conditional Gradient: Variance Reduction and One Point Feedback

This paper deals with the black-box optimization problem. In this setup, we do not have access to the gradient of the objective function, therefore, we need to estimate it somehow. We propose a new type of approximation JAGUAR, that…

Optimization and Control · Mathematics 2024-12-03 Andrey Veprikov , Aleksandr Bogdanov , Vladislav Minashkin , Aleksandr Beznosikov

Oracle Lower Bounds for Stochastic Gradient Sampling Algorithms

We consider the problem of sampling from a strongly log-concave density in $\mathbb{R}^d$, and prove an information theoretic lower bound on the number of stochastic gradient queries of the log density needed. Several popular sampling…

Machine Learning · Statistics 2021-07-06 Niladri S. Chatterji , Peter L. Bartlett , Philip M. Long

Advancing the lower bounds: An accelerated, stochastic, second-order method with optimal adaptation to inexactness

We present a new accelerated stochastic second-order method that is robust to both gradient and Hessian inexactness, which occurs typically in machine learning. We establish theoretical lower bounds and prove that our algorithm achieves…

Optimization and Control · Mathematics 2024-05-28 Artem Agafonov , Dmitry Kamzolov , Alexander Gasnikov , Ali Kavis , Kimon Antonakopoulos , Volkan Cevher , Martin Takáč