English
Related papers

Related papers: Limits on Gradient Compression for Stochastic Opti…

200 papers

In this paper we provide oracle complexity lower bounds for finding a point in a given set using a memory-constrained algorithm that has access to a separation oracle. We assume that the set is contained within the unit $d$-dimensional ball…

Optimization and Control · Mathematics 2024-04-11 Moise Blanchard

We lower bound the complexity of finding $\epsilon$-stationary points (with gradient norm at most $\epsilon$) using stochastic first-order methods. In a well-studied model where algorithms access smooth, potentially non-convex functions…

Optimization and Control · Mathematics 2022-03-01 Yossi Arjevani , Yair Carmon , John C. Duchi , Dylan J. Foster , Nathan Srebro , Blake Woodworth

Sorting has a natural generalization where the input consists of: (1) a ground set $X$ of size $n$, (2) a partial oracle $O_P$ specifying some fixed partial order $P$ on $X$ and (3) a linear oracle $O_L$ specifying a linear order $L$ that…

Data Structures and Algorithms · Computer Science 2024-08-01 Ivor van der Hoog , Daniel Rutschmann

ADAGB2, a generalization of the Adagrad algorithm for stochastic optimization is introduced, which is also applicable to bound-constrained problems and capable of using second-order information when available. It is shown that, given…

Optimization and Control · Mathematics 2025-05-13 S. Bellavia , S. Gratton , B. Morini , Ph. L. Toint

We study the running time, in terms of first order oracle queries, of differentially private empirical/population risk minimization of Lipschitz convex losses. We first consider the setting where the loss is non-smooth and the optimizer…

Machine Learning · Computer Science 2025-11-19 Michael Menart , Aleksandar Nikolov

We show that high-accuracy guarantees for log-concave sampling -- that is, iteration and query complexities which scale as $\mathrm{poly}\log(1/\delta)$, where $\delta$ is the desired target accuracy -- are achievable using stochastic…

Statistics Theory · Mathematics 2026-05-18 Fan Chen , Sinho Chewi , Constantinos Daskalakis , Alexander Rakhlin

We provide a first-order oracle complexity lower bound for finding stationary points of min-max optimization problems where the objective function is smooth, nonconvex in the minimization variable, and strongly concave in the maximization…

Optimization and Control · Mathematics 2021-04-20 Haochuan Li , Yi Tian , Jingzhao Zhang , Ali Jadbabaie

We consider the optimization of a quadratic objective function whose gradients are only accessible through a stochastic oracle that returns the gradient at any given point plus a zero-mean finite variance random error. We present the first…

Optimization and Control · Mathematics 2016-02-25 Aymeric Dieuleveut , Nicolas Flammarion , Francis Bach

Although upper bound guarantees for bilevel optimization have been widely studied, progress on lower bounds has been limited due to the complexity of the bilevel structure. In this work, we focus on the smooth nonconvex-strongly-convex…

Machine Learning · Computer Science 2025-11-27 Kaiyi Ji

Algorithmic reproducibility measures the deviation in outputs of machine learning algorithms upon minor changes in the training process. Previous work suggests that first-order methods would need to trade-off convergence rate (gradient…

Machine Learning · Computer Science 2024-01-11 Liang Zhang , Junchi Yang , Amin Karbasi , Niao He

Consider an oracle which takes a point $x$ and returns the minimizer of a convex function $f$ in an $\ell_2$ ball of radius $r$ around $x$. It is straightforward to show that roughly $r^{-1}\log\frac{1}{\epsilon}$ calls to the oracle…

Optimization and Control · Mathematics 2020-03-19 Yair Carmon , Arun Jambulapati , Qijia Jiang , Yujia Jin , Yin Tat Lee , Aaron Sidford , Kevin Tian

In this paper, we study the fundamental open question of finding the optimal high-order algorithm for solving smooth convex minimization problems. Arjevani et al. (2019) established the lower bound $\Omega\left(\epsilon^{-2/(3p+1)}\right)$…

Optimization and Control · Mathematics 2022-05-20 Dmitry Kovalev , Alexander Gasnikov

We present two first-order, sequential optimization algorithms to solve constrained optimization problems. We consider a black-box setting with a priori unknown, non-convex objective and constraint functions that have Lipschitz continuous…

Optimization and Control · Mathematics 2020-11-19 Abraham P. Vinod , Arie Israel , Ufuk Topcu

We investigate the computational complexity of several basic linear algebra primitives, including largest eigenvector computation and linear regression, in the computational model that allows access to the data via a matrix-vector product…

Machine Learning · Computer Science 2021-05-25 Mark Braverman , Elad Hazan , Max Simchowitz , Blake Woodworth

In this paper we analyze the necessary number of samples to estimate the gradient of any multidimensional smooth (possibly non-convex) function in a zero-order stochastic oracle model. In this model, an estimator has access to noisy values…

Machine Learning · Computer Science 2021-07-07 Abdulrahman Alabdulkareem , Jean Honorio

We design an algorithm which finds an $\epsilon$-approximate stationary point (with $\|\nabla F(x)\|\le \epsilon$) using $O(\epsilon^{-3})$ stochastic gradient and Hessian-vector products, matching guarantees that were previously available…

Machine Learning · Computer Science 2020-06-25 Yossi Arjevani , Yair Carmon , John C. Duchi , Dylan J. Foster , Ayush Sekhari , Karthik Sridharan

We study the complexity of optimizing highly smooth convex functions. For a positive integer $p$, we want to find an $\epsilon$-approximate minimum of a convex function $f$, given oracle access to the function and its first $p$ derivatives,…

Optimization and Control · Mathematics 2021-12-06 Ankit Garg , Robin Kothari , Praneeth Netrapalli , Suhail Sherif

This paper deals with the black-box optimization problem. In this setup, we do not have access to the gradient of the objective function, therefore, we need to estimate it somehow. We propose a new type of approximation JAGUAR, that…

Optimization and Control · Mathematics 2024-12-03 Andrey Veprikov , Aleksandr Bogdanov , Vladislav Minashkin , Aleksandr Beznosikov

We consider the problem of sampling from a strongly log-concave density in $\mathbb{R}^d$, and prove an information theoretic lower bound on the number of stochastic gradient queries of the log density needed. Several popular sampling…

Machine Learning · Statistics 2021-07-06 Niladri S. Chatterji , Peter L. Bartlett , Philip M. Long

We present a new accelerated stochastic second-order method that is robust to both gradient and Hessian inexactness, which occurs typically in machine learning. We establish theoretical lower bounds and prove that our algorithm achieves…

Optimization and Control · Mathematics 2024-05-28 Artem Agafonov , Dmitry Kamzolov , Alexander Gasnikov , Ali Kavis , Kimon Antonakopoulos , Volkan Cevher , Martin Takáč
‹ Prev 1 2 3 10 Next ›