Related papers: Learning Supervised PageRank with Gradient-Based a…

Learning Supervised PageRank with Gradient-Free Optimization Methods

In this paper, we consider a problem of learning supervised PageRank models, which can account for some properties not considered by classical approaches such as the classical PageRank algorithm. Due to huge hidden dimension of the…

Optimization and Control · Mathematics 2015-12-08 Lev Bogolubsky , Pavel Dvurechensky , Alexander Gasnikov , Gleb Gusev , Yurii Nesterov , Andrey Raigorodskii , Aleksey Tikhonov , Maxim Zhukovskii

A randomized algorithm for nonconvex minimization with inexact evaluations and complexity guarantees

We consider minimization of a smooth nonconvex function with inexact oracle access to gradient and Hessian (without assuming access to the function value) to achieve approximate second-order optimality. A novel feature of our method is that…

Optimization and Control · Mathematics 2024-03-27 Shuyao Li , Stephen J. Wright

Gradient-Free Methods for Saddle-Point Problem

In the paper, we generalize the approach Gasnikov et. al, 2017, which allows to solve (stochastic) convex optimization problems with an inexact gradient-free oracle, to the convex-concave saddle-point problem. The proposed approach works,…

Optimization and Control · Mathematics 2022-09-13 Aleksandr Beznosikov , Abdurakhmon Sadiev , Alexander Gasnikov

Optimal Rates for Random Order Online Optimization

We study online convex optimization in the random order model, recently proposed by \citet{garber2020online}, where the loss functions may be chosen by an adversary, but are then presented to the online algorithm in a uniformly random…

Machine Learning · Computer Science 2021-06-30 Uri Sherman , Tomer Koren , Yishay Mansour

Linearly Convergent Algorithm with Variance Reduction for Distributed Stochastic Optimization

This paper considers a distributed stochastic strongly convex optimization, where agents connected over a network aim to cooperatively minimize the average of all agents' local cost functions. Due to the stochasticity of gradient estimation…

Optimization and Control · Mathematics 2020-02-17 Jinlong Lei , Peng Yi , Jie Chen , Yiguang Hong

Efficient Dictionary Learning with Gradient Descent

Randomly initialized first-order optimization algorithms are the method of choice for solving many high-dimensional nonconvex problems in machine learning, yet general theoretical guarantees cannot rule out convergence to critical points of…

Optimization and Control · Mathematics 2018-09-28 Dar Gilboa , Sam Buchanan , John Wright

Escaping From Saddle Points --- Online Stochastic Gradient for Tensor Decomposition

We analyze stochastic gradient descent for optimizing non-convex functions. In many cases for non-convex functions the goal is to find a reasonable local minimum, and the main concern is that gradient updates are trapped in saddle points.…

Machine Learning · Computer Science 2015-03-10 Rong Ge , Furong Huang , Chi Jin , Yang Yuan

Non-strongly-convex smooth stochastic approximation with convergence rate O(1/n)

We consider the stochastic approximation problem where a convex function has to be minimized, given only the knowledge of unbiased estimates of its gradients at certain points, a framework which includes machine learning methods based on…

Machine Learning · Computer Science 2013-06-11 Francis Bach , Eric Moulines

On the Convergence and Complexity of the Stochastic Central Finite-Difference Based Gradient Estimation Methods

This paper presents an algorithmic framework for solving unconstrained stochastic optimization problems using only stochastic function evaluations. We employ central finite-difference based gradient estimation methods to approximate the…

Optimization and Control · Mathematics 2025-01-14 Raghu Bollapragada , Cem Karamanli

Gradient Descent with Provably Tuned Learning-rate Schedules

Gradient-based iterative optimization methods are the workhorse of modern machine learning. They crucially rely on careful tuning of parameters like learning rate and momentum. However, one typically sets them using heuristic approaches…

Machine Learning · Computer Science 2025-12-05 Dravyansh Sharma

Efficient Gradient Approximation Method for Constrained Bilevel Optimization

Bilevel optimization has been developed for many machine learning tasks with large-scale and high-dimensional data. This paper considers a constrained bilevel optimization problem, where the lower-level optimization problem is convex with…

Machine Learning · Computer Science 2023-08-22 Siyuan Xu , Minghui Zhu

Stochastic Nested Variance Reduction for Nonconvex Optimization

We study finite-sum nonconvex optimization problems, where the objective function is an average of $n$ nonconvex functions. We propose a new stochastic gradient descent algorithm based on nested variance reduction. Compared with…

Machine Learning · Computer Science 2020-10-20 Dongruo Zhou , Pan Xu , Quanquan Gu

A Convexity-dependent Two-Phase Training Algorithm for Deep Neural Networks

The key task of machine learning is to minimize the loss function that measures the model fit to the training data. The numerical methods to do this efficiently depend on the properties of the loss function. The most decisive among these…

Machine Learning · Computer Science 2025-10-31 Tomas Hrycej , Bernhard Bermeitinger , Massimo Pavone , Götz-Henrik Wiegand , Siegfried Handschuh

Provable Guarantees for Gradient-Based Meta-Learning

We study the problem of meta-learning through the lens of online convex optimization, developing a meta-algorithm bridging the gap between popular gradient-based meta-learning and classical regularization-based multi-task transfer methods.…

Machine Learning · Computer Science 2019-05-17 Mikhail Khodak , Maria-Florina Balcan , Ameet Talwalkar

Stochastic Variance-reduced Gradient Descent for Low-rank Matrix Recovery from Linear Measurements

We study the problem of estimating low-rank matrices from linear measurements (a.k.a., matrix sensing) through nonconvex optimization. We propose an efficient stochastic variance reduced gradient descent algorithm to solve a nonconvex…

Machine Learning · Statistics 2017-01-17 Xiao Zhang , Lingxiao Wang , Quanquan Gu

Variance-Reduced Gradient Estimator for Nonconvex Zeroth-Order Distributed Optimization

This paper investigates distributed zeroth-order optimization for smooth nonconvex problems, targeting the trade-off between convergence rate and sampling cost per zeroth-order gradient estimation in current algorithms that use either the…

Optimization and Control · Mathematics 2026-04-10 Huaiyi Mu , Yujie Tang , Jie Song , Zhongkui Li

Minimax Optimal Online Stochastic Learning for Sequences of Convex Functions under Sub-Gradient Observation Failures

We study online convex optimization under stochastic sub-gradient observation faults, where we introduce adaptive algorithms with minimax optimal regret guarantees. We specifically study scenarios where our sub-gradient observations can be…

Machine Learning · Computer Science 2019-04-23 Hakan Gokcesu , Suleyman S. Kozat

Simple Stochastic Gradient Methods for Non-Smooth Non-Convex Regularized Optimization

Our work focuses on stochastic gradient methods for optimizing a smooth non-convex loss function with a non-smooth non-convex regularizer. Research on this class of problem is quite limited, and until recently no non-asymptotic convergence…

Optimization and Control · Mathematics 2019-05-15 Michael R. Metel , Akiko Takeda

Stochastic Gradient Descent on a Tree: an Adaptive and Robust Approach to Stochastic Convex Optimization

Online minimization of an unknown convex function over the interval $[0,1]$ is considered under first-order stochastic bandit feedback, which returns a random realization of the gradient of the function at each query point. Without knowing…

Machine Learning · Statistics 2020-02-21 Sattar Vakili , Sudeep Salgia , Qing Zhao

Convergence of First-Order Methods for Constrained Nonconvex Optimization with Dependent Data

We focus on analyzing the classical stochastic projected gradient methods under a general dependent data sampling scheme for constrained smooth nonconvex optimization. We show the worst-case rate of convergence $\tilde{O}(t^{-1/4})$ and…

Optimization and Control · Mathematics 2023-06-26 Ahmet Alacaoglu , Hanbaek Lyu