Related papers: Accelerated Gradient Algorithms with Adaptive Subs…

Improved Algorithms for Convex-Concave Minimax Optimization

This paper studies minimax optimization problems $\min_x \max_y f(x,y)$, where $f(x,y)$ is $m_x$-strongly convex with respect to $x$, $m_y$-strongly concave with respect to $y$ and $(L_x,L_{xy},L_y)$-smooth. Zhang et al. provided the…

Machine Learning · Computer Science 2020-10-20 Yuanhao Wang , Jian Li

Near-Optimal Convergence of Accelerated Gradient Methods under Generalized and $(L_0, L_1)$-Smoothness

We study first-order methods for convex optimization problems with functions $f$ satisfying the recently proposed $\ell$-smoothness condition $||\nabla^{2}f(x)|| \le \ell\left(||\nabla f(x)||\right),$ which generalizes the $L$-smoothness…

Optimization and Control · Mathematics 2026-05-22 Alexander Tyurin

Adaptive Conditional Gradient Descent

Selecting an effective step-size is a fundamental challenge in first-order optimization, especially for problems with non-Euclidean geometries. This paper presents a novel adaptive step-size strategy for optimization algorithms that rely on…

Optimization and Control · Mathematics 2025-10-14 Abbas Khademi , Antonio Silveti-Falls

Gradient methods for convex minimization: better rates under weaker conditions

The convergence behavior of gradient methods for minimizing convex differentiable functions is one of the core questions in convex optimization. This paper shows that their well-known complexities can be achieved under conditions weaker…

Optimization and Control · Mathematics 2013-09-10 Hui Zhang , Wotao Yin

Adaptive extra-gradient methods for min-max optimization and games

We present a new family of min-max optimization algorithms that automatically exploit the geometry of the gradient data observed at earlier iterations to perform more informative extra-gradient steps in later ones. Thanks to this adaptation…

Optimization and Control · Mathematics 2020-11-20 Kimon Antonakopoulos , E. Veronica Belmega , Panayotis Mertikopoulos

Third-order Smoothness Helps: Even Faster Stochastic Optimization Algorithms for Finding Local Minima

We propose stochastic optimization algorithms that can find local minima faster than existing algorithms for nonconvex optimization problems, by exploiting the third-order smoothness to escape non-degenerate saddle points more efficiently.…

Optimization and Control · Mathematics 2017-12-19 Yaodong Yu , Pan Xu , Quanquan Gu

Accelerated Algorithms for Smooth Convex-Concave Minimax Problems with $\mathcal{O}(1/k^2)$ Rate on Squared Gradient Norm

In this work, we study the computational complexity of reducing the squared gradient magnitude for smooth minimax optimization problems. First, we present algorithms with accelerated $\mathcal{O}(1/k^2)$ last-iterate rates, faster than the…

Optimization and Control · Mathematics 2021-06-11 TaeHo Yoon , Ernest K. Ryu

Near-Optimal Algorithms for Minimax Optimization

This paper resolves a longstanding open question pertaining to the design of near-optimal first-order algorithms for smooth and strongly-convex-strongly-concave minimax problems. Current state-of-the-art first-order algorithms find an…

Optimization and Control · Mathematics 2021-07-27 Tianyi Lin , Chi Jin , Michael. I. Jordan

Stochastic Constrained Decentralized Optimization for Machine Learning with Fewer Data Oracles: a Gradient Sliding Approach

In modern decentralized applications, ensuring communication efficiency and privacy for the users are the key challenges. In order to train machine-learning models, the algorithm has to communicate to the data center and sample data for its…

Optimization and Control · Mathematics 2024-04-04 Hoang Huy Nguyen , Yan Li , Tuo Zhao

Efficient Algorithms for Smooth Minimax Optimization

This paper studies first order methods for solving smooth minimax optimization problems $\min_x \max_y g(x,y)$ where $g(\cdot,\cdot)$ is smooth and $g(x,\cdot)$ is concave for each $x$. In terms of $g(\cdot,y)$, we consider two settings --…

Optimization and Control · Mathematics 2019-07-03 Kiran Koshy Thekumparampil , Prateek Jain , Praneeth Netrapalli , Sewoong Oh

Near-optimal tensor methods for minimizing the gradient norm of convex functions and accelerated primal-dual tensor methods

Motivated, in particular, by the entropy-regularized optimal transport problem, we consider convex optimization problems with linear equality constraints, where the dual objective has Lipschitz $p$-th order derivatives, and develop two…

Optimization and Control · Mathematics 2023-08-11 Pavel Dvurechensky , Petr Ostroukhov , Alexander Gasnikov , César A. Uribe , Anastasiya Ivanova

Nesterov Finds GRAAL: Optimal and Adaptive Gradient Method for Convex Optimization

In this paper, we focus on the problem of minimizing a continuously differentiable convex objective function, $\min_x f(x)$. Recently, Malitsky (2020); Alacaoglu et al.(2023) developed an adaptive first-order method, GRAAL. This algorithm…

Optimization and Control · Mathematics 2025-09-01 Ekaterina Borodich , Dmitry Kovalev

Gradient Sliding for Composite Optimization

We consider in this paper a class of composite optimization problems whose objective function is given by the summation of a general smooth and nonsmooth component, together with a relatively simple nonsmooth term. We present a new class of…

Optimization and Control · Mathematics 2015-10-27 Guanghui Lan

A Fast and Convergent Proximal Algorithm for Regularized Nonconvex and Nonsmooth Bi-level Optimization

Many important machine learning applications involve regularized nonconvex bi-level optimization. However, the existing gradient-based bi-level optimization algorithms cannot handle nonconvex or nonsmooth regularizers, and they suffer from…

Machine Learning · Computer Science 2022-06-06 Ziyi Chen , Bhavya Kailkhura , Yi Zhou

Stochastic Conditional Gradient Methods: From Convex Minimization to Submodular Maximization

This paper considers stochastic optimization problems for a large class of objective functions, including convex and continuous submodular. Stochastic proximal gradient methods have been widely used to solve such problems; however, their…

Optimization and Control · Mathematics 2018-11-13 Aryan Mokhtari , Hamed Hassani , Amin Karbasi

Adaptive Algorithms with Sharp Convergence Rates for Stochastic Hierarchical Optimization

Hierarchical optimization refers to problems with interdependent decision variables and objectives, such as minimax and bilevel formulations. While various algorithms have been proposed, existing methods and analyses lack adaptivity in…

Machine Learning · Computer Science 2025-10-27 Xiaochuan Gong , Jie Hao , Mingrui Liu

Gradient-Normalized Smoothness for Optimization with Approximate Hessians

In this work, we develop new optimization algorithms that use approximate second-order information combined with the gradient regularization technique to achieve fast global convergence rates for both convex and non-convex objectives. The…

Optimization and Control · Mathematics 2025-06-17 Andrei Semenov , Martin Jaggi , Nikita Doikov

Smooth Optimization with Approximate Gradient

We show that the optimal complexity of Nesterov's smooth first-order optimization algorithm is preserved when the gradient is only computed up to a small, uniformly bounded error. In applications of this method to semidefinite programs,…

Optimization and Control · Mathematics 2008-05-16 Alexandre d'Aspremont

Revisiting Subgradient Method: Complexity and Convergence Beyond Lipschitz Continuity

The subgradient method is one of the most fundamental algorithmic schemes for nonsmooth optimization. The existing complexity and convergence results for this method are mainly derived for Lipschitz continuous objective functions. In this…

Optimization and Control · Mathematics 2024-11-01 Xiao Li , Lei Zhao , Daoli Zhu , Anthony Man-Cho So

Complexity Lower Bounds of Adaptive Gradient Algorithms for Non-convex Stochastic Optimization under Relaxed Smoothness

Recent results in non-convex stochastic optimization demonstrate the convergence of popular adaptive algorithms (e.g., AdaGrad) under the $(L_0, L_1)$-smoothness condition, but the rate of convergence is a higher-order polynomial in terms…

Machine Learning · Computer Science 2025-05-09 Michael Crawshaw , Mingrui Liu