Related papers: Optimization, Learning, and Games with Predictable…

Local and adaptive mirror descents in extensive-form games

We study how to learn $\epsilon$-optimal strategies in zero-sum imperfect information games (IIG) with trajectory feedback. In this setting, players update their policies sequentially based on their observations over a fixed number of…

Computer Science and Game Theory · Computer Science 2023-09-06 Côme Fiegel , Pierre Ménard , Tadashi Kozuno , Rémi Munos , Vianney Perchet , Michal Valko

Last iterate convergence in no-regret learning: constrained min-max optimization for convex-concave landscapes

In a recent series of papers it has been established that variants of Gradient Descent/Ascent and Mirror Descent exhibit last iterate convergence in convex-concave zero-sum games. Specifically, \cite{DISZ17, LiangS18} show last iterate…

Machine Learning · Computer Science 2025-09-30 Qi Lei , Sai Ganesh Nagarajan , Ioannis Panageas , Xiao Wang

Minimax Optimal Algorithms for Unconstrained Linear Optimization

We design and analyze minimax-optimal algorithms for online linear optimization games where the player's choice is unconstrained. The player strives to minimize regret, the difference between his loss and the loss of a post-hoc benchmark…

Machine Learning · Computer Science 2013-02-12 H. Brendan McMahan

Mirror Prox Algorithm for Multi-Term Composite Minimization and Semi-Separable Problems

In the paper, we develop a composite version of Mirror Prox algorithm for solving convex-concave saddle point problems and monotone variational inequalities of special structure, allowing to cover saddle point/variational analogies of what…

Optimization and Control · Mathematics 2014-05-23 Niao He , Anatoli Juditsky , Arkadi Nemirovski

On Last-Iterate Convergence Beyond Zero-Sum Games

Most existing results about \emph{last-iterate convergence} of learning dynamics are limited to two-player zero-sum games, and only apply under rigid assumptions about what dynamics the players follow. In this paper we provide new results…

Computer Science and Game Theory · Computer Science 2022-03-24 Ioannis Anagnostides , Ioannis Panageas , Gabriele Farina , Tuomas Sandholm

Fast Last-Iterate Convergence of Learning in Games Requires Forgetful Algorithms

Self-play via online learning is one of the premier ways to solve large-scale two-player zero-sum games, both in theory and practice. Particularly popular algorithms include optimistic multiplicative weights update (OMWU) and optimistic…

Computer Science and Game Theory · Computer Science 2025-01-22 Yang Cai , Gabriele Farina , Julien Grand-Clément , Christian Kroer , Chung-Wei Lee , Haipeng Luo , Weiqiang Zheng

Acceleration through Optimistic No-Regret Dynamics

We consider the problem of minimizing a smooth convex function by reducing the optimization to computing the Nash equilibrium of a particular zero-sum convex-concave game. Zero-sum games can be solved using online learning dynamics, where a…

Machine Learning · Computer Science 2018-11-16 Jun-Kun Wang , Jacob Abernethy

Mirror Descent for Constrained Optimization Problems with Large Subgradient Values

Based on the ideas of arXiv:1710.06612, we consider the problem of minimization of the Holder-continuous non-smooth functional $f$ with non-positive convex (generally, non-smooth) Lipschitz-continuous functional constraint. We propose some…

Optimization and Control · Mathematics 2022-01-03 Fedor Stonyakin , Alexey Stepanov , Alexander Gasnikov , Alexander Titov

Alternating Mirror Descent for Constrained Min-Max Games

In this paper we study two-player bilinear zero-sum games with constrained strategy spaces. An instance of natural occurrences of such constraints is when mixed strategies are used, which correspond to a probability simplex constraint. We…

Computer Science and Game Theory · Computer Science 2022-06-10 Andre Wibisono , Molei Tao , Georgios Piliouras

Efficient Online Mirror Descent Stochastic Approximation for Multi-Stage Stochastic Programming

We study the unconstrained and the minimax saddle point variants of the convex multi-stage stochastic programming problem, where consecutive decisions are coupled through the objective functions, rather than through the constraints. We…

Optimization and Control · Mathematics 2026-03-02 Junhui Zhang , Patrick Jaillet

Improved Optimistic Mirror Descent for Sparsity and Curvature

Online Convex Optimization plays a key role in large scale machine learning. Early approaches to this problem were conservative, in which the main focus was protection against the worst case scenario. But recently several algorithms have…

Machine Learning · Computer Science 2016-09-09 Parameswaran Kamalaruban

Adaptive Learning in Continuous Games: Optimal Regret Bounds and Convergence to Nash Equilibrium

In game-theoretic learning, several agents are simultaneously following their individual interests, so the environment is non-stationary from each player's perspective. In this context, the performance of a learning algorithm is often…

Computer Science and Game Theory · Computer Science 2021-10-19 Yu-Guan Hsieh , Kimon Antonakopoulos , Panayotis Mertikopoulos

Follow the Perturbed Leader: Optimism and Fast Parallel Algorithms for Smooth Minimax Games

We consider the problem of online learning and its application to solving minimax games. For the online learning problem, Follow the Perturbed Leader (FTPL) is a widely studied algorithm which enjoys the optimal $O(T^{1/2})$ worst-case…

Machine Learning · Computer Science 2020-06-16 Arun Sai Suggala , Praneeth Netrapalli

UniXGrad: A Universal, Adaptive Algorithm with Optimal Guarantees for Constrained Optimization

We propose a novel adaptive, accelerated algorithm for the stochastic constrained convex optimization setting. Our method, which is inspired by the Mirror-Prox method, \emph{simultaneously} achieves the optimal rates for smooth/non-smooth…

Optimization and Control · Mathematics 2019-10-31 Ali Kavis , Kfir Y. Levy , Francis Bach , Volkan Cevher

Robust Data-Driven Accelerated Mirror Descent

Learning-to-optimize is an emerging framework that leverages training data to speed up the solution of certain optimization problems. One such approach is based on the classical mirror descent algorithm, where the mirror map is modelled…

Optimization and Control · Mathematics 2023-06-05 Hong Ye Tan , Subhadip Mukherjee , Junqi Tang , Andreas Hauptmann , Carola-Bibiane Schönlieb

Fast Rates in $\alpha$-Potential Games via Regularized Mirror Descent

An $\alpha$-potential game is a multi-player non-cooperative interaction in which a global potential function approximates individual player rewards up to a structural bias $\alpha$. While identifying a Nash Equilibrium (NE) in generic…

Computer Science and Game Theory · Computer Science 2026-05-19 Claire Chen , Yuheng Zhang

Mirror Descent Strikes Again: Optimal Stochastic Convex Optimization under Infinite Noise Variance

We study stochastic convex optimization under infinite noise variance. Specifically, when the stochastic gradient is unbiased and has uniformly bounded $(1+\kappa)$-th moment, for some $\kappa \in (0,1]$, we quantify the convergence rate of…

Machine Learning · Statistics 2022-02-24 Nuri Mert Vural , Lu Yu , Krishnakumar Balasubramanian , Stanislav Volgushev , Murat A. Erdogdu

Primal-dual Accelerated Mirror-Descent Method for Constrained Bilinear Saddle-Point Problems

We develop a first-order accelerated algorithm for a class of constrained bilinear saddle-point problems with applications to network systems. The algorithm is a modified time-varying primal-dual version of an accelerated mirror-descent…

Optimization and Control · Mathematics 2024-10-04 Weijian Li , Xianlin Zeng , Lacra Pavel

Online Optimization in Dynamic Environments

High-velocity streams of high-dimensional data pose significant "big data" analysis challenges across a range of applications and settings. Online learning and online convex programming play a significant role in the rapid recovery of…

Machine Learning · Statistics 2016-01-20 Eric C. Hall , Rebecca M. Willett

Online (Non-)Convex Learning via Tempered Optimism

Optimistic Online Learning aims to exploit experts conveying reliable information to predict the future. However, such implicit optimism may be challenged when it comes to practical crafting of such experts. A fundamental example consists…

Machine Learning · Computer Science 2025-10-29 Maxime Haddouche , Olivier Wintenberger , Benjamin Guedj