English
Related papers

Related papers: Optimization, Learning, and Games with Predictable…

200 papers

We study how to learn $\epsilon$-optimal strategies in zero-sum imperfect information games (IIG) with trajectory feedback. In this setting, players update their policies sequentially based on their observations over a fixed number of…

Computer Science and Game Theory · Computer Science 2023-09-06 Côme Fiegel , Pierre Ménard , Tadashi Kozuno , Rémi Munos , Vianney Perchet , Michal Valko

In a recent series of papers it has been established that variants of Gradient Descent/Ascent and Mirror Descent exhibit last iterate convergence in convex-concave zero-sum games. Specifically, \cite{DISZ17, LiangS18} show last iterate…

Machine Learning · Computer Science 2025-09-30 Qi Lei , Sai Ganesh Nagarajan , Ioannis Panageas , Xiao Wang

We design and analyze minimax-optimal algorithms for online linear optimization games where the player's choice is unconstrained. The player strives to minimize regret, the difference between his loss and the loss of a post-hoc benchmark…

Machine Learning · Computer Science 2013-02-12 H. Brendan McMahan

In the paper, we develop a composite version of Mirror Prox algorithm for solving convex-concave saddle point problems and monotone variational inequalities of special structure, allowing to cover saddle point/variational analogies of what…

Optimization and Control · Mathematics 2014-05-23 Niao He , Anatoli Juditsky , Arkadi Nemirovski

Most existing results about \emph{last-iterate convergence} of learning dynamics are limited to two-player zero-sum games, and only apply under rigid assumptions about what dynamics the players follow. In this paper we provide new results…

Computer Science and Game Theory · Computer Science 2022-03-24 Ioannis Anagnostides , Ioannis Panageas , Gabriele Farina , Tuomas Sandholm

Self-play via online learning is one of the premier ways to solve large-scale two-player zero-sum games, both in theory and practice. Particularly popular algorithms include optimistic multiplicative weights update (OMWU) and optimistic…

Computer Science and Game Theory · Computer Science 2025-01-22 Yang Cai , Gabriele Farina , Julien Grand-Clément , Christian Kroer , Chung-Wei Lee , Haipeng Luo , Weiqiang Zheng

We consider the problem of minimizing a smooth convex function by reducing the optimization to computing the Nash equilibrium of a particular zero-sum convex-concave game. Zero-sum games can be solved using online learning dynamics, where a…

Machine Learning · Computer Science 2018-11-16 Jun-Kun Wang , Jacob Abernethy

Based on the ideas of arXiv:1710.06612, we consider the problem of minimization of the Holder-continuous non-smooth functional $f$ with non-positive convex (generally, non-smooth) Lipschitz-continuous functional constraint. We propose some…

Optimization and Control · Mathematics 2022-01-03 Fedor Stonyakin , Alexey Stepanov , Alexander Gasnikov , Alexander Titov

In this paper we study two-player bilinear zero-sum games with constrained strategy spaces. An instance of natural occurrences of such constraints is when mixed strategies are used, which correspond to a probability simplex constraint. We…

Computer Science and Game Theory · Computer Science 2022-06-10 Andre Wibisono , Molei Tao , Georgios Piliouras

We study the unconstrained and the minimax saddle point variants of the convex multi-stage stochastic programming problem, where consecutive decisions are coupled through the objective functions, rather than through the constraints. We…

Optimization and Control · Mathematics 2026-03-02 Junhui Zhang , Patrick Jaillet

Online Convex Optimization plays a key role in large scale machine learning. Early approaches to this problem were conservative, in which the main focus was protection against the worst case scenario. But recently several algorithms have…

Machine Learning · Computer Science 2016-09-09 Parameswaran Kamalaruban

In game-theoretic learning, several agents are simultaneously following their individual interests, so the environment is non-stationary from each player's perspective. In this context, the performance of a learning algorithm is often…

Computer Science and Game Theory · Computer Science 2021-10-19 Yu-Guan Hsieh , Kimon Antonakopoulos , Panayotis Mertikopoulos

We consider the problem of online learning and its application to solving minimax games. For the online learning problem, Follow the Perturbed Leader (FTPL) is a widely studied algorithm which enjoys the optimal $O(T^{1/2})$ worst-case…

Machine Learning · Computer Science 2020-06-16 Arun Sai Suggala , Praneeth Netrapalli

We propose a novel adaptive, accelerated algorithm for the stochastic constrained convex optimization setting. Our method, which is inspired by the Mirror-Prox method, \emph{simultaneously} achieves the optimal rates for smooth/non-smooth…

Optimization and Control · Mathematics 2019-10-31 Ali Kavis , Kfir Y. Levy , Francis Bach , Volkan Cevher

Learning-to-optimize is an emerging framework that leverages training data to speed up the solution of certain optimization problems. One such approach is based on the classical mirror descent algorithm, where the mirror map is modelled…

Optimization and Control · Mathematics 2023-06-05 Hong Ye Tan , Subhadip Mukherjee , Junqi Tang , Andreas Hauptmann , Carola-Bibiane Schönlieb

An $\alpha$-potential game is a multi-player non-cooperative interaction in which a global potential function approximates individual player rewards up to a structural bias $\alpha$. While identifying a Nash Equilibrium (NE) in generic…

Computer Science and Game Theory · Computer Science 2026-05-19 Claire Chen , Yuheng Zhang

We study stochastic convex optimization under infinite noise variance. Specifically, when the stochastic gradient is unbiased and has uniformly bounded $(1+\kappa)$-th moment, for some $\kappa \in (0,1]$, we quantify the convergence rate of…

Machine Learning · Statistics 2022-02-24 Nuri Mert Vural , Lu Yu , Krishnakumar Balasubramanian , Stanislav Volgushev , Murat A. Erdogdu

We develop a first-order accelerated algorithm for a class of constrained bilinear saddle-point problems with applications to network systems. The algorithm is a modified time-varying primal-dual version of an accelerated mirror-descent…

Optimization and Control · Mathematics 2024-10-04 Weijian Li , Xianlin Zeng , Lacra Pavel

High-velocity streams of high-dimensional data pose significant "big data" analysis challenges across a range of applications and settings. Online learning and online convex programming play a significant role in the rapid recovery of…

Machine Learning · Statistics 2016-01-20 Eric C. Hall , Rebecca M. Willett

Optimistic Online Learning aims to exploit experts conveying reliable information to predict the future. However, such implicit optimism may be challenged when it comes to practical crafting of such experts. A fundamental example consists…

Machine Learning · Computer Science 2025-10-29 Maxime Haddouche , Olivier Wintenberger , Benjamin Guedj
‹ Prev 1 2 3 10 Next ›