Related papers: Learning Zero-Sum Simultaneous-Move Markov Games U…

Almost Optimal Algorithms for Two-player Zero-Sum Linear Mixture Markov Games

We study reinforcement learning for two-player zero-sum Markov games with simultaneous moves in the finite-horizon setting, where the transition kernel of the underlying Markov games can be parameterized by a linear function over the…

Machine Learning · Computer Science 2022-04-21 Zixiang Chen, Dongruo Zhou, Quanquan Gu

Learning Two-Player Mixture Markov Games: Kernel Function Approximation and Correlated Equilibrium

We consider learning Nash equilibria in two-player zero-sum Markov Games with nonlinear function approximation, where the action-value function is approximated by a function in a Reproducing Kernel Hilbert Space (RKHS). The key challenge is…

Machine Learning · Computer Science 2022-08-11 Chris Junchi Li, Dongruo Zhou, Quanquan Gu, Michael I. Jordan

Provably Efficient Reinforcement Learning in Decentralized General-Sum Markov Games

This paper addresses the problem of learning an equilibrium efficiently in general-sum Markov games through decentralized multi-agent reinforcement learning. Given the fundamental difficulty of calculating a Nash equilibrium (NE), we…

Machine Learning · Computer Science 2022-02-01 Weichao Mao, Tamer Başar

Near Optimal Convergence to Coarse Correlated Equilibrium in General-Sum Markov Games

No-regret learning dynamics play a central role in game theory, enabling decentralized convergence to equilibrium for concepts such as Coarse Correlated Equilibrium (CCE) or Correlated Equilibrium (CE). In this work, we improve the…

Computer Science and Game Theory · Computer Science 2025-11-05 Asrin Efe Yorulmaz, Tamer Başar

Zero-sum Polymatrix Markov Games: Equilibrium Collapse and Efficient Computation of Nash Equilibria

The works of (Daskalakis et al., 2009, 2022; Jin et al., 2022; Deng et al., 2023) indicate that computing Nash equilibria in multi-player Markov games is a computationally hard task. This fact raises the question of whether or not…

Computer Science and Game Theory · Computer Science 2023-05-30 Fivos Kalogiannis, Ioannis Panageas

Near-Optimal Policy Optimization for Correlated Equilibrium in General-Sum Markov Games

We study policy optimization algorithms for computing correlated equilibria in multi-player general-sum Markov Games. Previous results achieve $O(T^{-1/2})$ convergence rate to a correlated equilibrium and an accelerated $O(T^{-3/4})$…

Machine Learning · Computer Science 2024-05-03 Yang Cai, Haipeng Luo, Chen-Yu Wei, Weiqiang Zheng

Near-Optimal Reinforcement Learning with Self-Play

This paper considers the problem of designing optimal algorithms for reinforcement learning in two-player zero-sum games. We focus on self-play algorithms which learn the optimal policy by playing against itself without any direct…

Machine Learning · Computer Science 2020-07-15 Yu Bai, Chi Jin, Tiancheng Yu

Minimax-Optimal Multi-Agent RL in Markov Games With a Generative Model

This paper studies multi-agent reinforcement learning in Markov games, with the goal of learning Nash equilibria or coarse correlated equilibria (CCE) sample-optimally. All prior results suffer from at least one of the two obstacles: the…

Machine Learning · Computer Science 2022-10-13 Gen Li, Yuejie Chi, Yuting Wei, Yuxin Chen

Near-Optimal Last-iterate Convergence of Policy Optimization in Zero-sum Polymatrix Markov games

Computing approximate Nash equilibria in multi-player general-sum Markov games is a computationally intractable task. However, multi-player Markov games with certain cooperative or competitive structures might circumvent this…

Computer Science and Game Theory · Computer Science 2023-08-17 Zailin Ma, Jiansheng Yang, Zhihua Zhang

Regret Minimization and Convergence to Equilibria in General-sum Markov Games

An abundance of recent impossibility results establish that regret minimization in Markov games with adversarial opponents is both statistically and computationally intractable. Nevertheless, none of these results preclude the possibility…

Machine Learning · Computer Science 2025-06-17 Liad Erez, Tal Lancewicki, Uri Sherman, Tomer Koren, Yishay Mansour

Approximating Nash Equilibria in General-Sum Games via Meta-Learning

Nash equilibrium is perhaps the best-known solution concept in game theory. Such a solution assigns a strategy to each player which offers no incentive to unilaterally deviate. While a Nash equilibrium is guaranteed to always exist, the…

Computer Science and Game Theory · Computer Science 2025-04-29 David Sychrovský, Christopher Solinas, Revan MacQueen, Kevin Wang, James R. Wright, Nathan R. Sturtevant, Michael Bowling

Competing Against Equilibria in Zero-Sum Games with Evolving Payoffs

We study the problem of repeated play in a zero-sum game in which the payoff matrix may change, in a possibly adversarial fashion, on each round; we call these Online Matrix Games. Finding the Nash Equilibrium (NE) of a two player zero-sum…

Machine Learning · Computer Science 2020-04-06 Adrian Rivera Cardoso, Jacob Abernethy, He Wang, Huan Xu

Adaptive Learning in Continuous Games: Optimal Regret Bounds and Convergence to Nash Equilibrium

In game-theoretic learning, several agents are simultaneously following their individual interests, so the environment is non-stationary from each player's perspective. In this context, the performance of a learning algorithm is often…

Computer Science and Game Theory · Computer Science 2021-10-19 Yu-Guan Hsieh, Kimon Antonakopoulos, Panayotis Mertikopoulos

Last-iterate Convergence of Decentralized Optimistic Gradient Descent/Ascent in Infinite-horizon Competitive Markov Games

We study infinite-horizon discounted two-player zero-sum Markov games, and develop a decentralized algorithm that provably converges to the set of Nash equilibria under self-play. Our algorithm is based on running an Optimistic Gradient…

Machine Learning · Computer Science 2021-07-08 Chen-Yu Wei, Chung-Wei Lee, Mengxiao Zhang, Haipeng Luo

Convergence to Nash Equilibrium and No-regret Guarantee in (Markov) Potential Games

In this work, we study potential games and Markov potential games under stochastic cost and bandit feedback. We propose a variant of the Frank-Wolfe algorithm with sufficient exploration and recursive gradient estimation, which provably…

Computer Science and Game Theory · Computer Science 2024-04-11 Jing Dong, Baoxiang Wang, Yaoliang Yu

Model-Based Reinforcement Learning for Offline Zero-Sum Markov Games

This paper makes progress towards learning Nash equilibria in two-player zero-sum Markov games from offline data. Specifically, consider a $\gamma$-discounted infinite-horizon Markov game with $S$ states, where the max-player has $A$…

Machine Learning · Computer Science 2025-03-18 Yuling Yan, Gen Li, Yuxin Chen, Jianqing Fan

Acceleration through Optimistic No-Regret Dynamics

We consider the problem of minimizing a smooth convex function by reducing the optimization to computing the Nash equilibrium of a particular zero-sum convex-concave game. Zero-sum games can be solved using online learning dynamics, where a…

Machine Learning · Computer Science 2018-11-16 Jun-Kun Wang, Jacob Abernethy

Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum Linear Quadratic Games

We study the global convergence of policy optimization for finding the Nash equilibria (NE) in zero-sum linear quadratic (LQ) games. To this end, we first investigate the landscape of LQ games, viewing it as a nonconvex-nonconcave…

Machine Learning · Computer Science 2021-02-12 Kaiqing Zhang, Zhuoran Yang, Tamer Başar

Taming Equilibrium Bias in Risk-Sensitive Multi-Agent Reinforcement Learning

We study risk-sensitive multi-agent reinforcement learning under general-sum Markov games, where agents optimize the entropic risk measure of rewards with possibly diverse risk preferences. We show that using the regret naively adapted from…

Machine Learning · Computer Science 2024-05-07 Yingjie Fei, Ruitu Xu

Efficiently Computing Nash Equilibria in Adversarial Team Markov Games

Computing Nash equilibrium policies is a central problem in multi-agent reinforcement learning that has received extensive attention both in theory and in practice. However, provable guarantees have been thus far either limited to fully…

Computer Science and Game Theory · Computer Science 2022-08-04 Fivos Kalogiannis, Ioannis Anagnostides, Ioannis Panageas, Emmanouil-Vasileios Vlatakis-Gkaragkounis, Vaggos Chatziafratis, Stelios Stavroulakis