English
Related papers

Related papers: Learning Zero-Sum Simultaneous-Move Markov Games U…

200 papers

We study reinforcement learning for two-player zero-sum Markov games with simultaneous moves in the finite-horizon setting, where the transition kernel of the underlying Markov games can be parameterized by a linear function over the…

Machine Learning · Computer Science 2022-04-21 Zixiang Chen , Dongruo Zhou , Quanquan Gu

We consider learning Nash equilibria in two-player zero-sum Markov Games with nonlinear function approximation, where the action-value function is approximated by a function in a Reproducing Kernel Hilbert Space (RKHS). The key challenge is…

Machine Learning · Computer Science 2022-08-11 Chris Junchi Li , Dongruo Zhou , Quanquan Gu , Michael I. Jordan

This paper addresses the problem of learning an equilibrium efficiently in general-sum Markov games through decentralized multi-agent reinforcement learning. Given the fundamental difficulty of calculating a Nash equilibrium (NE), we…

Machine Learning · Computer Science 2022-02-01 Weichao Mao , Tamer Başar

No-regret learning dynamics play a central role in game theory, enabling decentralized convergence to equilibrium for concepts such as Coarse Correlated Equilibrium (CCE) or Correlated Equilibrium (CE). In this work, we improve the…

Computer Science and Game Theory · Computer Science 2025-11-05 Asrin Efe Yorulmaz , Tamer Başar

The works of (Daskalakis et al., 2009, 2022; Jin et al., 2022; Deng et al., 2023) indicate that computing Nash equilibria in multi-player Markov games is a computationally hard task. This fact raises the question of whether or not…

Computer Science and Game Theory · Computer Science 2023-05-30 Fivos Kalogiannis , Ioannis Panageas

We study policy optimization algorithms for computing correlated equilibria in multi-player general-sum Markov Games. Previous results achieve $O(T^{-1/2})$ convergence rate to a correlated equilibrium and an accelerated $O(T^{-3/4})$…

Machine Learning · Computer Science 2024-05-03 Yang Cai , Haipeng Luo , Chen-Yu Wei , Weiqiang Zheng

This paper considers the problem of designing optimal algorithms for reinforcement learning in two-player zero-sum games. We focus on self-play algorithms which learn the optimal policy by playing against itself without any direct…

Machine Learning · Computer Science 2020-07-15 Yu Bai , Chi Jin , Tiancheng Yu

This paper studies multi-agent reinforcement learning in Markov games, with the goal of learning Nash equilibria or coarse correlated equilibria (CCE) sample-optimally. All prior results suffer from at least one of the two obstacles: the…

Machine Learning · Computer Science 2022-10-13 Gen Li , Yuejie Chi , Yuting Wei , Yuxin Chen

Computing approximate Nash equilibria in multi-player general-sum Markov games is a computationally intractable task. However, multi-player Markov games with certain cooperative or competitive structures might circumvent this…

Computer Science and Game Theory · Computer Science 2023-08-17 Zailin Ma , Jiansheng Yang , Zhihua Zhang

An abundance of recent impossibility results establish that regret minimization in Markov games with adversarial opponents is both statistically and computationally intractable. Nevertheless, none of these results preclude the possibility…

Machine Learning · Computer Science 2025-06-17 Liad Erez , Tal Lancewicki , Uri Sherman , Tomer Koren , Yishay Mansour

Nash equilibrium is perhaps the best-known solution concept in game theory. Such a solution assigns a strategy to each player which offers no incentive to unilaterally deviate. While a Nash equilibrium is guaranteed to always exist, the…

Computer Science and Game Theory · Computer Science 2025-04-29 David Sychrovský , Christopher Solinas , Revan MacQueen , Kevin Wang , James R. Wright , Nathan R. Sturtevant , Michael Bowling

We study the problem of repeated play in a zero-sum game in which the payoff matrix may change, in a possibly adversarial fashion, on each round; we call these Online Matrix Games. Finding the Nash Equilibrium (NE) of a two player zero-sum…

Machine Learning · Computer Science 2020-04-06 Adrian Rivera Cardoso , Jacob Abernethy , He Wang , Huan Xu

In game-theoretic learning, several agents are simultaneously following their individual interests, so the environment is non-stationary from each player's perspective. In this context, the performance of a learning algorithm is often…

Computer Science and Game Theory · Computer Science 2021-10-19 Yu-Guan Hsieh , Kimon Antonakopoulos , Panayotis Mertikopoulos

We study infinite-horizon discounted two-player zero-sum Markov games, and develop a decentralized algorithm that provably converges to the set of Nash equilibria under self-play. Our algorithm is based on running an Optimistic Gradient…

Machine Learning · Computer Science 2021-07-08 Chen-Yu Wei , Chung-Wei Lee , Mengxiao Zhang , Haipeng Luo

In this work, we study potential games and Markov potential games under stochastic cost and bandit feedback. We propose a variant of the Frank-Wolfe algorithm with sufficient exploration and recursive gradient estimation, which provably…

Computer Science and Game Theory · Computer Science 2024-04-11 Jing Dong , Baoxiang Wang , Yaoliang Yu

This paper makes progress towards learning Nash equilibria in two-player zero-sum Markov games from offline data. Specifically, consider a $\gamma$-discounted infinite-horizon Markov game with $S$ states, where the max-player has $A$…

Machine Learning · Computer Science 2025-03-18 Yuling Yan , Gen Li , Yuxin Chen , Jianqing Fan

We consider the problem of minimizing a smooth convex function by reducing the optimization to computing the Nash equilibrium of a particular zero-sum convex-concave game. Zero-sum games can be solved using online learning dynamics, where a…

Machine Learning · Computer Science 2018-11-16 Jun-Kun Wang , Jacob Abernethy

We study the global convergence of policy optimization for finding the Nash equilibria (NE) in zero-sum linear quadratic (LQ) games. To this end, we first investigate the landscape of LQ games, viewing it as a nonconvex-nonconcave…

Machine Learning · Computer Science 2021-02-12 Kaiqing Zhang , Zhuoran Yang , Tamer Başar

We study risk-sensitive multi-agent reinforcement learning under general-sum Markov games, where agents optimize the entropic risk measure of rewards with possibly diverse risk preferences. We show that using the regret naively adapted from…

Machine Learning · Computer Science 2024-05-07 Yingjie Fei , Ruitu Xu

Computing Nash equilibrium policies is a central problem in multi-agent reinforcement learning that has received extensive attention both in theory and in practice. However, provable guarantees have been thus far either limited to fully…

‹ Prev 1 2 3 10 Next ›