Related papers: Offline Learning in Markov Games with General Func…

Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game

Offline reinforcement learning (RL) aims at learning an optimal strategy using a pre-collected dataset without further interactions with the environment. While various algorithms have been proposed for offline RL in the previous literature,…

Machine Learning · Computer Science 2023-03-02 Wei Xiong , Han Zhong , Chengshuai Shi , Cong Shen , Liwei Wang , Tong Zhang

The Power of Exploiter: Provable Multi-Agent RL in Large State Spaces

Modern reinforcement learning (RL) commonly engages practical problems with large state spaces, where function approximation must be deployed to approximate either the value function or the policy. While recent progresses in RL theory…

Machine Learning · Computer Science 2021-10-14 Chi Jin , Qinghua Liu , Tiancheng Yu

Learning Closed-Loop Parametric Nash Equilibria of Multi-Agent Collaborative Field Coverage

Multi-agent reinforcement learning is a challenging and active field of research due to the inherent nonstationary property and coupling between agents. A popular approach to modeling the multi-agent interactions underlying the multi-agent…

Multiagent Systems · Computer Science 2025-10-07 Jushan Chen , Santiago Paternain

When is Offline Two-Player Zero-Sum Markov Game Solvable?

We study what dataset assumption permits solving offline two-player zero-sum Markov games. In stark contrast to the offline single-agent Markov decision process, we show that the single strategy concentration assumption is insufficient for…

Machine Learning · Computer Science 2022-10-17 Qiwen Cui , Simon S. Du

Breaking the Curse of Multiagents in a Large State Space: RL in Markov Games with Independent Linear Function Approximation

We propose a new model, independent linear Markov game, for multi-agent reinforcement learning with a large state space and a large number of agents. This is a class of Markov games with independent linear function approximation, where each…

Machine Learning · Computer Science 2023-06-23 Qiwen Cui , Kaiqing Zhang , Simon S. Du

Learning Two-Player Mixture Markov Games: Kernel Function Approximation and Correlated Equilibrium

We consider learning Nash equilibria in two-player zero-sum Markov Games with nonlinear function approximation, where the action-value function is approximated by a function in a Reproducing Kernel Hilbert Space (RKHS). The key challenge is…

Machine Learning · Computer Science 2022-08-11 Chris Junchi Li , Dongruo Zhou , Quanquan Gu , Michael I. Jordan

Learning Nash Equilibrium for General-Sum Markov Games from Batch Data

This paper addresses the problem of learning a Nash equilibrium in $\gamma$-discounted multiplayer general-sum Markov Games (MG). A key component of this model is the possibility for the players to either collaborate or team apart to…

Computer Science and Game Theory · Computer Science 2017-03-07 Julien Pérolat , Florian Strub , Bilal Piot , Olivier Pietquin

Incentivize without Bonus: Provably Efficient Model-based Online Multi-agent RL for Markov Games

Multi-agent reinforcement learning (MARL) lies at the heart of a plethora of applications involving the interaction of a group of agents in a shared unknown environment. A prominent framework for studying MARL is Markov games, with the goal…

Machine Learning · Computer Science 2025-02-17 Tong Yang , Bo Dai , Lin Xiao , Yuejie Chi

Independent Policy Gradient for Large-Scale Markov Potential Games: Sharper Rates, Function Approximation, and Game-Agnostic Convergence

We examine global non-asymptotic convergence properties of policy gradient methods for multi-agent reinforcement learning (RL) problems in Markov potential games (MPG). To learn a Nash equilibrium of an MPG in which the size of state space…

Machine Learning · Computer Science 2022-08-08 Dongsheng Ding , Chen-Yu Wei , Kaiqing Zhang , Mihailo R. Jovanović

Model-Based Reinforcement Learning for Offline Zero-Sum Markov Games

This paper makes progress towards learning Nash equilibria in two-player zero-sum Markov games from offline data. Specifically, consider a $\gamma$-discounted infinite-horizon Markov game with $S$ states, where the max-player has $A$…

Machine Learning · Computer Science 2025-03-18 Yuling Yan , Gen Li , Yuxin Chen , Jianqing Fan

Provably Efficient Generalized Lagrangian Policy Optimization for Safe Multi-Agent Reinforcement Learning

We examine online safe multi-agent reinforcement learning using constrained Markov games in which agents compete by maximizing their expected total rewards under a constraint on expected total utilities. Our focus is confined to an episodic…

Machine Learning · Computer Science 2023-06-02 Dongsheng Ding , Xiaohan Wei , Zhuoran Yang , Zhaoran Wang , Mihailo R. Jovanović

Offline Equilibrium Finding

Offline reinforcement learning (offline RL) is an emerging field that has recently begun gaining attention across various application domains due to its ability to learn strategies from earlier collected datasets. Offline RL proved very…

Artificial Intelligence · Computer Science 2023-02-09 Shuxin Li , Xinrun Wang , Youzhi Zhang , Jakub Cerny , Pengdeng Li , Hau Chan , Bo An

Learning Approximate Nash Equilibria in Cooperative Multi-Agent Reinforcement Learning via Mean-Field Subsampling

Many large-scale platforms and networked control systems have a centralized decision maker interacting with a massive population of agents under strict observability constraints. Motivated by such applications, we study a cooperative Markov…

Multiagent Systems · Computer Science 2026-05-12 Emile Anand , Ishani Karmarkar

Sample-Efficient Tabular Self-Play for Offline Robust Reinforcement Learning

Multi-agent reinforcement learning (MARL), as a thriving field, explores how multiple agents independently make decisions in a shared dynamic environment. Due to environmental uncertainties, policies in MARL must remain robust to tackle the…

Machine Learning · Computer Science 2025-12-02 Na Li , Zewu Zheng , Wei Ni , Hangguan Shan , Wenjie Zhang , Xinyu Li

Conservative Equilibrium Discovery in Offline Game-Theoretic Multiagent Reinforcement Learning

Offline learning of strategies takes data efficiency to its extreme by restricting algorithms to a fixed dataset of state-action trajectories. We consider the problem in a mixed-motive multiagent setting, where the goal is to solve a game…

Artificial Intelligence · Computer Science 2026-03-03 Austin A. Nguyen , Michael P. Wellman

Can Reinforcement Learning Find Stackelberg-Nash Equilibria in General-Sum Markov Games with Myopic Followers?

We study multi-player general-sum Markov games with one of the players designated as the leader and the other players regarded as followers. In particular, we focus on the class of games where the followers are myopic, i.e., they aim to…

Machine Learning · Computer Science 2021-12-28 Han Zhong , Zhuoran Yang , Zhaoran Wang , Michael I. Jordan

MF-OML: Online Mean-Field Reinforcement Learning with Occupation Measures for Large Population Games

Reinforcement learning for multi-agent games has attracted lots of attention recently. However, given the challenge of solving Nash equilibria for large population games, existing works with guaranteed polynomial complexities either focus…

Optimization and Control · Mathematics 2025-09-04 Anran Hu , Junzi Zhang

Minimax-Optimal Multi-Agent RL in Markov Games With a Generative Model

This paper studies multi-agent reinforcement learning in Markov games, with the goal of learning Nash equilibria or coarse correlated equilibria (CCE) sample-optimally. All prior results suffer from at least one of the two obstacles: the…

Machine Learning · Computer Science 2022-10-13 Gen Li , Yuejie Chi , Yuting Wei , Yuxin Chen

Provably Efficient Reinforcement Learning in Decentralized General-Sum Markov Games

This paper addresses the problem of learning an equilibrium efficiently in general-sum Markov games through decentralized multi-agent reinforcement learning. Given the fundamental difficulty of calculating a Nash equilibrium (NE), we…

Machine Learning · Computer Science 2022-02-01 Weichao Mao , Tamer Başar

Sample-Efficient Robust Multi-Agent Reinforcement Learning in the Face of Environmental Uncertainty

To overcome the sim-to-real gap in reinforcement learning (RL), learned policies must maintain robustness against environmental uncertainties. While robust RL has been widely studied in single-agent regimes, in multi-agent environments, the…

Machine Learning · Computer Science 2024-05-10 Laixi Shi , Eric Mazumdar , Yuejie Chi , Adam Wierman