Related papers: A Multiagent Reinforcement Learning Algorithm with…

Game Theory and Multi-Agent Reinforcement Learning: From Nash Equilibria to Evolutionary Dynamics

This paper explores advanced topics in complex multi-agent systems building upon our previous work. We examine four fundamental challenges in Multi-Agent Reinforcement Learning (MARL): non-stationarity, partial observability, scalability…

Multiagent Systems · Computer Science 2024-12-31 Neil De La Fuente, Miquel Noguer i Alonso, Guim Casadellà

NePPO: Near-Potential Policy Optimization for General-Sum Multi-Agent Reinforcement Learning

Multi-agent reinforcement learning (MARL) is increasingly used to design learning-enabled agents that interact in shared environments. However, training MARL algorithms in general-sum games remains challenging: learning dynamics can become…

Machine Learning · Computer Science 2026-04-07 Addison Kalanther, Sanika Bharvirkar, Shankar Sastry, Chinmay Maheshwari

Bi-level Actor-Critic for Multi-agent Coordination

Coordination is one of the essential problems in multi-agent systems. Typically multi-agent reinforcement learning (MARL) methods treat agents equally and the goal is to solve the Markov game to an arbitrary Nash equilibrium (NE) when…

Multiagent Systems · Computer Science 2020-04-07 Haifeng Zhang, Weizhe Chen, Zeren Huang, Minne Li, Yaodong Yang, Weinan Zhang, Jun Wang

Equilibrium Selection for Multi-agent Reinforcement Learning: A Unified Framework

While multi-agent reinforcement learning (MARL) has produced numerous algorithms that converge to Nash or related equilibria, such equilibria are often non-unique and can exhibit widely varying efficiency. This raises a fundamental…

Computer Science and Game Theory · Computer Science 2026-01-29 Runyu Zhang, Gioele Zardini, Asuman Ozdaglar, Jeff Shamma, Na Li

Faster Last-iterate Convergence of Policy Optimization in Zero-Sum Markov Games

Multi-Agent Reinforcement Learning (MARL) -- where multiple agents learn to interact in a shared dynamic environment -- permeates across a wide range of critical applications. While there has been substantial progress on understanding the…

Computer Science and Game Theory · Computer Science 2022-10-05 Shicong Cen, Yuejie Chi, Simon S. Du, Lin Xiao

Incentivize without Bonus: Provably Efficient Model-based Online Multi-agent RL for Markov Games

Multi-agent reinforcement learning (MARL) lies at the heart of a plethora of applications involving the interaction of a group of agents in a shared unknown environment. A prominent framework for studying MARL is Markov games, with the goal…

Machine Learning · Computer Science 2025-02-17 Tong Yang, Bo Dai, Lin Xiao, Yuejie Chi

V-Learning -- A Simple, Efficient, Decentralized Algorithm for Multiagent RL

A major challenge of multiagent reinforcement learning (MARL) is the curse of multiagents, where the size of the joint action space scales exponentially with the number of agents. This remains to be a bottleneck for designing efficient MARL…

Machine Learning · Computer Science 2021-10-28 Chi Jin, Qinghua Liu, Yuanhao Wang, Tiancheng Yu

Neural Auto-Curricula

When solving two-player zero-sum games, multi-agent reinforcement learning (MARL) algorithms often create populations of agents where, at each iteration, a new agent is discovered as the best response to a mixture over the opponent…

Artificial Intelligence · Computer Science 2021-11-02 Xidong Feng, Oliver Slumbers, Ziyu Wan, Bo Liu, Stephen McAleer, Ying Wen, Jun Wang, Yaodong Yang

A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning

To achieve general intelligence, agents must learn how to interact with others in a shared environment: this is the challenge of multiagent reinforcement learning (MARL). The simplest form is independent reinforcement learning (InRL), where…

Artificial Intelligence · Computer Science 2017-11-08 Marc Lanctot, Vinicius Zambaldi, Audrunas Gruslys, Angeliki Lazaridou, Karl Tuyls, Julien Perolat, David Silver, Thore Graepel

Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity

Model-based reinforcement learning (RL), which finds an optimal policy using an empirical model, has long been recognized as one of the corner stones of RL. It is especially suitable for multi-agent RL (MARL), as it naturally decouples the…

Machine Learning · Computer Science 2023-08-10 Kaiqing Zhang, Sham M. Kakade, Tamer Başar, Lin F. Yang

Policy Optimization for Markov Games: Unified Framework and Faster Convergence

This paper studies policy optimization algorithms for multi-agent reinforcement learning. We begin by proposing an algorithm framework for two-player zero-sum Markov Games in the full-information setting, where each iteration consists of a…

Machine Learning · Computer Science 2022-07-26 Runyu Zhang, Qinghua Liu, Huan Wang, Caiming Xiong, Na Li, Yu Bai

On the Complexity of Multi-Agent Decision Making: From Learning in Games to Partial Monitoring

A central problem in the theory of multi-agent reinforcement learning (MARL) is to understand what structural conditions and algorithmic principles lead to sample-efficient learning guarantees, and how these considerations change as we move…

Machine Learning · Computer Science 2023-05-02 Dylan J. Foster, Dean P. Foster, Noah Golowich, Alexander Rakhlin

Near-Optimal Reinforcement Learning with Self-Play under Adaptivity Constraints

We study the problem of multi-agent reinforcement learning (MARL) with adaptivity constraints -- a new problem motivated by real-world applications where deployments of new policies are costly and the number of policy updates must be…

Machine Learning · Computer Science 2024-02-05 Dan Qiao, Yu-Xiang Wang

Multi-Agent Guided Policy Search for Non-Cooperative Dynamic Games

Multi-agent reinforcement learning (MARL) optimizes strategic interactions in non-cooperative dynamic games, where agents have misaligned objectives. However, data-driven methods such as multi-agent policy gradients (MA-PG) often suffer…

Systems and Control · Electrical Eng. & Systems 2026-02-13 Jingqi Li, Gechen Qu, Jason J. Choi, Somayeh Sojoudi, Claire Tomlin

Learning in Nonzero-Sum Stochastic Games with Potentials

Multi-agent reinforcement learning (MARL) has become effective in tackling discrete cooperative game scenarios. However, MARL has yet to penetrate settings beyond those modelled by team and zero-sum games, confining it to a small subset of…

Multiagent Systems · Computer Science 2021-06-16 David Mguni, Yutong Wu, Yali Du, Yaodong Yang, Ziyi Wang, Minne Li, Ying Wen, Joel Jennings, Jun Wang

Preference-Based Multi-Agent Reinforcement Learning: Data Coverage and Algorithmic Techniques

We initiate the study of Preference-Based Multi-Agent Reinforcement Learning (PbMARL), exploring both theoretical foundations and empirical validations. We define the task as identifying the Nash equilibrium from a preference-only offline…

Machine Learning · Computer Science 2025-01-10 Natalia Zhang, Xinqi Wang, Qiwen Cui, Runlong Zhou, Sham M. Kakade, Simon S. Du

Provable Memory Efficient Self-Play Algorithm for Model-free Reinforcement Learning

The thriving field of multi-agent reinforcement learning (MARL) studies how a group of interacting agents make decisions autonomously in a shared dynamic environment. Existing theoretical studies in this area suffer from at least two of the…

Machine Learning · Computer Science 2025-12-02 Na Li, Yuchen Jiao, Hangguan Shan, Shefeng Yan

Multi-agent Reinforcement Learning with Sparse Interactions by Negotiation and Knowledge Transfer

Reinforcement learning has significant applications for multi-agent systems, especially in unknown dynamic environments. However, most multi-agent reinforcement learning (MARL) algorithms suffer from such problems as exponential computation…

Multiagent Systems · Computer Science 2016-04-01 Luowei Zhou, Pei Yang, Chunlin Chen, Yang Gao

Safe Multi-Agent Reinforcement Learning with Convergence to Generalized Nash Equilibrium

Multi-agent reinforcement learning (MARL) has achieved notable success in cooperative tasks, demonstrating impressive performance and scalability. However, deploying MARL agents in real-world applications presents critical safety…

Machine Learning · Computer Science 2024-11-25 Zeyang Li, Navid Azizan

Efficient Competitive Self-Play Policy Optimization

Reinforcement learning from self-play has recently reported many successes. Self-play, where the agents compete with themselves, is often used to generate training data for iterative policy improvement. In previous work, heuristic rules are…

Machine Learning · Computer Science 2020-09-15 Yuanyi Zhong, Yuan Zhou, Jian Peng