Related papers: QPLEX: Duplex Dueling Multi-Agent Q-Learning

Residual Q-Networks for Value Function Factorizing in Multi-Agent Reinforcement Learning

Multi-Agent Reinforcement Learning (MARL) is useful in many problems that require the cooperation and coordination of multiple agents. Learning optimal policies using reinforcement learning in a multi-agent setting can be very difficult as…

Machine Learning · Computer Science 2022-05-31 Rafael Pina, Varuna De Silva, Joosep Hook, Ahmet Kondoz

Distributionally Robust Cooperative Multi-Agent Reinforcement Learning via Robust Value Factorization

Cooperative multi-agent reinforcement learning (MARL) commonly adopts centralized training with decentralized execution, where value-factorization methods enforce the individual-global-maximum (IGM) principle so that decentralized greedy…

Artificial Intelligence · Computer Science 2026-02-13 Chengrui Qu, Christopher Yeh, Kishan Panaganti, Eric Mazumdar, Adam Wierman

QFree: A Universal Value Function Factorization for Multi-Agent Reinforcement Learning

Centralized training is widely utilized in the field of multi-agent reinforcement learning (MARL) to assure the stability of training process. Once a joint policy is obtained, it is critical to design a value function factorization method…

Artificial Intelligence · Computer Science 2023-11-02 Rizhong Wang, Huiping Li, Di Cui, Demin Xu

QVMix and QVMix-Max: Extending the Deep Quality-Value Family of Algorithms to Cooperative Multi-Agent Reinforcement Learning

This paper introduces four new algorithms that can be used for tackling multi-agent reinforcement learning (MARL) problems occurring in cooperative settings. All algorithms are based on the Deep Quality-Value (DQV) family of algorithms, a…

Machine Learning · Computer Science 2020-12-23 Pascal Leroy, Damien Ernst, Pierre Geurts, Gilles Louppe, Jonathan Pisane, Matthia Sabatelli

Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning

Many advances in cooperative multi-agent reinforcement learning (MARL) are based on two common design principles: value decomposition and parameter sharing. A typical MARL algorithm of this fashion decomposes a centralized Q-function into…

Artificial Intelligence · Computer Science 2022-08-09 Wei Fu, Chao Yu, Zelai Xu, Jiaqi Yang, Yi Wu

ME-IGM: Individual-Global-Max in Maximum Entropy Multi-Agent Reinforcement Learning

Multi-agent credit assignment is a fundamental challenge for cooperative multi-agent reinforcement learning (MARL), where a team of agents learn from shared reward signals. The Individual-Global-Max (IGM) condition is a widely used…

Machine Learning · Computer Science 2026-02-04 Wen-Tse Chen, Yuxuan Li, Shiyu Huang, Jiayu Chen, Jeff Schneider

Beyond Monotonicity: Revisiting Factorization Principles in Multi-Agent Q-Learning

Value decomposition is a central approach in multi-agent reinforcement learning (MARL), enabling centralized training with decentralized execution by factorizing the global value function into local values. To ensure individual-global-max…

Machine Learning · Computer Science 2026-03-23 Tianmeng Hu, Yongzheng Cui, Rui Tang, Biao Luo, Ke Li

Fixing Incomplete Value Function Decomposition for Multi-Agent Reinforcement Learning

Value function decomposition methods for cooperative multi-agent reinforcement learning compose joint values from individual per-agent utilities, and train them using a joint objective. To ensure that the action selection process between…

Machine Learning · Computer Science 2025-05-16 Andrea Baisero, Rupali Bhati, Shuo Liu, Aathira Pillai, Christopher Amato

Agent Q-Mix: Selecting the Right Action for LLM Multi-Agent Systems through Reinforcement Learning

Large Language Models (LLMs) have shown remarkable performance in completing various tasks. However, solving complex problems often requires the coordination of multiple agents, raising a fundamental question: how to effectively select and…

Computation and Language · Computer Science 2026-04-02 Eric Hanchen Jiang, Levina Li, Rui Sun, Xiao Liang, Yubei Li, Yuchen Wu, Haozheng Luo, Hengli Li, Zhi Zhang, Zhaolu Kang, Kai-Wei Chang, Ying Nian Wu

Structured Cooperative Multi-Agent Reinforcement Learning: a Bayesian Network Perspective

The empirical success of multi-agent reinforcement learning (MARL) has motivated the search for more efficient and scalable algorithms for large scale multi-agent systems. However, existing state-of-the-art algorithms do not fully exploit…

Multiagent Systems · Computer Science 2025-10-14 Shahbaz P Qadri Syed, He Bai

Understanding Value Decomposition Algorithms in Deep Cooperative Multi-Agent Reinforcement Learning

Value function decomposition is becoming a popular rule of thumb for scaling up multi-agent reinforcement learning (MARL) in cooperative games. For such a decomposition rule to hold, the assumption of the individual-global max (IGM)…

Machine Learning · Computer Science 2022-02-17 Zehao Dou, Jakub Grudzien Kuba, Yaodong Yang

Quantum Multi-Agent Reinforcement Learning via Variational Quantum Circuit Design

In recent years, quantum computing (QC) has been getting a lot of attention from industry and academia. Especially, among various QC research topics, variational quantum circuit (VQC) enables quantum deep reinforcement learning (QRL). Many…

Quantum Physics · Physics 2022-04-12 Won Joon Yun, Yunseok Kwak, Jae Pyoung Kim, Hyunhee Cho, Soyi Jung, Jihong Park, Joongheon Kim

Q-value Path Decomposition for Deep Multiagent Reinforcement Learning

Recently, deep multiagent reinforcement learning (MARL) has become a highly active research area as many real-world problems can be inherently viewed as multiagent systems. A particularly interesting and widely applicable class of problems…

Multiagent Systems · Computer Science 2020-02-11 Yaodong Yang, Jianye Hao, Guangyong Chen, Hongyao Tang, Yingfeng Chen, Yujing Hu, Changjie Fan, Zhongyu Wei

QR-MIX: Distributional Value Function Factorisation for Cooperative Multi-Agent Reinforcement Learning

In Cooperative Multi-Agent Reinforcement Learning (MARL) and under the setting of Centralized Training with Decentralized Execution (CTDE), agents observe and interact with their environment locally and independently. With local observation…

Machine Learning · Computer Science 2021-02-24 Jian Hu, Seth Austin Harding, Haibin Wu, Siyue Hu, Shih-wei Liao

Inducing Cooperation via Team Regret Minimization based Multi-Agent Deep Reinforcement Learning

Existing value-factorized based Multi-Agent deep Reinforcement Learning (MARL) approaches are well-performing in various multi-agent cooperative environment under the centralized training and decentralized execution (CTDE) scheme, where all…

Artificial Intelligence · Computer Science 2019-11-19 Runsheng Yu, Zhenyu Shi, Xinrun Wang, Rundong Wang, Buhong Liu, Xinwen Hou, Hanjiang Lai, Bo An

Distributed Multi-Agent Reinforcement Learning Based on Graph-Induced Local Value Functions

Achieving distributed reinforcement learning (RL) for large-scale cooperative multi-agent systems (MASs) is challenging because: (i) each agent has access to only limited information; (ii) issues on convergence or computational complexity…

Machine Learning · Computer Science 2024-04-15 Gangshan Jing, He Bai, Jemin George, Aranya Chakrabortty, Piyush K. Sharma

MA2QL: A Minimalist Approach to Fully Decentralized Multi-Agent Reinforcement Learning

Decentralized learning has shown great promise for cooperative multi-agent reinforcement learning (MARL). However, non-stationarity remains a significant challenge in fully decentralized learning. In the paper, we tackle the…

Machine Learning · Computer Science 2023-02-08 Kefan Su, Siyuan Zhou, Jiechuan Jiang, Chuang Gan, Xiangjun Wang, Zongqing Lu

Mean-Field Multi-Agent Reinforcement Learning: A Decentralized Network Approach

One of the challenges for multi-agent reinforcement learning (MARL) is designing efficient learning algorithms for a large system in which each agent has only limited or partial information of the entire system. While exciting progress has…

Machine Learning · Computer Science 2022-02-22 Haotian Gu, Xin Guo, Xiaoli Wei, Renyuan Xu

Qatten: A General Framework for Cooperative Multiagent Reinforcement Learning

In many real-world tasks, multiple agents must learn to coordinate with each other given their private observations and limited communication ability. Deep multiagent reinforcement learning (Deep-MARL) algorithms have shown superior…

Multiagent Systems · Computer Science 2020-06-11 Yaodong Yang, Jianye Hao, Ben Liao, Kun Shao, Guangyong Chen, Wulong Liu, Hongyao Tang

QSIM: Mitigating Overestimation in Multi-Agent Reinforcement Learning via Action Similarity Weighted Q-Learning

Value decomposition (VD) methods have achieved remarkable success in cooperative multi-agent reinforcement learning (MARL). However, their reliance on the max operator for temporal-difference (TD) target calculation leads to systematic…

Multiagent Systems · Computer Science 2026-02-27 Yuanjun Li, Bin Zhang, Hao Chen, Zhouyang Jiang, Dapeng Li, Zhiwei Xu