Related papers: Fixing Incomplete Value Function Decomposition for…

MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning

In the real world, many tasks require multiple agents to cooperate with each other under the condition of local observations. To solve such problems, many multi-agent reinforcement learning methods based on Centralized Training with…

Multiagent Systems · Computer Science 2021-06-23 Zhiwei Xu, Dapeng Li, Yunpeng Bai, Guoliang Fan

Understanding Value Decomposition Algorithms in Deep Cooperative Multi-Agent Reinforcement Learning

Value function decomposition is becoming a popular rule of thumb for scaling up multi-agent reinforcement learning (MARL) in cooperative games. For such a decomposition rule to hold, the assumption of the individual-global max (IGM)…

Machine Learning · Computer Science 2022-02-17 Zehao Dou, Jakub Grudzien Kuba, Yaodong Yang

QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

In many real-world settings, a team of agents must coordinate their behaviour while acting in a decentralised way. At the same time, it is often possible to train the agents in a centralised fashion in a simulated or laboratory setting,…

Machine Learning · Computer Science 2018-06-07 Tabish Rashid, Mikayel Samvelyan, Christian Schroeder de Witt, Gregory Farquhar, Jakob Foerster, Shimon Whiteson

Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative Multi-Agent Reinforcement Learning

Value decomposition methods have gained popularity in the field of cooperative multi-agent reinforcement learning. However, almost all existing methods follow the principle of Individual Global Max (IGM) or its variants, which limits their…

Multiagent Systems · Computer Science 2023-05-18 Zhiwei Xu, Bin Zhang, Dapeng Li, Guangchong Zhou, Zeren Zhang, Guoliang Fan

NQMIX: Non-monotonic Value Function Factorization for Deep Multi-Agent Reinforcement Learning

Multi-agent value-based approaches recently make great progress, especially value decomposition methods. However, there are still a lot of limitations in value function factorization. In VDN, the joint action-value function is the sum of…

Artificial Intelligence · Computer Science 2021-07-14 Quanlin Chen

QPLEX: Duplex Dueling Multi-Agent Q-Learning

We explore value-based multi-agent reinforcement learning (MARL) in the popular paradigm of centralized training with decentralized execution (CTDE). CTDE has an important concept, Individual-Global-Max (IGM) principle, which requires the…

Machine Learning · Computer Science 2021-10-05 Jianhao Wang, Zhizhou Ren, Terry Liu, Yang Yu, Chongjie Zhang

Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

In many real-world settings, a team of agents must coordinate its behaviour while acting in a decentralised fashion. At the same time, it is often possible to train the agents in a centralised fashion where global state information is…

Machine Learning · Computer Science 2020-09-22 Tabish Rashid, Mikayel Samvelyan, Christian Schroeder de Witt, Gregory Farquhar, Jakob Foerster, Shimon Whiteson

Residual Q-Networks for Value Function Factorizing in Multi-Agent Reinforcement Learning

Multi-Agent Reinforcement Learning (MARL) is useful in many problems that require the cooperation and coordination of multiple agents. Learning optimal policies using reinforcement learning in a multi-agent setting can be very difficult as…

Machine Learning · Computer Science 2022-05-31 Rafael Pina, Varuna De Silva, Joosep Hook, Ahmet Kondoz

Beyond Monotonicity: Revisiting Factorization Principles in Multi-Agent Q-Learning

Value decomposition is a central approach in multi-agent reinforcement learning (MARL), enabling centralized training with decentralized execution by factorizing the global value function into local values. To ensure individual-global-max…

Machine Learning · Computer Science 2026-03-23 Tianmeng Hu, Yongzheng Cui, Rui Tang, Biao Luo, Ke Li

POWQMIX: Weighted Value Factorization with Potentially Optimal Joint Actions Recognition for Cooperative Multi-Agent Reinforcement Learning

Value function factorization methods are commonly used in cooperative multi-agent reinforcement learning, with QMIX receiving significant attention. Many QMIX-based methods introduce monotonicity constraints between the joint action value…

Machine Learning · Computer Science 2025-04-11 Chang Huang, Shatong Zhu, Junqiao Zhao, Hongtu Zhou, Chen Ye, Tiantian Feng, Changjun Jiang

Weighted QMIX: Expanding Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

QMIX is a popular $Q$-learning algorithm for cooperative MARL in the centralised training and decentralised execution paradigm. In order to enable easy decentralisation, QMIX restricts the joint action $Q$-values it can represent to be a…

Machine Learning · Computer Science 2020-10-23 Tabish Rashid, Gregory Farquhar, Bei Peng, Shimon Whiteson

Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization

Value factorization is a popular and promising approach to scaling up multi-agent reinforcement learning in cooperative settings, which balances the learning scalability and the representational capacity of value functions. However, the…

Machine Learning · Computer Science 2021-11-02 Jianhao Wang, Zhizhou Ren, Beining Han, Jianing Ye, Chongjie Zhang

ConcaveQ: Non-Monotonic Value Function Factorization via Concave Representations in Deep Multi-Agent Reinforcement Learning

Value function factorization has achieved great success in multi-agent reinforcement learning by optimizing joint action-value functions through the maximization of factorized per-agent utilities. To ensure Individual-Global-Maximum…

Multiagent Systems · Computer Science 2023-12-27 Huiqun Li, Hanhan Zhou, Yifei Zou, Dongxiao Yu, Tian Lan

PairVDN - Pair-wise Decomposed Value Functions

Extending deep Q-learning to cooperative multi-agent settings is challenging due to the exponential growth of the joint action space, the non-stationary environment, and the credit assignment problem. Value decomposition allows deep…

Artificial Intelligence · Computer Science 2025-03-13 Zak Buzzard

QTypeMix: Enhancing Multi-Agent Cooperative Strategies through Heterogeneous and Homogeneous Value Decomposition

In multi-agent cooperative tasks, the presence of heterogeneous agents is familiar. Compared to cooperation among homogeneous agents, collaboration requires considering the best-suited sub-tasks for each agent. However, the operation of…

Multiagent Systems · Computer Science 2024-08-15 Songchen Fu, Shaojing Zhao, Ta Li, YongHong Yan

QFree: A Universal Value Function Factorization for Multi-Agent Reinforcement Learning

Centralized training is widely utilized in the field of multi-agent reinforcement learning (MARL) to assure the stability of training process. Once a joint policy is obtained, it is critical to design a value function factorization method…

Artificial Intelligence · Computer Science 2023-11-02 Rizhong Wang, Huiping Li, Di Cui, Demin Xu

Value Function Decomposition for Iterative Design of Reinforcement Learning Agents

Designing reinforcement learning (RL) agents is typically a difficult process that requires numerous design iterations. Learning can fail for a multitude of reasons, and standard RL methods provide too few tools to provide insight into the…

Machine Learning · Computer Science 2022-10-24 James MacGlashan, Evan Archer, Alisa Devlic, Takuma Seno, Craig Sherstan, Peter R. Wurman, Peter Stone

Qatten: A General Framework for Cooperative Multiagent Reinforcement Learning

In many real-world tasks, multiple agents must learn to coordinate with each other given their private observations and limited communication ability. Deep multiagent reinforcement learning (Deep-MARL) algorithms have shown superior…

Multiagent Systems · Computer Science 2020-06-11 Yaodong Yang, Jianye Hao, Ben Liao, Kun Shao, Guangyong Chen, Wulong Liu, Hongyao Tang

Value-Decomposition Multi-Agent Actor-Critics

The exploitation of extra state information has been an active research area in multi-agent reinforcement learning (MARL). QMIX represents the joint action-value using a non-negative function approximator and achieves the best performance,…

Artificial Intelligence · Computer Science 2020-12-21 Jianyu Su, Stephen Adams, Peter A. Beling

ReMIX: Regret Minimization for Monotonic Value Function Factorization in Multiagent Reinforcement Learning

Value function factorization methods have become a dominant approach for cooperative multiagent reinforcement learning under a centralized training and decentralized execution paradigm. By factorizing the optimal joint action-value function…

Machine Learning · Computer Science 2023-02-14 Yongsheng Mei, Hanhan Zhou, Tian Lan