Related papers: QMIX: Monotonic Value Function Factorisation for D…

Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

In many real-world settings, a team of agents must coordinate its behaviour while acting in a decentralised fashion. At the same time, it is often possible to train the agents in a centralised fashion where global state information is…

Machine Learning · Computer Science 2020-09-22 Tabish Rashid , Mikayel Samvelyan , Christian Schroeder de Witt , Gregory Farquhar , Jakob Foerster , Shimon Whiteson

Weighted QMIX: Expanding Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

QMIX is a popular $Q$-learning algorithm for cooperative MARL in the centralised training and decentralised execution paradigm. In order to enable easy decentralisation, QMIX restricts the joint action $Q$-values it can represent to be a…

Machine Learning · Computer Science 2020-10-23 Tabish Rashid , Gregory Farquhar , Bei Peng , Shimon Whiteson

NQMIX: Non-monotonic Value Function Factorization for Deep Multi-Agent Reinforcement Learning

Multi-agent value-based approaches recently make great progress, especially value decomposition methods. However, there are still a lot of limitations in value function factorization. In VDN, the joint action-value function is the sum of…

Artificial Intelligence · Computer Science 2021-07-14 Quanlin Chen

POWQMIX: Weighted Value Factorization with Potentially Optimal Joint Actions Recognition for Cooperative Multi-Agent Reinforcement Learning

Value function factorization methods are commonly used in cooperative multi-agent reinforcement learning, with QMIX receiving significant attention. Many QMIX-based methods introduce monotonicity constraints between the joint action value…

Machine Learning · Computer Science 2025-04-11 Chang Huang , Shatong Zhu , Junqiao Zhao , Hongtu Zhou , Chen Ye , Tiantian Feng , Changjun Jiang

CoMIX: A Multi-agent Reinforcement Learning Training Architecture for Efficient Decentralized Coordination and Independent Decision-Making

Robust coordination skills enable agents to operate cohesively in shared environments, together towards a common goal and, ideally, individually without hindering each other's progress. To this end, this paper presents Coordinated QMIX…

Machine Learning · Computer Science 2024-12-25 Giovanni Minelli , Mirco Musolesi

MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning

In the real world, many tasks require multiple agents to cooperate with each other under the condition of local observations. To solve such problems, many multi-agent reinforcement learning methods based on Centralized Training with…

Multiagent Systems · Computer Science 2021-06-23 Zhiwei Xu , Dapeng Li , Yunpeng Bai , Guoliang Fan

Value Functions Factorization with Latent State Information Sharing in Decentralized Multi-Agent Policy Gradients

Value function factorization via centralized training and decentralized execution is promising for solving cooperative multi-agent reinforcement tasks. One of the approaches in this area, QMIX, has become state-of-the-art and achieved the…

Multiagent Systems · Computer Science 2023-07-27 Hanhan Zhou , Tian Lan , Vaneet Aggarwal

MAVEN: Multi-Agent Variational Exploration

Centralised training with decentralised execution is an important setting for cooperative deep multi-agent reinforcement learning due to communication constraints during execution and computational tractability in training. In this paper,…

Machine Learning · Computer Science 2020-01-22 Anuj Mahajan , Tabish Rashid , Mikayel Samvelyan , Shimon Whiteson

ConcaveQ: Non-Monotonic Value Function Factorization via Concave Representations in Deep Multi-Agent Reinforcement Learning

Value function factorization has achieved great success in multi-agent reinforcement learning by optimizing joint action-value functions through the maximization of factorized per-agent utilities. To ensure Individual-Global-Maximum…

Multiagent Systems · Computer Science 2023-12-27 Huiqun Li , Hanhan Zhou , Yifei Zou , Dongxiao Yu , Tian Lan

SMIX($\lambda$): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning

Learning a stable and generalizable centralized value function (CVF) is a crucial but challenging task in multi-agent reinforcement learning (MARL), as it has to deal with the issue that the joint action space increases exponentially with…

Multiagent Systems · Computer Science 2020-08-11 Xinghu Yao , Chao Wen , Yuhui Wang , Xiaoyang Tan

QR-MIX: Distributional Value Function Factorisation for Cooperative Multi-Agent Reinforcement Learning

In Cooperative Multi-Agent Reinforcement Learning (MARL) and under the setting of Centralized Training with Decentralized Execution (CTDE), agents observe and interact with their environment locally and independently. With local observation…

Machine Learning · Computer Science 2021-02-24 Jian Hu , Seth Austin Harding , Haibin Wu , Siyue Hu , Shih-wei Liao

RMIX: Learning Risk-Sensitive Policies for Cooperative Reinforcement Learning Agents

Current value-based multi-agent reinforcement learning methods optimize individual Q values to guide individuals' behaviours via centralized training with decentralized execution (CTDE). However, such expected, i.e., risk-neutral, Q value…

Machine Learning · Computer Science 2021-03-23 Wei Qiu , Xinrun Wang , Runsheng Yu , Xu He , Rundong Wang , Bo An , Svetlana Obraztsova , Zinovi Rabinovich

QTypeMix: Enhancing Multi-Agent Cooperative Strategies through Heterogeneous and Homogeneous Value Decomposition

In multi-agent cooperative tasks, the presence of heterogeneous agents is familiar. Compared to cooperation among homogeneous agents, collaboration requires considering the best-suited sub-tasks for each agent. However, the operation of…

Multiagent Systems · Computer Science 2024-08-15 Songchen Fu , Shaojing Zhao , Ta Li , YongHong Yan

Fixing Incomplete Value Function Decomposition for Multi-Agent Reinforcement Learning

Value function decomposition methods for cooperative multi-agent reinforcement learning compose joint values from individual per-agent utilities, and train them using a joint objective. To ensure that the action selection process between…

Machine Learning · Computer Science 2025-05-16 Andrea Baisero , Rupali Bhati , Shuo Liu , Aathira Pillai , Christopher Amato

AgentMixer: Multi-Agent Correlated Policy Factorization

In multi-agent reinforcement learning, centralized training with decentralized execution (CTDE) methods typically assume that agents make decisions based on their local observations independently, which may not lead to a correlated joint…

Multiagent Systems · Computer Science 2024-12-16 Zhiyuan Li , Wenshuai Zhao , Lijun Wu , Joni Pajarinen

FACMAC: Factored Multi-Agent Centralised Policy Gradients

We propose FACtored Multi-Agent Centralised policy gradients (FACMAC), a new method for cooperative multi-agent reinforcement learning in both discrete and continuous action spaces. Like MADDPG, a popular multi-agent actor-critic method,…

Machine Learning · Computer Science 2021-05-10 Bei Peng , Tabish Rashid , Christian A. Schroeder de Witt , Pierre-Alexandre Kamienny , Philip H. S. Torr , Wendelin Böhmer , Shimon Whiteson

Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization

Value factorization is a popular and promising approach to scaling up multi-agent reinforcement learning in cooperative settings, which balances the learning scalability and the representational capacity of value functions. However, the…

Machine Learning · Computer Science 2021-11-02 Jianhao Wang , Zhizhou Ren , Beining Han , Jianing Ye , Chongjie Zhang

Value-Decomposition Multi-Agent Actor-Critics

The exploitation of extra state information has been an active research area in multi-agent reinforcement learning (MARL). QMIX represents the joint action-value using a non-negative function approximator and achieves the best performance,…

Artificial Intelligence · Computer Science 2020-12-21 Jianyu Su , Stephen Adams , Peter A. Beling

ReMIX: Regret Minimization for Monotonic Value Function Factorization in Multiagent Reinforcement Learning

Value function factorization methods have become a dominant approach for cooperative multiagent reinforcement learning under a centralized training and decentralized execution paradigm. By factorizing the optimal joint action-value function…

Machine Learning · Computer Science 2023-02-14 Yongsheng Mei , Hanhan Zhou , Tian Lan

Mixed Q-Functionals: Advancing Value-Based Methods in Cooperative MARL with Continuous Action Domains

Tackling multi-agent learning problems efficiently is a challenging task in continuous action domains. While value-based algorithms excel in sample efficiency when applied to discrete action domains, they are usually inefficient when…

Multiagent Systems · Computer Science 2024-02-13 Yasin Findik , S. Reza Ahmadzadeh