Related papers: Monotonic Value Function Factorisation for Deep Mu…

200 papers

In many real-world settings, a team of agents must coordinate their behaviour while acting in a decentralised way. At the same time, it is often possible to train the agents in a centralised fashion in a simulated or laboratory setting,…

Multi-agent value-based approaches have recently made great progress, especially value decomposition methods. However, value function factorization still has many limitations. In VDN, the joint action-value function is the sum of…

Artificial Intelligence · Computer Science 2021-07-14 Quanlin Chen

QMIX is a popular $Q$-learning algorithm for cooperative MARL in the centralised training and decentralised execution paradigm. In order to enable easy decentralisation, QMIX restricts the joint action $Q$-values it can represent to be a…

Machine Learning · Computer Science 2020-10-23 Tabish Rashid, Gregory Farquhar, Bei Peng, Shimon Whiteson

In the real world, many tasks require multiple agents to cooperate with each other under the condition of local observations. To solve such problems, many multi-agent reinforcement learning methods based on Centralized Training with…

Multiagent Systems · Computer Science 2021-06-23 Zhiwei Xu, Dapeng Li, Yunpeng Bai, Guoliang Fan

Centralised training with decentralised execution is an important setting for cooperative deep multi-agent reinforcement learning due to communication constraints during execution and computational tractability in training. In this paper,…

Machine Learning · Computer Science 2020-01-22 Anuj Mahajan, Tabish Rashid, Mikayel Samvelyan, Shimon Whiteson

Value function factorization methods are commonly used in cooperative multi-agent reinforcement learning, with QMIX receiving significant attention. Many QMIX-based methods introduce monotonicity constraints between the joint action value…

Machine Learning · Computer Science 2025-04-11 Chang Huang, Shatong Zhu, Junqiao Zhao, Hongtu Zhou, Chen Ye, Tiantian Feng, Changjun Jiang

Robust coordination skills enable agents to operate cohesively in shared environments, together towards a common goal and, ideally, individually without hindering each other's progress. To this end, this paper presents Coordinated QMIX…

Machine Learning · Computer Science 2024-12-25 Giovanni Minelli, Mirco Musolesi

Value function factorization via centralized training and decentralized execution is promising for solving cooperative multi-agent reinforcement tasks. One of the approaches in this area, QMIX, has become state-of-the-art and achieved the…

Multiagent Systems · Computer Science 2023-07-27 Hanhan Zhou, Tian Lan, Vaneet Aggarwal

Learning a stable and generalizable centralized value function (CVF) is a crucial but challenging task in multi-agent reinforcement learning (MARL), as it has to deal with the issue that the joint action space increases exponentially with…

Multiagent Systems · Computer Science 2020-08-11 Xinghu Yao, Chao Wen, Yuhui Wang, Xiaoyang Tan

Value function factorization has achieved great success in multi-agent reinforcement learning by optimizing joint action-value functions through the maximization of factorized per-agent utilities. To ensure Individual-Global-Maximum…

Multiagent Systems · Computer Science 2023-12-27 Huiqun Li, Hanhan Zhou, Yifei Zou, Dongxiao Yu, Tian Lan

In Cooperative Multi-Agent Reinforcement Learning (MARL) and under the setting of Centralized Training with Decentralized Execution (CTDE), agents observe and interact with their environment locally and independently. With local observation…

Machine Learning · Computer Science 2021-02-24 Jian Hu, Seth Austin Harding, Haibin Wu, Siyue Hu, Shih-wei Liao

We explore value decomposition solutions for multi-agent deep reinforcement learning in the popular paradigm of centralized training with decentralized execution (CTDE). As the recognized best solution to CTDE, Weighted QMIX is cutting-edge…

Multiagent Systems · Computer Science 2022-08-09 Kai Liu, Tianxian Zhang, Lingjiang Kong

We propose FACtored Multi-Agent Centralised policy gradients (FACMAC), a new method for cooperative multi-agent reinforcement learning in both discrete and continuous action spaces. Like MADDPG, a popular multi-agent actor-critic method,…

Many complex multi-agent systems such as robot swarm control and autonomous vehicle coordination can be modeled as Multi-Agent Reinforcement Learning (MARL) tasks. QMIX, a widely popular MARL algorithm, has been used as a baseline for the…

Machine Learning · Computer Science 2023-06-09 Jian Hu, Siyang Jiang, Seth Austin Harding, Haibin Wu, Shih-wei Liao

Value function factorization methods have become a dominant approach for cooperative multiagent reinforcement learning under a centralized training and decentralized execution paradigm. By factorizing the optimal joint action-value function…

Machine Learning · Computer Science 2023-02-14 Yongsheng Mei, Hanhan Zhou, Tian Lan

In multi-agent cooperative tasks, heterogeneous agents are common. Compared to cooperation among homogeneous agents, collaboration requires considering the best-suited sub-tasks for each agent. However, the operation of…

Multiagent Systems · Computer Science 2024-08-15 Songchen Fu, Shaojing Zhao, Ta Li, YongHong Yan

We propose a novel framework for value function factorization in multi-agent deep reinforcement learning (MARL) using graph neural networks (GNNs). In particular, we consider the team of agents as the set of nodes of a complete directed…

Machine Learning · Computer Science 2021-02-11 Navid Naderializadeh, Fan H. Hung, Sean Soleyman, Deepak Khosla

Recent years have witnessed the great success of multi-agent systems (MAS). Value decomposition, which decomposes joint action values into individual action values, has been an important line of work in MAS. However, many value decomposition…

Artificial Intelligence · Computer Science 2022-04-29 Yunpeng Bai, Chen Gong, Bin Zhang, Guoliang Fan, Xinwen Hou, Yu Liu

Current value-based multi-agent reinforcement learning methods optimize individual Q values to guide individuals' behaviours via centralized training with decentralized execution (CTDE). However, such expected, i.e., risk-neutral, Q value…

Machine Learning · Computer Science 2021-03-23 Wei Qiu, Xinrun Wang, Runsheng Yu, Xu He, Rundong Wang, Bo An, Svetlana Obraztsova, Zinovi Rabinovich

Deep reinforcement learning methods have shown great performance on many challenging cooperative multi-agent tasks. Two main promising research directions are multi-agent value function decomposition and multi-agent policy gradients. In…

Artificial Intelligence · Computer Science 2021-05-11 Yuan Pu, Shaochen Wang, Rui Yang, Xin Yao, Bin Li