Related papers: Value-Decomposition Multi-Agent Actor-Critics

NQMIX: Non-monotonic Value Function Factorization for Deep Multi-Agent Reinforcement Learning

Multi-agent value-based approaches recently make great progress, especially value decomposition methods. However, there are still a lot of limitations in value function factorization. In VDN, the joint action-value function is the sum of…

Artificial Intelligence · Computer Science 2021-07-14 Quanlin Chen

Modeling the Interaction between Agents in Cooperative Multi-Agent Reinforcement Learning

Value-based methods of multi-agent reinforcement learning (MARL), especially the value decomposition methods, have been demonstrated on a range of challenging cooperative tasks. However, current methods pay little attention to the…

Machine Learning · Computer Science 2021-02-12 Xiaoteng Ma , Yiqin Yang , Chenghao Li , Yiwen Lu , Qianchuan Zhao , Yang Jun

Decomposed Soft Actor-Critic Method for Cooperative Multi-Agent Reinforcement Learning

Deep reinforcement learning methods have shown great performance on many challenging cooperative multi-agent tasks. Two main promising research directions are multi-agent value function decomposition and multi-agent policy gradients. In…

Artificial Intelligence · Computer Science 2021-05-11 Yuan Pu , Shaochen Wang , Rui Yang , Xin Yao , Bin Li

Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization

Value factorization is a popular and promising approach to scaling up multi-agent reinforcement learning in cooperative settings, which balances the learning scalability and the representational capacity of value functions. However, the…

Machine Learning · Computer Science 2021-11-02 Jianhao Wang , Zhizhou Ren , Beining Han , Jianing Ye , Chongjie Zhang

QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

In many real-world settings, a team of agents must coordinate their behaviour while acting in a decentralised way. At the same time, it is often possible to train the agents in a centralised fashion in a simulated or laboratory setting,…

Machine Learning · Computer Science 2018-06-07 Tabish Rashid , Mikayel Samvelyan , Christian Schroeder de Witt , Gregory Farquhar , Jakob Foerster , Shimon Whiteson

Value Functions Factorization with Latent State Information Sharing in Decentralized Multi-Agent Policy Gradients

Value function factorization via centralized training and decentralized execution is promising for solving cooperative multi-agent reinforcement tasks. One of the approaches in this area, QMIX, has become state-of-the-art and achieved the…

Multiagent Systems · Computer Science 2023-07-27 Hanhan Zhou , Tian Lan , Vaneet Aggarwal

MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning

In the real world, many tasks require multiple agents to cooperate with each other under the condition of local observations. To solve such problems, many multi-agent reinforcement learning methods based on Centralized Training with…

Multiagent Systems · Computer Science 2021-06-23 Zhiwei Xu , Dapeng Li , Yunpeng Bai , Guoliang Fan

Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

In many real-world settings, a team of agents must coordinate its behaviour while acting in a decentralised fashion. At the same time, it is often possible to train the agents in a centralised fashion where global state information is…

Machine Learning · Computer Science 2020-09-22 Tabish Rashid , Mikayel Samvelyan , Christian Schroeder de Witt , Gregory Farquhar , Jakob Foerster , Shimon Whiteson

Transformer-based Value Function Decomposition for Cooperative Multi-agent Reinforcement Learning in StarCraft

The StarCraft II Multi-Agent Challenge (SMAC) was created to be a challenging benchmark problem for cooperative multi-agent reinforcement learning (MARL). SMAC focuses exclusively on the problem of StarCraft micromanagement and assumes that…

Multiagent Systems · Computer Science 2022-08-16 Muhammad Junaid Khan , Syed Hammad Ahmed , Gita Sukthankar

Distributional Advantage Actor-Critic

In traditional reinforcement learning, an agent maximizes the reward collected during its interaction with the environment by approximating the optimal policy through the estimation of value functions. Typically, given a state s and action…

Machine Learning · Computer Science 2018-06-20 Shangda Li , Selina Bing , Steven Yang

Value Function Decomposition for Iterative Design of Reinforcement Learning Agents

Designing reinforcement learning (RL) agents is typically a difficult process that requires numerous design iterations. Learning can fail for a multitude of reasons, and standard RL methods provide too few tools to provide insight into the…

Machine Learning · Computer Science 2022-10-24 James MacGlashan , Evan Archer , Alisa Devlic , Takuma Seno , Craig Sherstan , Peter R. Wurman , Peter Stone

Fixing Incomplete Value Function Decomposition for Multi-Agent Reinforcement Learning

Value function decomposition methods for cooperative multi-agent reinforcement learning compose joint values from individual per-agent utilities, and train them using a joint objective. To ensure that the action selection process between…

Machine Learning · Computer Science 2025-05-16 Andrea Baisero , Rupali Bhati , Shuo Liu , Aathira Pillai , Christopher Amato

Off-Policy Multi-Agent Decomposed Policy Gradients

Multi-agent policy gradient (MAPG) methods recently witness vigorous progress. However, there is a significant performance discrepancy between MAPG methods and state-of-the-art multi-agent value-based approaches. In this paper, we…

Machine Learning · Computer Science 2020-10-06 Yihan Wang , Beining Han , Tonghan Wang , Heng Dong , Chongjie Zhang

F2A2: Flexible Fully-decentralized Approximate Actor-critic for Cooperative Multi-agent Reinforcement Learning

Traditional centralized multi-agent reinforcement learning (MARL) algorithms are sometimes unpractical in complicated applications, due to non-interactivity between agents, curse of dimensionality and computation complexity. Hence, several…

Machine Learning · Computer Science 2023-07-10 Wenhao Li , Bo Jin , Xiangfeng Wang , Junchi Yan , Hongyuan Zha

Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics

A key aspect of intelligence is the ability to demonstrate a broad spectrum of behaviors for adapting to unexpected situations. Over the past decade, advancements in deep reinforcement learning have led to groundbreaking achievements to…

Machine Learning · Computer Science 2024-06-04 Luca Grillotti , Maxence Faldor , Borja G. León , Antoine Cully

VDFD: Multi-Agent Value Decomposition Framework with Disentangled World Model

In this paper, we propose a novel model-based multi-agent reinforcement learning approach named Value Decomposition Framework with Disentangled World Model to address the challenge of achieving a common goal of multiple agents interacting…

Machine Learning · Computer Science 2025-09-29 Zhizun Wang , David Meger

Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning

We propose a novel framework for value function factorization in multi-agent deep reinforcement learning (MARL) using graph neural networks (GNNs). In particular, we consider the team of agents as the set of nodes of a complete directed…

Machine Learning · Computer Science 2021-02-11 Navid Naderializadeh , Fan H. Hung , Sean Soleyman , Deepak Khosla

FACMAC: Factored Multi-Agent Centralised Policy Gradients

We propose FACtored Multi-Agent Centralised policy gradients (FACMAC), a new method for cooperative multi-agent reinforcement learning in both discrete and continuous action spaces. Like MADDPG, a popular multi-agent actor-critic method,…

Machine Learning · Computer Science 2021-05-10 Bei Peng , Tabish Rashid , Christian A. Schroeder de Witt , Pierre-Alexandre Kamienny , Philip H. S. Torr , Wendelin Böhmer , Shimon Whiteson

Multi-Actor Multi-Critic Deep Deterministic Reinforcement Learning with a Novel Q-Ensemble Method

Reinforcement learning has gathered much attention in recent years due to its rapid development and rich applications, especially on control systems and robotics. When tackling real-world applications with reinforcement learning method, the…

Machine Learning · Computer Science 2025-10-02 Andy Wu , Chun-Cheng Lin , Rung-Tzuo Liaw , Yuehua Huang , Chihjung Kuo , Chia Tong Weng

Policy Gradient With Value Function Approximation For Collective Multiagent Planning

Decentralized (PO)MDPs provide an expressive framework for sequential decision making in a multiagent system. Given their computational complexity, recent research has focused on tractable yet practical subclasses of Dec-POMDPs. We address…

Artificial Intelligence · Computer Science 2018-04-11 Duc Thien Nguyen , Akshat Kumar , Hoong Chuin Lau