English
Related papers

Related papers: Fixing Incomplete Value Function Decomposition for…

200 papers

In the real world, many tasks require multiple agents to cooperate with each other under the condition of local observations. To solve such problems, many multi-agent reinforcement learning methods based on Centralized Training with…

Multiagent Systems · Computer Science 2021-06-23 Zhiwei Xu , Dapeng Li , Yunpeng Bai , Guoliang Fan

Value function decomposition is becoming a popular rule of thumb for scaling up multi-agent reinforcement learning (MARL) in cooperative games. For such a decomposition rule to hold, the assumption of the individual-global max (IGM)…

Machine Learning · Computer Science 2022-02-17 Zehao Dou , Jakub Grudzien Kuba , Yaodong Yang

In many real-world settings, a team of agents must coordinate their behaviour while acting in a decentralised way. At the same time, it is often possible to train the agents in a centralised fashion in a simulated or laboratory setting,…

Value decomposition methods have gained popularity in the field of cooperative multi-agent reinforcement learning. However, almost all existing methods follow the principle of Individual Global Max (IGM) or its variants, which limits their…

Multiagent Systems · Computer Science 2023-05-18 Zhiwei Xu , Bin Zhang , Dapeng Li , Guangchong Zhou , Zeren Zhang , Guoliang Fan

Multi-agent value-based approaches recently make great progress, especially value decomposition methods. However, there are still a lot of limitations in value function factorization. In VDN, the joint action-value function is the sum of…

Artificial Intelligence · Computer Science 2021-07-14 Quanlin Chen

We explore value-based multi-agent reinforcement learning (MARL) in the popular paradigm of centralized training with decentralized execution (CTDE). CTDE has an important concept, Individual-Global-Max (IGM) principle, which requires the…

Machine Learning · Computer Science 2021-10-05 Jianhao Wang , Zhizhou Ren , Terry Liu , Yang Yu , Chongjie Zhang

In many real-world settings, a team of agents must coordinate its behaviour while acting in a decentralised fashion. At the same time, it is often possible to train the agents in a centralised fashion where global state information is…

Multi-Agent Reinforcement Learning (MARL) is useful in many problems that require the cooperation and coordination of multiple agents. Learning optimal policies using reinforcement learning in a multi-agent setting can be very difficult as…

Machine Learning · Computer Science 2022-05-31 Rafael Pina , Varuna De Silva , Joosep Hook , Ahmet Kondoz

Value decomposition is a central approach in multi-agent reinforcement learning (MARL), enabling centralized training with decentralized execution by factorizing the global value function into local values. To ensure individual-global-max…

Machine Learning · Computer Science 2026-03-23 Tianmeng Hu , Yongzheng Cui , Rui Tang , Biao Luo , Ke Li

Value function factorization methods are commonly used in cooperative multi-agent reinforcement learning, with QMIX receiving significant attention. Many QMIX-based methods introduce monotonicity constraints between the joint action value…

Machine Learning · Computer Science 2025-04-11 Chang Huang , Shatong Zhu , Junqiao Zhao , Hongtu Zhou , Chen Ye , Tiantian Feng , Changjun Jiang

QMIX is a popular $Q$-learning algorithm for cooperative MARL in the centralised training and decentralised execution paradigm. In order to enable easy decentralisation, QMIX restricts the joint action $Q$-values it can represent to be a…

Machine Learning · Computer Science 2020-10-23 Tabish Rashid , Gregory Farquhar , Bei Peng , Shimon Whiteson

Value factorization is a popular and promising approach to scaling up multi-agent reinforcement learning in cooperative settings, which balances the learning scalability and the representational capacity of value functions. However, the…

Machine Learning · Computer Science 2021-11-02 Jianhao Wang , Zhizhou Ren , Beining Han , Jianing Ye , Chongjie Zhang

Value function factorization has achieved great success in multi-agent reinforcement learning by optimizing joint action-value functions through the maximization of factorized per-agent utilities. To ensure Individual-Global-Maximum…

Multiagent Systems · Computer Science 2023-12-27 Huiqun Li , Hanhan Zhou , Yifei Zou , Dongxiao Yu , Tian Lan

Extending deep Q-learning to cooperative multi-agent settings is challenging due to the exponential growth of the joint action space, the non-stationary environment, and the credit assignment problem. Value decomposition allows deep…

Artificial Intelligence · Computer Science 2025-03-13 Zak Buzzard

In multi-agent cooperative tasks, the presence of heterogeneous agents is familiar. Compared to cooperation among homogeneous agents, collaboration requires considering the best-suited sub-tasks for each agent. However, the operation of…

Multiagent Systems · Computer Science 2024-08-15 Songchen Fu , Shaojing Zhao , Ta Li , YongHong Yan

Centralized training is widely utilized in the field of multi-agent reinforcement learning (MARL) to assure the stability of training process. Once a joint policy is obtained, it is critical to design a value function factorization method…

Artificial Intelligence · Computer Science 2023-11-02 Rizhong Wang , Huiping Li , Di Cui , Demin Xu

Designing reinforcement learning (RL) agents is typically a difficult process that requires numerous design iterations. Learning can fail for a multitude of reasons, and standard RL methods provide too few tools to provide insight into the…

Machine Learning · Computer Science 2022-10-24 James MacGlashan , Evan Archer , Alisa Devlic , Takuma Seno , Craig Sherstan , Peter R. Wurman , Peter Stone

In many real-world tasks, multiple agents must learn to coordinate with each other given their private observations and limited communication ability. Deep multiagent reinforcement learning (Deep-MARL) algorithms have shown superior…

Multiagent Systems · Computer Science 2020-06-11 Yaodong Yang , Jianye Hao , Ben Liao , Kun Shao , Guangyong Chen , Wulong Liu , Hongyao Tang

The exploitation of extra state information has been an active research area in multi-agent reinforcement learning (MARL). QMIX represents the joint action-value using a non-negative function approximator and achieves the best performance,…

Artificial Intelligence · Computer Science 2020-12-21 Jianyu Su , Stephen Adams , Peter A. Beling

Value function factorization methods have become a dominant approach for cooperative multiagent reinforcement learning under a centralized training and decentralized execution paradigm. By factorizing the optimal joint action-value function…

Machine Learning · Computer Science 2023-02-14 Yongsheng Mei , Hanhan Zhou , Tian Lan
‹ Prev 1 2 3 10 Next ›