Related papers: Multi-Agent Determinantal Q-Learning

Factorized Q-Learning for Large-Scale Multi-Agent Systems

Deep Q-learning has achieved significant success in single-agent decision making tasks. However, it is challenging to extend Q-learning to large-scale multi-agent scenarios, due to the explosion of action space resulting from the complex…

Multiagent Systems · Computer Science 2019-10-14 Ming Zhou , Yong Chen , Ying Wen , Yaodong Yang , Yufeng Su , Weinan Zhang , Dell Zhang , Jun Wang

$QD$-Learning: A Collaborative Distributed Strategy for Multi-Agent Reinforcement Learning Through Consensus + Innovations

The paper considers a class of multi-agent Markov decision processes (MDPs), in which the network agents respond differently (as manifested by the instantaneous one-stage random costs) to a global controlled state and the control actions of…

Machine Learning · Statistics 2015-06-04 Soummya Kar , Jose' M. F. Moura , H. Vincent Poor

Qatten: A General Framework for Cooperative Multiagent Reinforcement Learning

In many real-world tasks, multiple agents must learn to coordinate with each other given their private observations and limited communication ability. Deep multiagent reinforcement learning (Deep-MARL) algorithms have shown superior…

Multiagent Systems · Computer Science 2020-06-11 Yaodong Yang , Jianye Hao , Ben Liao , Kun Shao , Guangyong Chen , Wulong Liu , Hongyao Tang

Q-value Path Decomposition for Deep Multiagent Reinforcement Learning

Recently, deep multiagent reinforcement learning (MARL) has become a highly active research area as many real-world problems can be inherently viewed as multiagent systems. A particularly interesting and widely applicable class of problems…

Multiagent Systems · Computer Science 2020-02-11 Yaodong Yang , Jianye Hao , Guangyong Chen , Hongyao Tang , Yingfeng Chen , Yujing Hu , Changjie Fan , Zhongyu Wei

Modular Multi-Objective Deep Reinforcement Learning with Decision Values

In this work we present a method for using Deep Q-Networks (DQNs) in multi-objective environments. Deep Q-Networks provide remarkable performance in single objective problems learning from high-level visual state representations. However,…

Artificial Intelligence · Computer Science 2018-02-26 Tomasz Tajmajer

Decentralized Multi-Agents by Imitation of a Centralized Controller

We consider a multi-agent reinforcement learning problem where each agent seeks to maximize a shared reward while interacting with other agents, and they may or may not be able to communicate. Typically the agents do not have access to…

Multiagent Systems · Computer Science 2021-04-26 Alex Tong Lin , Mark J. Debord , Katia Estabridis , Gary Hewer , Guido Montufar , Stanley Osher

Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization

Value factorization is a popular and promising approach to scaling up multi-agent reinforcement learning in cooperative settings, which balances the learning scalability and the representational capacity of value functions. However, the…

Machine Learning · Computer Science 2021-11-02 Jianhao Wang , Zhizhou Ren , Beining Han , Jianing Ye , Chongjie Zhang

Distributed Value Decomposition Networks with Networked Agents

We investigate the problem of distributed training under partial observability, whereby cooperative multi-agent reinforcement learning agents (MARL) maximize the expected cumulative joint reward. We propose distributed value decomposition…

Machine Learning · Computer Science 2025-02-12 Guilherme S. Varela , Alberto Sardinha , Francisco S. Melo

Deep Constrained Q-learning

In many real world applications, reinforcement learning agents have to optimize multiple objectives while following certain rules or satisfying a list of constraints. Classical methods based on reward shaping, i.e. a weighted combination of…

Machine Learning · Computer Science 2020-09-15 Gabriel Kalweit , Maria Huegle , Moritz Werling , Joschka Boedecker

Mixed Q-Functionals: Advancing Value-Based Methods in Cooperative MARL with Continuous Action Domains

Tackling multi-agent learning problems efficiently is a challenging task in continuous action domains. While value-based algorithms excel in sample efficiency when applied to discrete action domains, they are usually inefficient when…

Multiagent Systems · Computer Science 2024-02-13 Yasin Findik , S. Reza Ahmadzadeh

Macro-Action-Based Deep Multi-Agent Reinforcement Learning

In real-world multi-robot systems, performing high-quality, collaborative behaviors requires robots to asynchronously reason about high-level action selection at varying time durations. Macro-Action Decentralized Partially Observable Markov…

Machine Learning · Computer Science 2021-10-19 Yuchen Xiao , Joshua Hoffman , Christopher Amato

Learning Multi-Robot Decentralized Macro-Action-Based Policies via a Centralized Q-Net

In many real-world multi-robot tasks, high-quality solutions often require a team of robots to perform asynchronous actions under decentralized control. Decentralized multi-agent reinforcement learning methods have difficulty learning…

Robotics · Computer Science 2020-03-05 Yuchen Xiao , Joshua Hoffman , Tian Xia , Christopher Amato

QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

In many real-world settings, a team of agents must coordinate their behaviour while acting in a decentralised way. At the same time, it is often possible to train the agents in a centralised fashion in a simulated or laboratory setting,…

Machine Learning · Computer Science 2018-06-07 Tabish Rashid , Mikayel Samvelyan , Christian Schroeder de Witt , Gregory Farquhar , Jakob Foerster , Shimon Whiteson

Offline Decentralized Multi-Agent Reinforcement Learning

In many real-world multi-agent cooperative tasks, due to high cost and risk, agents cannot continuously interact with the environment and collect experiences during learning, but have to learn from offline datasets. However, the transition…

Machine Learning · Computer Science 2023-08-01 Jiechuan Jiang , Zongqing Lu

MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning

In the real world, many tasks require multiple agents to cooperate with each other under the condition of local observations. To solve such problems, many multi-agent reinforcement learning methods based on Centralized Training with…

Multiagent Systems · Computer Science 2021-06-23 Zhiwei Xu , Dapeng Li , Yunpeng Bai , Guoliang Fan

Sample Complexity of Average-Reward Q-Learning: From Single-agent to Federated Reinforcement Learning

Average-reward reinforcement learning offers a principled framework for long-term decision-making by maximizing the mean reward per time step. Although Q-learning is a widely used model-free algorithm with established sample complexity in…

Machine Learning · Statistics 2026-01-21 Yuchen Jiao , Jiin Woo , Gen Li , Gauri Joshi , Yuejie Chi

Model-based Reinforcement Learning for Decentralized Multiagent Rendezvous

Collaboration requires agents to align their goals on the fly. Underlying the human ability to align goals with other agents is their ability to predict the intentions of others and actively update their own plans. We propose hierarchical…

Multiagent Systems · Computer Science 2020-11-10 Rose E. Wang , J. Chase Kew , Dennis Lee , Tsang-Wei Edward Lee , Tingnan Zhang , Brian Ichter , Jie Tan , Aleksandra Faust

Multi-Pass Q-Networks for Deep Reinforcement Learning with Parameterised Action Spaces

Parameterised actions in reinforcement learning are composed of discrete actions with continuous action-parameters. This provides a framework for solving complex domains that require combining high-level actions with flexible control. The…

Machine Learning · Computer Science 2019-05-14 Craig J. Bester , Steven D. James , George D. Konidaris

Distributed Q-Learning for Dynamically Decoupled Systems

Control of large-scale networked systems often necessitates the availability of complex models for the interactions amongst the agents. However in many applications, building accurate models of agents or interactions amongst them might be…

Optimization and Control · Mathematics 2019-03-21 Siavash Alemzadeh , Mehran Mesbahi

Consensus Learning for Cooperative Multi-Agent Reinforcement Learning

Almost all multi-agent reinforcement learning algorithms without communication follow the principle of centralized training with decentralized execution. During centralized training, agents can be guided by the same signals, such as the…

Multiagent Systems · Computer Science 2022-12-08 Zhiwei Xu , Bin Zhang , Dapeng Li , Zeren Zhang , Guangchong Zhou , Hao Chen , Guoliang Fan