Related papers: Macro-Action-Based Deep Multi-Agent Reinforcement …

Learning Multi-Robot Decentralized Macro-Action-Based Policies via a Centralized Q-Net

In many real-world multi-robot tasks, high-quality solutions often require a team of robots to perform asynchronous actions under decentralized control. Decentralized multi-agent reinforcement learning methods have difficulty learning…

Robotics · Computer Science 2020-03-05 Yuchen Xiao , Joshua Hoffman , Tian Xia , Christopher Amato

Macro-Action-Based Multi-Agent/Robot Deep Reinforcement Learning under Partial Observability

The state-of-the-art multi-agent reinforcement learning (MARL) methods have provided promising solutions to a variety of complex problems. Yet, these methods all assume that agents perform synchronized primitive-action executions so that…

Artificial Intelligence · Computer Science 2022-10-12 Yuchen Xiao

Deep Reinforcement Learning for Event-Driven Multi-Agent Decision Processes

The incorporation of macro-actions (temporally extended actions) into multi-agent decision problems has the potential to address the curse of dimensionality associated with such decision problems. Since macro-actions last for stochastic…

Artificial Intelligence · Computer Science 2019-05-30 Kunal Menda , Yi-Chun Chen , Justin Grana , James W. Bono , Brendan D. Tracey , Mykel J. Kochenderfer , David Wolpert

Multi-Pass Q-Networks for Deep Reinforcement Learning with Parameterised Action Spaces

Parameterised actions in reinforcement learning are composed of discrete actions with continuous action-parameters. This provides a framework for solving complex domains that require combining high-level actions with flexible control. The…

Machine Learning · Computer Science 2019-05-14 Craig J. Bester , Steven D. James , George D. Konidaris

Modular Multi-Objective Deep Reinforcement Learning with Decision Values

In this work we present a method for using Deep Q-Networks (DQNs) in multi-objective environments. Deep Q-Networks provide remarkable performance in single objective problems learning from high-level visual state representations. However,…

Artificial Intelligence · Computer Science 2018-02-26 Tomasz Tajmajer

On Improving Deep Reinforcement Learning for POMDPs

Deep Reinforcement Learning (RL) recently emerged as one of the most competitive approaches for learning in sequential decision making problems with fully observable environments, e.g., computer Go. However, very little work has been done…

Machine Learning · Computer Science 2018-05-09 Pengfei Zhu , Xin Li , Pascal Poupart , Guanghui Miao

On Improving Deep Reinforcement Learning for POMDPs

Deep Reinforcement Learning (RL) recently emerged as one of the most competitive approaches for learning in sequential decision making problems with fully observable environments, e.g., computer Go. However, very little work has been done…

Machine Learning · Computer Science 2018-05-25 Pengfei Zhu , Xin Li , Pascal Poupart , Guanghui Miao

Learning for Multi-robot Cooperation in Partially Observable Stochastic Environments with Macro-actions

This paper presents a data-driven approach for multi-robot coordination in partially-observable domains based on Decentralized Partially Observable Markov Decision Processes (Dec-POMDPs) and macro-actions (MAs). Dec-POMDPs provide a general…

Multiagent Systems · Computer Science 2017-08-21 Miao Liu , Kavinayan Sivakumar , Shayegan Omidshafiei , Christopher Amato , Jonathan P. How

Actor-Critic Deep Reinforcement Learning for Dynamic Multichannel Access

We consider the dynamic multichannel access problem, which can be formulated as a partially observable Markov decision process (POMDP). We first propose a model-free actor-critic deep reinforcement learning based framework to explore the…

Information Theory · Computer Science 2018-10-10 Chen Zhong , Ziyang Lu , M. Cenk Gursoy , Senem Velipasalar

Reinforcement Learning in Decentralized Stochastic Control Systems with Partial History Sharing

In this paper, we are interested in systems with multiple agents that wish to collaborate in order to accomplish a common task while a) agents have different information (decentralized information) and b) agents do not know the model of the…

Optimization and Control · Mathematics 2020-12-04 Jalal Arabneydi , Aditya Mahajan

Policy Gradient With Value Function Approximation For Collective Multiagent Planning

Decentralized (PO)MDPs provide an expressive framework for sequential decision making in a multiagent system. Given their computational complexity, recent research has focused on tractable yet practical subclasses of Dec-POMDPs. We address…

Artificial Intelligence · Computer Science 2018-04-11 Duc Thien Nguyen , Akshat Kumar , Hoong Chuin Lau

Deep Decentralized Reinforcement Learning for Cooperative Control

In order to collaborate efficiently with unknown partners in cooperative control settings, adaptation of the partners based on online experience is required. The rather general and widely applicable control setting, where each cooperation…

Multiagent Systems · Computer Science 2019-10-30 Florian Köpf , Samuel Tesfazgi , Michael Flad , Sören Hohmann

Centralized Model and Exploration Policy for Multi-Agent RL

Reinforcement learning (RL) in partially observable, fully cooperative multi-agent settings (Dec-POMDPs) can in principle be used to address many real-world challenges such as controlling a swarm of rescue robots or a team of quadcopters.…

Artificial Intelligence · Computer Science 2022-02-08 Qizhen Zhang , Chris Lu , Animesh Garg , Jakob Foerster

Partial Attention in Deep Reinforcement Learning for Safe Multi-Agent Control

Attention mechanisms excel at learning sequential patterns by discriminating data based on relevance and importance. This provides state-of-the-art performance in advanced generative artificial intelligence models. This paper applies this…

Systems and Control · Electrical Eng. & Systems 2026-03-24 Turki Bin Mohaya , Peter Seiler

Managing engineering systems with large state and action spaces through deep reinforcement learning

Decision-making for engineering systems can be efficiently formulated as a Markov Decision Process (MDP) or a Partially Observable MDP (POMDP). Typical MDP and POMDP solution procedures utilize offline knowledge about the environment and…

Systems and Control · Computer Science 2018-11-07 C. P. Andriotis , K. G. Papakonstantinou

Deep Multi-Agent Reinforcement Learning with Discrete-Continuous Hybrid Action Spaces

Deep Reinforcement Learning (DRL) has been applied to address a variety of cooperative multi-agent problems with either discrete action spaces or continuous action spaces. However, to the best of our knowledge, no previous work has ever…

Machine Learning · Computer Science 2019-06-04 Haotian Fu , Hongyao Tang , Jianye Hao , Zihan Lei , Yingfeng Chen , Changjie Fan

QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

In many real-world settings, a team of agents must coordinate their behaviour while acting in a decentralised way. At the same time, it is often possible to train the agents in a centralised fashion in a simulated or laboratory setting,…

Machine Learning · Computer Science 2018-06-07 Tabish Rashid , Mikayel Samvelyan , Christian Schroeder de Witt , Gregory Farquhar , Jakob Foerster , Shimon Whiteson

Decentralized Control of Partially Observable Markov Decision Processes using Belief Space Macro-actions

The focus of this paper is on solving multi-robot planning problems in continuous spaces with partial observability. Decentralized partially observable Markov decision processes (Dec-POMDPs) are general models for multi-robot coordination…

Multiagent Systems · Computer Science 2015-02-24 Shayegan Omidshafiei , Ali-akbar Agha-mohammadi , Christopher Amato , Jonathan P. How

Deep Q-Network Based Multi-agent Reinforcement Learning with Binary Action Agents

Deep Q-Network (DQN) based multi-agent systems (MAS) for reinforcement learning (RL) use various schemes where in the agents have to learn and communicate. The learning is however specific to each agent and communication may be…

Machine Learning · Computer Science 2020-08-11 Abdul Mueed Hafiz , Ghulam Mohiuddin Bhat

Multi-Agent Determinantal Q-Learning

Centralized training with decentralized execution has become an important paradigm in multi-agent learning. Though practical, current methods rely on restrictive assumptions to decompose the centralized value function across agents for…

Machine Learning · Computer Science 2020-06-11 Yaodong Yang , Ying Wen , Liheng Chen , Jun Wang , Kun Shao , David Mguni , Weinan Zhang