English
Related papers

Related papers: Asynchronous Actor-Critic for Multi-Agent Reinforc…

200 papers

Reinforcement learning in multi-agent scenarios is important for real-world applications but presents challenges beyond those seen in single-agent settings. We present an actor-critic algorithm that trains decentralized policies in…

Machine Learning · Computer Science 2019-05-29 Shariq Iqbal , Fei Sha

Recent multi-agent actor-critic methods have utilized centralized training with decentralized execution to address the non-stationarity of co-adapting agents. This training paradigm constrains learning to the centralized phase such that…

Multiagent Systems · Computer Science 2019-10-09 Kevin Corder , Manuel M. Vindiola , Keith Decker

We propose a conceptually simple and lightweight framework for deep reinforcement learning that uses asynchronous gradient descent for optimization of deep neural network controllers. We present asynchronous variants of four standard…

Centralized Training for Decentralized Execution, where agents are trained offline using centralized information but execute in a decentralized manner online, has gained popularity in the multi-agent reinforcement learning community. In…

Machine Learning · Computer Science 2021-12-06 Xueguang Lyu , Yuchen Xiao , Brett Daley , Christopher Amato

We explore deep reinforcement learning methods for multi-agent domains. We begin by analyzing the difficulty of traditional algorithms in the multi-agent case: Q-learning is challenged by an inherent non-stationarity of the environment,…

Machine Learning · Computer Science 2020-03-17 Ryan Lowe , Yi Wu , Aviv Tamar , Jean Harb , Pieter Abbeel , Igor Mordatch

In this paper we explore how actor-critic methods in deep reinforcement learning, in particular Asynchronous Advantage Actor-Critic (A3C), can be extended with agent modeling. Inspired by recent works on representation learning and…

Multiagent Systems · Computer Science 2019-07-24 Pablo Hernandez-Leal , Bilal Kartal , Matthew E. Taylor

Actor-critic (AC) methods are widely used in reinforcement learning (RL) and benefit from the flexibility of using any policy gradient method as the actor and value-based method as the critic. The critic is usually trained by minimizing the…

Machine Learning · Computer Science 2023-11-01 Sharan Vaswani , Amirreza Kazemi , Reza Babanezhad , Nicolas Le Roux

In this paper, we devise three actor-critic algorithms with decentralized training for multi-agent reinforcement learning in cooperative, adversarial, and mixed settings with continuous action spaces. To this goal, we adapt the MADDPG…

Machine Learning · Computer Science 2025-03-11 Diego Bolliger , Lorenz Zauter , Robert Ziegler

Reinforcement Learning (RL) techniques have drawn great attention in many challenging tasks, but their performance deteriorates dramatically when applied to real-world problems. Various methods, such as domain randomization, have been…

Machine Learning · Computer Science 2022-08-05 Wangyang Yue , Yuan Zhou , Xiaochuan Zhang , Yuchen Hua , Zhiyuan Wang , Guang Kou

We consider a multi-agent reinforcement learning problem where each agent seeks to maximize a shared reward while interacting with other agents, and they may or may not be able to communicate. Typically the agents do not have access to…

Multiagent Systems · Computer Science 2021-04-26 Alex Tong Lin , Mark J. Debord , Katia Estabridis , Gary Hewer , Guido Montufar , Stanley Osher

Learning in games has been widely used to solve many cooperative multi-agent problems such as coverage control, consensus, self-reconfiguration or vehicle-target assignment. One standard approach in this domain is to formulate the problem…

Systems and Control · Electrical Eng. & Systems 2022-09-07 Abbasali Koochakzadeh , Yasin Yazıcıoğlu

Many studies have applied reinforcement learning to train a dialog policy and show great promise these years. One common approach is to employ a user simulator to obtain a large number of simulated user experiences for reinforcement…

Computation and Language · Computer Science 2020-04-24 Ryuichi Takanobu , Runze Liang , Minlie Huang

In reinforcement learning for partially observable environments, many successful algorithms have been developed within the asymmetric learning paradigm. This paradigm leverages additional state information available at training time for…

Machine Learning · Computer Science 2025-09-09 Gaspard Lambrechts , Damien Ernst , Aditya Mahajan

In this work, we consider the problem of computing optimal actions for Reinforcement Learning (RL) agents in a co-operative setting, where the objective is to optimize a common goal. However, in many real-life applications, in addition to…

Artificial Intelligence · Computer Science 2021-01-08 P. Parnika , Raghuram Bharadwaj Diddigi , Sai Koti Reddy Danda , Shalabh Bhatnagar

This paper extends off-policy reinforcement learning to the multi-agent case in which a set of networked agents communicating with their neighbors according to a time-varying graph collaboratively evaluates and improves a target policy…

Machine Learning · Computer Science 2019-11-20 Wesley Suttle , Zhuoran Yang , Kaiqing Zhang , Zhaoran Wang , Tamer Basar , Ji Liu

In cooperative stochastic games multiple agents work towards learning joint optimal actions in an unknown environment to achieve a common goal. In many real-world applications, however, constraints are often imposed on the actions that can…

Multiagent Systems · Computer Science 2020-07-14 Raghuram Bharadwaj Diddigi , Sai Koti Reddy Danda , Prabuchandran K. J. , Shalabh Bhatnagar

In this paper, we propose a distributed off-policy actor critic method to solve multi-agent reinforcement learning problems. Specifically, we assume that all agents keep local estimates of the global optimal policy parameter and update…

Machine Learning · Computer Science 2019-03-25 Yan Zhang , Michael M. Zavlanos

Cooperative multi-agent problems often require coordination between agents, which can be achieved through a centralized policy that considers the global state. Multi-agent policy gradient (MAPG) methods are commonly used to learn such…

Robotics · Computer Science 2023-08-03 Xubo Lyu , Amin Banitalebi-Dehkordi , Mo Chen , Yong Zhang

Traditional methods plan feasible paths for multiple agents in the stochastic environment. However, the methods' iterations with the changes in the environment result in computation complexities, especially for the decentralized agents…

Robotics · Computer Science 2024-10-28 Qizhen Wu , Kexin Liu , Lei Chen , Jinhu Lü

Reinforcement learning (RL) algorithms can find an optimal policy for a single agent to accomplish a particular task. However, many real-world problems require multiple agents to collaborate in order to achieve a common goal. For example, a…

Machine Learning · Computer Science 2025-10-20 Jan Corazza , Hadi Partovi Aria , Hyohun Kim , Daniel Neider , Zhe Xu
‹ Prev 1 2 3 10 Next ›