Related papers: Cooperative Multi-Agent Reinforcement Learning: As…

A Communication-Efficient Decentralized Actor-Critic Algorithm

In this paper, we study the problem of reinforcement learning in multi-agent systems where communication among agents is limited. We develop a decentralized actor-critic learning framework in which each agent performs several local updates…

Machine Learning · Computer Science 2025-10-23 Xiaoxing Ren, Nicola Bastianello, Thomas Parisini, Andreas A. Malikopoulos

Provably Efficient Cooperative Multi-Agent Reinforcement Learning with Function Approximation

Reinforcement learning in cooperative multi-agent settings has recently advanced significantly in its scope, with applications in cooperative estimation for advertising, dynamic treatment regimes, distributed control, and federated…

Machine Learning · Computer Science 2021-03-30 Abhimanyu Dubey, Alex Pentland

Communication Efficient Parallel Reinforcement Learning

We consider the problem where $M$ agents interact with $M$ identical and independent environments with $S$ states and $A$ actions using reinforcement learning for $T$ rounds. The agents share their data with a central server to minimize…

Machine Learning · Computer Science 2021-02-23 Mridul Agarwal, Bhargav Ganguly, Vaneet Aggarwal

Improving Coordination in Small-Scale Multi-Agent Deep Reinforcement Learning through Memory-driven Communication

Deep reinforcement learning algorithms have recently been used to train multiple interacting agents in a centralised manner whilst keeping their execution decentralised. When the agents can only acquire partial observations and are faced…

Machine Learning · Computer Science 2020-01-27 Emanuele Pesce, Giovanni Montana

HAMMER: Multi-Level Coordination of Reinforcement Learning Agents via Learned Messaging

Cooperative multi-agent reinforcement learning (MARL) has achieved significant results, most notably by leveraging the representation-learning abilities of deep neural networks. However, large centralized approaches quickly become…

Multiagent Systems · Computer Science 2022-12-05 Nikunj Gupta, G Srinivasaraghavan, Swarup Kumar Mohalik, Nishant Kumar, Matthew E. Taylor

Learning to Communicate in Multi-Agent Reinforcement Learning: A Review

We consider the issue of multiple agents learning to communicate through reinforcement learning within partially observable environments, with a focus on information asymmetry in the second part of our work. We provide a review of the…

Machine Learning · Computer Science 2019-11-14 Mohamed Salah Zaïem, Etienne Bennequin

Distributed Value Function Approximation for Collaborative Multi-Agent Reinforcement Learning

In this paper we propose several novel distributed gradient-based temporal difference algorithms for multi-agent off-policy learning of linear approximation of the value function in Markov decision processes with strict information…

Machine Learning · Computer Science 2021-04-20 Milos S. Stankovic, Marko Beko, Srdjan S. Stankovic

Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents

We consider the problem of fully decentralized multi-agent reinforcement learning (MARL), where the agents are located at the nodes of a time-varying communication network. Specifically, we assume that the reward functions of the…

Machine Learning · Computer Science 2018-02-28 Kaiqing Zhang, Zhuoran Yang, Han Liu, Tong Zhang, Tamer Başar

Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes

We study reinforcement learning (RL) with linear function approximation. For episodic time-inhomogeneous linear Markov decision processes (linear MDPs) whose transition probability can be parameterized as a linear function of a given…

Machine Learning · Computer Science 2023-11-07 Jiafan He, Heyang Zhao, Dongruo Zhou, Quanquan Gu

A Multi-Agent Off-Policy Actor-Critic Algorithm for Distributed Reinforcement Learning

This paper extends off-policy reinforcement learning to the multi-agent case in which a set of networked agents communicating with their neighbors according to a time-varying graph collaboratively evaluates and improves a target policy…

Machine Learning · Computer Science 2019-11-20 Wesley Suttle, Zhuoran Yang, Kaiqing Zhang, Zhaoran Wang, Tamer Basar, Ji Liu

Federated Temporal Difference Learning with Linear Function Approximation under Environmental Heterogeneity

We initiate the study of federated reinforcement learning under environmental heterogeneity by considering a policy evaluation problem. Our setup involves $N$ agents interacting with environments that share the same state and action space…

Machine Learning · Computer Science 2024-07-02 Han Wang, Aritra Mitra, Hamed Hassani, George J. Pappas, James Anderson

Linear-Quadratic Mean-Field Reinforcement Learning: Convergence of Policy Gradient Methods

We investigate reinforcement learning in the setting of Markov decision processes for a large number of exchangeable agents interacting in a mean field manner. Applications include, for example, the control of a large number of robots…

Optimization and Control · Mathematics 2025-04-30 René Carmona, Mathieu Laurière, Zongjun Tan

Finite-Time Performance of Distributed Temporal Difference Learning with Linear Function Approximation

We study the policy evaluation problem in multi-agent reinforcement learning, modeled by a Markov decision process. In this problem, the agents operate in a common environment under a fixed control policy, working together to discover the…

Optimization and Control · Mathematics 2020-01-13 Thinh T. Doan, Siva Theja Maguluri, Justin Romberg

Federated Reinforcement Learning at the Edge

Modern cyber-physical architectures use data collected from systems at different physical locations to learn appropriate behaviors and adapt to uncertain environments. However, an important challenge arises as communication exchanges at the…

Machine Learning · Computer Science 2021-12-14 Konstantinos Gatsis

Optimization for Reinforcement Learning: From Single Agent to Cooperative Agents

This article reviews recent advances in multi-agent reinforcement learning algorithms for large-scale control systems and communication networks, which learn to communicate and cooperate. We provide an overview of this emerging field, with…

Machine Learning · Computer Science 2020-06-24 Donghwan Lee, Niao He, Parameswaran Kamalaruban, Volkan Cevher

Learning Approximate Nash Equilibria in Cooperative Multi-Agent Reinforcement Learning via Mean-Field Subsampling

Many large-scale platforms and networked control systems have a centralized decision maker interacting with a massive population of agents under strict observability constraints. Motivated by such applications, we study a cooperative Markov…

Multiagent Systems · Computer Science 2026-05-12 Emile Anand, Ishani Karmarkar

Enabling Multi-Robot Collaboration from Single-Human Guidance

Learning collaborative behaviors is essential for multi-agent systems. Traditionally, multi-agent reinforcement learning solves this implicitly through a joint reward and centralized observations, assuming collaborative behavior will…

Robotics · Computer Science 2025-02-27 Zhengran Ji, Lingyu Zhang, Paul Sajda, Boyuan Chen

Intent-aware Multi-agent Reinforcement Learning

This paper proposes an intent-aware multi-agent planning framework as well as a learning algorithm. Under this framework, an agent plans in the goal space to maximize the expected utility. The planning process takes the belief of other…

Artificial Intelligence · Computer Science 2018-03-07 Siyuan Qi, Song-Chun Zhu

Learning Practical Communication Strategies in Cooperative Multi-Agent Reinforcement Learning

In Multi-Agent Reinforcement Learning, communication is critical to encourage cooperation among agents. Communication in realistic wireless networks can be highly unreliable due to network conditions varying with agents' mobility, and…

Artificial Intelligence · Computer Science 2022-09-16 Diyi Hu, Chi Zhang, Viktor Prasanna, Bhaskar Krishnamachari

Cooperative Multi-Agent Assignment over Stochastic Graphs via Constrained Reinforcement Learning

Constrained multi-agent reinforcement learning offers the framework to design scalable and almost surely feasible solutions for teams of agents operating in dynamic environments to carry out conflicting tasks. We address the challenges of…

Systems and Control · Electrical Eng. & Systems 2025-03-03 Leopoldo Agorio, Sean Van Alen, Santiago Paternain, Miguel Calvo-Fullana, Juan Andres Bazerque