Related papers: Provably Efficient Cooperative Multi-Agent Reinfor…

Cooperative Multi-Agent Reinforcement Learning: Asynchronous Communication and Linear Function Approximation

We study multi-agent reinforcement learning in the setting of episodic Markov decision processes, where multiple agents cooperate via communication through a central server. We propose a provably efficient algorithm based on value iteration…

Machine Learning · Computer Science 2023-06-27 Yifei Min, Jiafan He, Tianhao Wang, Quanquan Gu

Multi-Agent Reinforcement Learning via Distributed MPC as a Function Approximator

This paper presents a novel approach to multi-agent reinforcement learning (RL) for linear systems with convex polytopic constraints. Existing work on RL has demonstrated the use of model predictive control (MPC) as a function approximator…

Systems and Control · Electrical Eng. & Systems 2025-01-06 Samuel Mallick, Filippo Airaldi, Azita Dabiri, Bart De Schutter

Multi-Agent Reinforcement Learning: A Report on Challenges and Approaches

Reinforcement Learning (RL) is a learning paradigm concerned with learning to control a system so as to maximize an objective over the long term. This approach to learning has received immense interest in recent times and success manifests…

Artificial Intelligence · Computer Science 2018-07-26 Sanyam Kapoor

Communication Methods in Multi-Agent Reinforcement Learning

Multi-agent reinforcement learning is a promising research area that extends established reinforcement learning approaches to problems formulated as multi-agent systems. Recently, a multitude of communication methods have been introduced to…

Multiagent Systems · Computer Science 2026-01-21 Christoph Wittner

Robust Cooperative Multi-Agent Reinforcement Learning: A Mean-Field Type Game Perspective

In this paper, we study the problem of robust cooperative multi-agent reinforcement learning (RL) where a large number of cooperative agents with distributed information aim to learn policies in the presence of stochastic and…

Multiagent Systems · Computer Science 2025-06-16 Muhammad Aneeq uz Zaman, Mathieu Laurière, Alec Koppel, Tamer Başar

Learning Efficient Recursive Numeral Systems via Reinforcement Learning

It has previously been shown that by using reinforcement learning (RL), agents can derive simple approximate and exact-restricted numeral systems that are similar to human ones (Carlsson, 2021). However, it is a major challenge to show how…

Computation and Language · Computer Science 2025-05-20 Andrea Silvi, Jonathan Thomas, Emil Carlsson, Devdatt Dubhashi, Moa Johansson

Cooperative Online Learning in Stochastic and Adversarial MDPs

We study cooperative online learning in stochastic and adversarial Markov decision processes (MDPs). That is, in each episode, $m$ agents interact with an MDP simultaneously and share information in order to minimize their individual regret.…

Machine Learning · Computer Science 2022-09-02 Tal Lancewicki, Aviv Rosenberg, Yishay Mansour

Partner Approximating Learners (PAL): Simulation-Accelerated Learning with Explicit Partner Modeling in Multi-Agent Domains

Mixed cooperative-competitive control scenarios such as human-machine interaction with individual goals of the interacting partners are very challenging for reinforcement learning agents. In order to contribute towards intuitive…

Systems and Control · Electrical Eng. & Systems 2020-03-03 Florian Köpf, Alexander Nitsch, Michael Flad, Sören Hohmann

A Review of Cooperative Multi-Agent Deep Reinforcement Learning

Deep Reinforcement Learning has made significant progress in multi-agent systems in recent years. In this review article, we have focused on presenting recent approaches on Multi-Agent Reinforcement Learning (MARL) algorithms. In…

Machine Learning · Computer Science 2021-05-03 Afshin OroojlooyJadid, Davood Hajinezhad

Learning to Teach in Cooperative Multiagent Reinforcement Learning

Collective human knowledge has clearly benefited from the fact that innovations by individuals are taught to others through communication. Similar to human social groups, agents in distributed learning systems would likely benefit from…

Multiagent Systems · Computer Science 2018-09-05 Shayegan Omidshafiei, Dong-Ki Kim, Miao Liu, Gerald Tesauro, Matthew Riemer, Christopher Amato, Murray Campbell, Jonathan P. How

Cooperative Multi-Agent Assignment over Stochastic Graphs via Constrained Reinforcement Learning

Constrained multi-agent reinforcement learning offers the framework to design scalable and almost surely feasible solutions for teams of agents operating in dynamic environments to carry out conflicting tasks. We address the challenges of…

Systems and Control · Electrical Eng. & Systems 2025-03-03 Leopoldo Agorio, Sean Van Alen, Santiago Paternain, Miguel Calvo-Fullana, Juan Andres Bazerque

Near-Optimal Regret in Linear MDPs with Aggregate Bandit Feedback

In many real-world applications, it is hard to provide a reward signal in each step of a Reinforcement Learning (RL) process and more natural to give feedback when an episode ends. To this end, we study the recently proposed model of RL…

Machine Learning · Computer Science 2024-05-15 Asaf Cassel, Haipeng Luo, Aviv Rosenberg, Dmitry Sotnikov

Learning to Cooperate with Unseen Agent via Meta-Reinforcement Learning

The ad hoc teamwork problem describes situations where an agent has to cooperate with previously unseen agents to achieve a common goal. For an agent to be successful in these scenarios, it has to have a suitable cooperative skill. One could…

Artificial Intelligence · Computer Science 2022-10-21 Rujikorn Charakorn, Poramate Manoonpong, Nat Dilokthanakul

Reinforcement and Imitation Learning via Interactive No-Regret Learning

Recent work has demonstrated that problems, particularly imitation learning and structured prediction, where a learner's predictions influence the input distribution it is tested on can be naturally addressed by an interactive approach…

Machine Learning · Computer Science 2014-06-24 Stephane Ross, J. Andrew Bagnell

Provably Efficient RL under Episode-Wise Safety in Constrained MDPs with Linear Function Approximation

We study the reinforcement learning (RL) problem in a constrained Markov decision process (CMDP), where an agent explores the environment to maximize the expected cumulative reward while satisfying a single constraint on the expected total…

Machine Learning · Computer Science 2026-01-29 Toshinori Kitamura, Arnob Ghosh, Tadashi Kozuno, Wataru Kumagai, Kazumi Kasaura, Kenta Hoshino, Yohei Hosoe, Yutaka Matsuo

Safe Heterogeneous Multi-Agent RL with Communication Regularization for Coordinated Target Acquisition

This paper introduces a decentralized multi-agent reinforcement learning framework enabling structurally heterogeneous teams of agents to jointly discover and acquire randomly located targets in environments characterized by partial…

Robotics · Computer Science 2026-01-14 Gabriele Calzolari, Vidya Sumathy, Christoforos Kanellakis, George Nikolakopoulos

Communication Efficient Parallel Reinforcement Learning

We consider the problem where $M$ agents interact with $M$ identical and independent environments with $S$ states and $A$ actions using reinforcement learning for $T$ rounds. The agents share their data with a central server to minimize…

Machine Learning · Computer Science 2021-02-23 Mridul Agarwal, Bhargav Ganguly, Vaneet Aggarwal

Provably Efficient Multi-Agent Reinforcement Learning with Fully Decentralized Communication

A challenge in reinforcement learning (RL) is minimizing the cost of sampling associated with exploration. Distributed exploration reduces sampling complexity in multi-agent RL (MARL). We investigate the benefits to performance in MARL when…

Machine Learning · Computer Science 2022-05-03 Justin Lidard, Udari Madhushani, Naomi Ehrich Leonard

Federated Learning for Heterogeneous Bandits with Unobserved Contexts

We study the problem of federated stochastic multi-arm contextual bandits with unknown contexts, in which $M$ agents are faced with different bandits and collaborate to learn. The communication model consists of a central server and the…

Machine Learning · Computer Science 2024-01-31 Jiabin Lin, Shana Moothedath

Counteractive RL: Rethinking Core Principles for Efficient and Scalable Deep Reinforcement Learning

Following the pivotal success of learning strategies to win at tasks, solely by interacting with an environment without any supervision, agents have gained the ability to make sequential decisions in complex MDPs. Yet, reinforcement…

Machine Learning · Computer Science 2026-03-18 Ezgi Korkmaz