English
Related papers

Related papers: Provably Efficient Cooperative Multi-Agent Reinfor…

200 papers

We study multi-agent reinforcement learning in the setting of episodic Markov decision processes, where multiple agents cooperate via communication through a central server. We propose a provably efficient algorithm based on value iteration…

Machine Learning · Computer Science 2023-06-27 Yifei Min , Jiafan He , Tianhao Wang , Quanquan Gu

This paper presents a novel approach to multi-agent reinforcement learning (RL) for linear systems with convex polytopic constraints. Existing work on RL has demonstrated the use of model predictive control (MPC) as a function approximator…

Systems and Control · Electrical Eng. & Systems 2025-01-06 Samuel Mallick , Filippo Airaldi , Azita Dabiri , Bart De Schutter

Reinforcement Learning (RL) is a learning paradigm concerned with learning to control a system so as to maximize an objective over the long term. This approach to learning has received immense interest in recent times and success manifests…

Artificial Intelligence · Computer Science 2018-07-26 Sanyam Kapoor

Multi-agent reinforcement learning is a promising research area that extends established reinforcement learning approaches to problems formulated as multi-agent systems. Recently, a multitude of communication methods have been introduced to…

Multiagent Systems · Computer Science 2026-01-21 Christoph Wittner

In this paper, we study the problem of robust cooperative multi-agent reinforcement learning (RL) where a large number of cooperative agents with distributed information aim to learn policies in the presence of \emph{stochastic} and…

Multiagent Systems · Computer Science 2025-06-16 Muhammad Aneeq uz Zaman , Mathieu Laurière , Alec Koppel , Tamer Başar

It has previously been shown that by using reinforcement learning (RL), agents can derive simple approximate and exact-restricted numeral systems that are similar to human ones (Carlsson, 2021). However, it is a major challenge to show how…

Computation and Language · Computer Science 2025-05-20 Andrea Silvi , Jonathan Thomas , Emil Carlsson , Devdatt Dubhashi , Moa Johansson

We study cooperative online learning in stochastic and adversarial Markov decision process (MDP). That is, in each episode, $m$ agents interact with an MDP simultaneously and share information in order to minimize their individual regret.…

Machine Learning · Computer Science 2022-09-02 Tal Lancewicki , Aviv Rosenberg , Yishay Mansour

Mixed cooperative-competitive control scenarios such as human-machine interaction with individual goals of the interacting partners are very challenging for reinforcement learning agents. In order to contribute towards intuitive…

Systems and Control · Electrical Eng. & Systems 2020-03-03 Florian Köpf , Alexander Nitsch , Michael Flad , Sören Hohmann

Deep Reinforcement Learning has made significant progress in multi-agent systems in recent years. In this review article, we have focused on presenting recent approaches on Multi-Agent Reinforcement Learning (MARL) algorithms. In…

Machine Learning · Computer Science 2021-05-03 Afshin OroojlooyJadid , Davood Hajinezhad

Collective human knowledge has clearly benefited from the fact that innovations by individuals are taught to others through communication. Similar to human social groups, agents in distributed learning systems would likely benefit from…

Constrained multi-agent reinforcement learning offers the framework to design scalable and almost surely feasible solutions for teams of agents operating in dynamic environments to carry out conflicting tasks. We address the challenges of…

Systems and Control · Electrical Eng. & Systems 2025-03-03 Leopoldo Agorio , Sean Van Alen , Santiago Paternain , Miguel Calvo-Fullana , Juan Andres Bazerque

In many real-world applications, it is hard to provide a reward signal in each step of a Reinforcement Learning (RL) process and more natural to give feedback when an episode ends. To this end, we study the recently proposed model of RL…

Machine Learning · Computer Science 2024-05-15 Asaf Cassel , Haipeng Luo , Aviv Rosenberg , Dmitry Sotnikov

Ad hoc teamwork problem describes situations where an agent has to cooperate with previously unseen agents to achieve a common goal. For an agent to be successful in these scenarios, it has to have a suitable cooperative skill. One could…

Artificial Intelligence · Computer Science 2022-10-21 Rujikorn Charakorn , Poramate Manoonpong , Nat Dilokthanakul

Recent work has demonstrated that problems-- particularly imitation learning and structured prediction-- where a learner's predictions influence the input-distribution it is tested on can be naturally addressed by an interactive approach…

Machine Learning · Computer Science 2014-06-24 Stephane Ross , J. Andrew Bagnell

We study the reinforcement learning (RL) problem in a constrained Markov decision process (CMDP), where an agent explores the environment to maximize the expected cumulative reward while satisfying a single constraint on the expected total…

This paper introduces a decentralized multi-agent reinforcement learning framework enabling structurally heterogeneous teams of agents to jointly discover and acquire randomly located targets in environments characterized by partial…

Robotics · Computer Science 2026-01-14 Gabriele Calzolari , Vidya Sumathy , Christoforos Kanellakis , George Nikolakopoulos

We consider the problem where $M$ agents interact with $M$ identical and independent environments with $S$ states and $A$ actions using reinforcement learning for $T$ rounds. The agents share their data with a central server to minimize…

Machine Learning · Computer Science 2021-02-23 Mridul Agarwal , Bhargav Ganguly , Vaneet Aggarwal

A challenge in reinforcement learning (RL) is minimizing the cost of sampling associated with exploration. Distributed exploration reduces sampling complexity in multi-agent RL (MARL). We investigate the benefits to performance in MARL when…

Machine Learning · Computer Science 2022-05-03 Justin Lidard , Udari Madhushani , Naomi Ehrich Leonard

We study the problem of federated stochastic multi-arm contextual bandits with unknown contexts, in which M agents are faced with different bandits and collaborate to learn. The communication model consists of a central server and the…

Machine Learning · Computer Science 2024-01-31 Jiabin Lin , Shana Moothedath

Following the pivotal success of learning strategies to win at tasks, solely by interacting with an environment without any supervision, agents have gained the ability to make sequential decisions in complex MDPs. Yet, reinforcement…

Machine Learning · Computer Science 2026-03-18 Ezgi Korkmaz
‹ Prev 1 2 3 10 Next ›