Related papers: A Policy Gradient Algorithm for Learning to Learn …

Multi-Agent Deep Reinforcement Learning with Adaptive Policies

We propose a novel approach to address one aspect of the non-stationarity problem in multi-agent reinforcement learning (RL), where the other agents may alter their policies due to environment changes during execution. This violates the…

Machine Learning · Computer Science 2019-12-03 Yixiang Wang, Feng Wu

A Survey of Learning in Multiagent Environments: Dealing with Non-Stationarity

The key challenge in multiagent learning is learning a best response to the behaviour of other agents, which may be non-stationary: if the other agents adapt their strategy as well, the learning target moves. Disparate streams of research…

Multiagent Systems · Computer Science 2019-03-13 Pablo Hernandez-Leal, Michael Kaisers, Tim Baarslag, Enrique Munoz de Cote

Influencing Long-Term Behavior in Multiagent Reinforcement Learning

The main challenge of multiagent reinforcement learning is the difficulty of learning useful policies in the presence of other simultaneously learning agents whose changing behaviors jointly affect the environment's transition and reward…

Machine Learning · Computer Science 2022-10-18 Dong-Ki Kim, Matthew Riemer, Miao Liu, Jakob N. Foerster, Michael Everett, Chuangchuang Sun, Gerald Tesauro, Jonathan P. How

Interaction-Aware Multi-Agent Reinforcement Learning for Mobile Agents with Individual Goals

In a multi-agent setting, the optimal policy of a single agent is largely dependent on the behavior of other agents. We investigate the problem of multi-agent reinforcement learning, focusing on decentralized learning in non-stationary…

Artificial Intelligence · Computer Science 2019-10-01 Anahita Mohseni-Kabir, David Isele, Kikuo Fujimura

Multi-agent cooperation through learning-aware policy gradients

Self-interested individuals often fail to cooperate, posing a fundamental challenge for multi-agent learning. How can we achieve cooperation among self-interested, independent learning agents? Promising recent work has shown that in certain…

Artificial Intelligence · Computer Science 2025-03-20 Alexander Meulemans, Seijin Kobayashi, Johannes von Oswald, Nino Scherrer, Eric Elmoznino, Blake Richards, Guillaume Lajoie, Blaise Agüera y Arcas, João Sacramento

Multi-agent Deep Reinforcement Learning with Extremely Noisy Observations

Multi-agent reinforcement learning systems aim to provide interacting agents with the ability to collaboratively learn and adapt to the behaviour of other agents. In many real-world applications, the agents can only acquire a partial view…

Machine Learning · Computer Science 2018-12-04 Ozsel Kilinc, Giovanni Montana

Dealing with Non-Stationarity in Multi-Agent Deep Reinforcement Learning

Recent developments in deep reinforcement learning are concerned with creating decision-making agents which can perform well in various complex domains. A particular approach which has received increasing attention is multi-agent…

Machine Learning · Computer Science 2019-06-13 Georgios Papoudakis, Filippos Christianos, Arrasy Rahman, Stefano V. Albrecht

Scalable Centralized Deep Multi-Agent Reinforcement Learning via Policy Gradients

In this paper, we explore using deep reinforcement learning for problems with multiple agents. Most existing methods for deep multi-agent reinforcement learning consider only a small number of agents. When the number of agents increases,…

Machine Learning · Computer Science 2018-05-24 Arbaaz Khan, Clark Zhang, Daniel D. Lee, Vijay Kumar, Alejandro Ribeiro

SocialGFs: Learning Social Gradient Fields for Multi-Agent Reinforcement Learning

Multi-agent systems (MAS) need to adaptively cope with dynamic environments, changing agent populations, and diverse tasks. However, most of the multi-agent systems cannot easily handle them, due to the complexity of the state and task…

Artificial Intelligence · Computer Science 2024-05-06 Qian Long, Fangwei Zhong, Mingdong Wu, Yizhou Wang, Song-Chun Zhu

Non-Stationary Policy Learning for Multi-Timescale Multi-Agent Reinforcement Learning

In multi-timescale multi-agent reinforcement learning (MARL), agents interact across different timescales. In general, policies for time-dependent behaviors, such as those induced by multiple timescales, are non-stationary. Learning…

Machine Learning · Computer Science 2023-07-19 Patrick Emami, Xiangyu Zhang, David Biagioni, Ahmed S. Zamzam

Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments

We explore deep reinforcement learning methods for multi-agent domains. We begin by analyzing the difficulty of traditional algorithms in the multi-agent case: Q-learning is challenged by an inherent non-stationarity of the environment,…

Machine Learning · Computer Science 2020-03-17 Ryan Lowe, Yi Wu, Aviv Tamar, Jean Harb, Pieter Abbeel, Igor Mordatch

Double Meta-Learning for Data Efficient Policy Optimization in Non-Stationary Environments

We are interested in learning models of non-stationary environments, which can be framed as a multi-task learning problem. Model-free reinforcement learning algorithms can achieve good asymptotic performance in multi-task learning at a cost…

Machine Learning · Computer Science 2020-11-24 Elahe Aghapour, Nora Ayanian

Attentional Policies for Cross-Context Multi-Agent Reinforcement Learning

Many potential applications of reinforcement learning in the real world involve interacting with other agents whose numbers vary over time. We propose new neural policy architectures for these multi-agent problems. In contrast to other…

Machine Learning · Computer Science 2019-06-03 Matthew A. Wright, Roberto Horowitz

Adaptive Incentive Design with Multi-Agent Meta-Gradient Reinforcement Learning

Critical sectors of human society are progressing toward the adoption of powerful artificial intelligence (AI) agents, which are trained individually on behalf of self-interested principals but deployed in a shared environment. Short of…

Multiagent Systems · Computer Science 2021-12-22 Jiachen Yang, Ethan Wang, Rakshit Trivedi, Tuo Zhao, Hongyuan Zha

On the dynamics of multi agent nonlinear filtering and learning

Multiagent systems aim to accomplish highly complex learning tasks through decentralised consensus seeking dynamics and their use has garnered a great deal of attention in the signal processing and computational intelligence societies. This…

Machine Learning · Statistics 2023-09-20 Sayed Pouria Talebi, Danilo Mandic

GenAI-based Multi-Agent Reinforcement Learning towards Distributed Agent Intelligence: A Generative-RL Agent Perspective

Multi-agent reinforcement learning faces fundamental challenges that conventional approaches have failed to overcome: exponentially growing joint action spaces, non-stationary environments where simultaneous learning creates moving targets,…

Artificial Intelligence · Computer Science 2025-07-15 Hang Wang, Junshan Zhang

Learning Policy Representations in Multiagent Systems

Modeling agent behavior is central to understanding the emergence of complex phenomena in multiagent systems. Prior work in agent modeling has largely been task-specific and driven by hand-engineering domain-specific prior knowledge. We…

Multiagent Systems · Computer Science 2018-08-02 Aditya Grover, Maruan Al-Shedivat, Jayesh K. Gupta, Yura Burda, Harrison Edwards

Influencing Towards Stable Multi-Agent Interactions

Learning in multi-agent environments is difficult due to the non-stationarity introduced by an opponent's or partner's changing behaviors. Instead of reactively adapting to the other agent's (opponent or partner) behavior, we propose an…

Robotics · Computer Science 2021-10-18 Woodrow Z. Wang, Andy Shih, Annie Xie, Dorsa Sadigh

Multi-Agent Generative Adversarial Imitation Learning

Imitation learning algorithms can be used to learn a policy from expert demonstrations without access to a reward signal. However, most existing approaches are not applicable in multi-agent settings due to the existence of multiple (Nash)…

Machine Learning · Computer Science 2018-07-27 Jiaming Song, Hongyu Ren, Dorsa Sadigh, Stefano Ermon

Multiagent Reinforcement Learning with Neighbor Action Estimation

Multiagent reinforcement learning, as a prominent intelligent paradigm, enables collaborative decision-making within complex systems. However, existing approaches often rely on explicit action exchange between agents to evaluate action…

Robotics · Computer Science 2026-01-09 Zhenglong Luo, Zhiyong Chen, Aoxiang Liu