Related papers: Multiagent Soft Q-Learning

Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments

We explore deep reinforcement learning methods for multi-agent domains. We begin by analyzing the difficulty of traditional algorithms in the multi-agent case: Q-learning is challenged by an inherent non-stationarity of the environment,…

Machine Learning · Computer Science 2020-03-17 Ryan Lowe , Yi Wu , Aviv Tamar , Jean Harb , Pieter Abbeel , Igor Mordatch

Consolidation via Policy Information Regularization in Deep RL for Multi-Agent Games

This paper introduces an information-theoretic constraint on learned policy complexity in the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) reinforcement learning algorithm. Previous research with a related approach in continuous…

Artificial Intelligence · Computer Science 2025-05-16 Tailia Malloy , Tim Klinger , Miao Liu , Matthew Riemer , Gerald Tesauro , Chris R. Sims

Safe Deep Reinforcement Learning for Multi-Agent Systems with Continuous Action Spaces

Multi-agent control problems constitute an interesting area of application for deep reinforcement learning models with continuous action spaces. Such real-world applications, however, typically come with critical safety constraints that…

Machine Learning · Computer Science 2021-08-12 Ziyad Sheebaelhamd , Konstantinos Zisis , Athina Nisioti , Dimitris Gkouletsos , Dario Pavllo , Jonas Kohler

A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning

A fundamental challenge in multiagent reinforcement learning is to learn beneficial behaviors in a shared environment with other simultaneously learning agents. In particular, each agent perceives the environment as effectively…

Machine Learning · Computer Science 2021-06-15 Dong-Ki Kim , Miao Liu , Matthew Riemer , Chuangchuang Sun , Marwa Abdulhai , Golnaz Habibi , Sebastian Lopez-Cot , Gerald Tesauro , Jonathan P. How

Reinforcement Learning Agent for a 2D Shooter Game

Reinforcement learning agents in complex game environments often suffer from sparse rewards, training instability, and poor sample efficiency. This paper presents a hybrid training approach that combines offline imitation learning with…

Machine Learning · Computer Science 2025-09-19 Thomas Ackermann , Moritz Spang , Hamza A. A. Gardi

Multi-agent Deep Reinforcement Learning with Extremely Noisy Observations

Multi-agent reinforcement learning systems aim to provide interacting agents with the ability to collaboratively learn and adapt to the behaviour of other agents. In many real-world applications, the agents can only acquire a partial view…

Machine Learning · Computer Science 2018-12-04 Ozsel Kilinc , Giovanni Montana

Scalable Centralized Deep Multi-Agent Reinforcement Learning via Policy Gradients

In this paper, we explore using deep reinforcement learning for problems with multiple agents. Most existing methods for deep multi-agent reinforcement learning consider only a small number of agents. When the number of agents increases,…

Machine Learning · Computer Science 2018-05-24 Arbaaz Khan , Clark Zhang , Daniel D. Lee , Vijay Kumar , Alejandro Ribeiro

A Behavior-based Approach for Multi-agent Q-learning for Autonomous Exploration

The use of mobile robots is being popular over the world mainly for autonomous explorations in hazardous/ toxic or unknown environments. This exploration will be more effective and efficient if the explorations in unknown environment can be…

Robotics · Computer Science 2011-10-11 Dip Narayan Ray , Somajyoti Majumder , Sumit Mukhopadhyay

Multiagent Cooperation and Competition with Deep Reinforcement Learning

Multiagent systems appear in most social, economical, and political situations. In the present work we extend the Deep Q-Learning Network architecture proposed by Google DeepMind to multiagent environments and investigate how two agents…

Artificial Intelligence · Computer Science 2015-11-30 Ardi Tampuu , Tambet Matiisen , Dorian Kodelja , Ilya Kuzovkin , Kristjan Korjus , Juhan Aru , Jaan Aru , Raul Vicente

Q-Mixing Network for Multi-Agent Pathfinding in Partially Observable Grid Environments

In this paper, we consider the problem of multi-agent navigation in partially observable grid environments. This problem is challenging for centralized planning approaches as they, typically, rely on the full knowledge of the environment.…

Machine Learning · Computer Science 2021-08-16 Vasilii Davydov , Alexey Skrynnik , Konstantin Yakovlev , Aleksandr I. Panov

Mitigating Relative Over-Generalization in Multi-Agent Reinforcement Learning

In decentralized multi-agent reinforcement learning, agents learning in isolation can lead to relative over-generalization (RO), where optimal joint actions are undervalued in favor of suboptimal ones. This hinders effective coordination in…

Machine Learning · Computer Science 2024-11-19 Ting Zhu , Yue Jin , Jeremie Houssineau , Giovanni Montana

$QD$-Learning: A Collaborative Distributed Strategy for Multi-Agent Reinforcement Learning Through Consensus + Innovations

The paper considers a class of multi-agent Markov decision processes (MDPs), in which the network agents respond differently (as manifested by the instantaneous one-stage random costs) to a global controlled state and the control actions of…

Machine Learning · Statistics 2015-06-04 Soummya Kar , Jose' M. F. Moura , H. Vincent Poor

Factorized Q-Learning for Large-Scale Multi-Agent Systems

Deep Q-learning has achieved significant success in single-agent decision making tasks. However, it is challenging to extend Q-learning to large-scale multi-agent scenarios, due to the explosion of action space resulting from the complex…

Multiagent Systems · Computer Science 2019-10-14 Ming Zhou , Yong Chen , Ying Wen , Yaodong Yang , Yufeng Su , Weinan Zhang , Dell Zhang , Jun Wang

Beyond Strict Competition: Approximate Convergence of Multi Agent Q-Learning Dynamics

The behaviour of multi-agent learning in competitive settings is often considered under the restrictive assumption of a zero-sum game. Only under this strict requirement is the behaviour of learning well understood; beyond this, learning…

Computer Science and Game Theory · Computer Science 2023-07-27 Aamal Hussain , Francesco Belardinelli , Georgios Piliouras

Multi-agent Collaboration for Feasible Collaborative Behavior Construction and Evaluation

In the case of the two-person zero-sum stochastic game with a central controller, this paper proposes a best collaborative behavior search and selection algorithm based on reinforcement learning, in response to how to choose the best…

Robotics · Computer Science 2019-10-01 Yunkai Wang , Shenhan Jia , Zexi Chen , Zheyuan Huang , Rong Xiong

Attentional Policies for Cross-Context Multi-Agent Reinforcement Learning

Many potential applications of reinforcement learning in the real world involve interacting with other agents whose numbers vary over time. We propose new neural policy architectures for these multi-agent problems. In contrast to other…

Machine Learning · Computer Science 2019-06-03 Matthew A. Wright , Roberto Horowitz

Multi-Agent Reinforcement Learning for Problems with Combined Individual and Team Reward

Many cooperative multi-agent problems require agents to learn individual tasks while contributing to the collective success of the group. This is a challenging task for current state-of-the-art multi-agent reinforcement algorithms that are…

Multiagent Systems · Computer Science 2020-03-25 Hassam Ullah Sheikh , Ladislau Bölöni

A further exploration of deep Multi-Agent Reinforcement Learning with Hybrid Action Space

The research of extending deep reinforcement learning (drl) to multi-agent field has solved many complicated problems and made great achievements. However, almost all these studies only focus on discrete or continuous action space and there…

Machine Learning · Computer Science 2022-09-01 Hongzhi Hua , Guixuan Wen , Kaigui Wu

Difference Rewards Policy Gradients

Policy gradient methods have become one of the most popular classes of algorithms for multi-agent reinforcement learning. A key challenge, however, that is not addressed by many of these methods is multi-agent credit assignment: assessing…

Multiagent Systems · Computer Science 2024-12-20 Jacopo Castellini , Sam Devlin , Frans A. Oliehoek , Rahul Savani

Balancing Two-Player Stochastic Games with Soft Q-Learning

Within the context of video games the notion of perfectly rational agents can be undesirable as it leads to uninteresting situations, where humans face tough adversarial decision makers. Current frameworks for stochastic games and…

Artificial Intelligence · Computer Science 2019-01-09 Jordi Grau-Moya , Felix Leibfried , Haitham Bou-Ammar