Related papers: A Regularized Opponent Model with Maximum Entropy …

Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts

This paper investigates the model-based methods in multi-agent reinforcement learning (MARL). We specify the dynamics sample complexity and the opponent sample complexity in MARL, and conduct a theoretic analysis of return discrepancy upper…

Machine Learning · Computer Science 2022-03-18 Weinan Zhang, Xihuai Wang, Jian Shen, Ming Zhou

Robust Opponent Modeling via Adversarial Ensemble Reinforcement Learning in Asymmetric Imperfect-Information Games

This paper presents an algorithmic framework for learning robust policies in asymmetric imperfect-information games, where the joint reward could depend on the uncertain opponent type (a private information known only to the opponent itself…

Artificial Intelligence · Computer Science 2020-03-05 Macheng Shen, Jonathan P. How

The Multi-Agent Reinforcement Learning in MalmÖ (MARLÖ) Competition

Learning in multi-agent scenarios is a fruitful research direction, but current approaches still show scalability problems in multiple games with general reward settings and different opponent types. The Multi-Agent Reinforcement Learning…

Artificial Intelligence · Computer Science 2025-04-14 Diego Perez-Liebana, Katja Hofmann, Sharada Prasanna Mohanty, Noboru Kuno, Andre Kramer, Sam Devlin, Raluca D. Gaina, Daniel Ionita

O-MAPL: Offline Multi-agent Preference Learning

Inferring reward functions from demonstrations is a key challenge in reinforcement learning (RL), particularly in multi-agent RL (MARL), where large joint state-action spaces and complex inter-agent interactions complicate the task. While…

Machine Learning · Computer Science 2025-02-03 The Viet Bui, Tien Mai, Hong Thanh Nguyen

Learning to Model Opponent Learning

Multi-Agent Reinforcement Learning (MARL) considers settings in which a set of coexisting agents interact with one another and their environment. The adaptation and learning of other agents induces non-stationarity in the environment…

Machine Learning · Computer Science 2020-06-09 Ian Davies, Zheng Tian, Jun Wang

RAMBO-RL: Robust Adversarial Model-Based Offline Reinforcement Learning

Offline reinforcement learning (RL) aims to find performant policies from logged data without further environment interaction. Model-based algorithms, which learn a model of the environment from the dataset and perform conservative policy…

Machine Learning · Computer Science 2022-10-12 Marc Rigter, Bruno Lacerda, Nick Hawes

Reinforcement Learning Agents in Colonel Blotto

Models and games are simplified representations of the world. There are many different kinds of models, all differing in complexity and which aspect of the world they allow us to further our understanding of. In this paper we focus on a…

Artificial Intelligence · Computer Science 2022-04-07 Joseph Christian G. Noel

Multi-Agent Reinforcement Learning and Real-Time Decision-Making in Robotic Soccer for Virtual Environments

The deployment of multi-agent systems in dynamic, adversarial environments like robotic soccer necessitates real-time decision-making, sophisticated cooperation, and scalable algorithms to avoid the curse of dimensionality. While…

Robotics · Computer Science 2025-12-04 Aya Taourirte, Md Sohag Mia

The Benefits of Power Regularization in Cooperative Reinforcement Learning

Cooperative Multi-Agent Reinforcement Learning (MARL) algorithms, trained only to optimize task reward, can lead to a concentration of power where the failure or adversarial intent of a single agent could decimate the reward of every agent…

Machine Learning · Computer Science 2024-06-18 Michelle Li, Michael Dennis

LERO: LLM-driven Evolutionary framework with Hybrid Rewards and Enhanced Observation for Multi-Agent Reinforcement Learning

Multi-agent reinforcement learning (MARL) faces two critical bottlenecks distinct from single-agent RL: credit assignment in cooperative tasks and partial observability of environmental states. We propose LERO, a framework integrating Large…

Machine Learning · Computer Science 2025-03-31 Yuan Wei, Xiaohan Shan, Jianmin Li

Robust Reinforcement Learning through Efficient Adversarial Herding

Although reinforcement learning (RL) is considered the gold standard for policy design, it may not always provide a robust solution in various scenarios. This can result in severe performance degradation when the environment is exposed to…

Machine Learning · Computer Science 2023-06-14 Juncheng Dong, Hao-Lun Hsu, Qitong Gao, Vahid Tarokh, Miroslav Pajic

Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization

Offline Multi-Agent Reinforcement Learning (MARL) is an emerging field that aims to learn optimal multi-agent policies from pre-collected datasets. Compared to single-agent case, multi-agent setting involves a large joint state-action space…

Artificial Intelligence · Computer Science 2024-12-19 Zongkai Liu, Qian Lin, Chao Yu, Xiawei Wu, Yile Liang, Donghui Li, Xuetao Ding

Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Stable Algorithms

Multi-Agent Reinforcement Learning (MARL) has shown promising results across several domains. Despite this promise, MARL policies often lack robustness and are therefore sensitive to small changes in their environment. This presents a…

Machine Learning · Computer Science 2023-10-18 Alexander Bukharin, Yan Li, Yue Yu, Qingru Zhang, Zhehui Chen, Simiao Zuo, Chao Zhang, Songan Zhang, Tuo Zhao

An Organizationally-Oriented Approach to Enhancing Explainability and Control in Multi-Agent Reinforcement Learning

Multi-Agent Reinforcement Learning can lead to the development of collaborative agent behaviors that show similarities with organizational concepts. Pushing forward this perspective, we introduce a novel framework that explicitly…

Artificial Intelligence · Computer Science 2025-04-01 Julien Soulé, Jean-Paul Jamont, Michel Occello, Louis-Marie Traonouez, Paul Théron

MARPO: A Reflective Policy Optimization for Multi Agent Reinforcement Learning

We propose Multi Agent Reflective Policy Optimization (MARPO) to alleviate the issue of sample inefficiency in multi agent reinforcement learning. MARPO consists of two key components: a reflection mechanism that leverages subsequent…

Multiagent Systems · Computer Science 2025-12-30 Cuiling Wu, Yaozhong Gan, Junliang Xing, Ying Fu

On Multi-objective Policy Optimization as a Tool for Reinforcement Learning: Case Studies in Offline RL and Finetuning

Many advances that have improved the robustness and efficiency of deep reinforcement learning (RL) algorithms can, in one way or another, be understood as introducing additional objectives or constraints in the policy optimization step.…

Machine Learning · Computer Science 2023-08-02 Abbas Abdolmaleki, Sandy H. Huang, Giulia Vezzani, Bobak Shahriari, Jost Tobias Springenberg, Shruti Mishra, Dhruva TB, Arunkumar Byravan, Konstantinos Bousmalis, Andras Gyorgy, Csaba Szepesvari, Raia Hadsell, Nicolas Heess, Martin Riedmiller

Optimistic Proximal Policy Optimization

Reinforcement Learning, a machine learning framework for training an autonomous agent based on rewards, has shown outstanding results in various domains. However, it is known that learning a good policy is difficult in a domain where…

Machine Learning · Computer Science 2019-06-27 Takahisa Imagawa, Takuya Hiraoka, Yoshimasa Tsuruoka

Meta-Learning for Multi-objective Reinforcement Learning

Multi-objective reinforcement learning (MORL) is the generalization of standard reinforcement learning (RL) approaches to solve sequential decision making problems that consist of several, possibly conflicting, objectives. Generally, in…

Artificial Intelligence · Computer Science 2019-10-08 Xi Chen, Ali Ghadirzadeh, Mårten Björkman, Patric Jensfelt

Multi-Objective Reinforcement Learning with Max-Min Criterion: A Game-Theoretic Approach

In this paper, we propose a provably convergent and practical framework for multi-objective reinforcement learning with max-min criterion. From a game-theoretic perspective, we reformulate max-min multi-objective reinforcement learning as a…

Machine Learning · Computer Science 2025-10-24 Woohyeon Byeon, Giseung Park, Jongseong Chae, Amir Leshem, Youngchul Sung

Maximum Entropy Model-based Reinforcement Learning

Recent advances in reinforcement learning have demonstrated its ability to solve hard agent-environment interaction tasks on a super-human level. However, the application of reinforcement learning methods to practical and real-world tasks…

Artificial Intelligence · Computer Science 2021-12-03 Oleg Svidchenko, Aleksei Shpilman