Related papers: Preference-Based Multi-Agent Reinforcement Learnin…

Preference-Guided Learning for Sparse-Reward Multi-Agent Reinforcement Learning

We study the problem of online multi-agent reinforcement learning (MARL) in environments with sparse rewards, where reward feedback is not provided at each interaction but only revealed at the end of a trajectory. This setting, though…

Machine Learning · Computer Science 2025-09-29 The Viet Bui, Tien Mai, Hong Thanh Nguyen

Sample-Efficient Reinforcement Learning of Partially Observable Markov Games

This paper considers the challenging tasks of Multi-Agent Reinforcement Learning (MARL) under partial observability, where each agent only sees her own individual observations and actions that reveal incomplete information about the…

Machine Learning · Computer Science 2022-10-18 Qinghua Liu, Csaba Szepesvári, Chi Jin

Game Theory and Multi-Agent Reinforcement Learning: From Nash Equilibria to Evolutionary Dynamics

This paper explores advanced topics in complex multi-agent systems building upon our previous work. We examine four fundamental challenges in Multi-Agent Reinforcement Learning (MARL): non-stationarity, partial observability, scalability…

Multiagent Systems · Computer Science 2024-12-31 Neil De La Fuente, Miquel Noguer i Alonso, Guim Casadellà

A Multiagent Reinforcement Learning Algorithm with Non-linear Dynamics

Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents' decisions. Due to the complexity of the problem, the majority of the previously developed MARL algorithms assumed agents either had some…

Machine Learning · Computer Science 2014-01-16 Sherief Abdallah, Victor Lesser

Nash Q-Network for Multi-Agent Cybersecurity Simulation

Cybersecurity defense involves interactions between adversarial parties (namely defenders and hackers), making multi-agent reinforcement learning (MARL) an ideal approach for modeling and learning strategies for these scenarios. This paper…

Multiagent Systems · Computer Science 2025-09-03 Qintong Xie, Edward Koh, Xavier Cadet, Peter Chin

Game-Theoretic Understandings of Multi-Agent Systems with Multiple Objectives

In practical multi-agent systems, agents often have diverse objectives, which makes the system more complex, as each agent's performance across multiple criteria depends on the joint actions of all agents, creating intricate strategic…

Multiagent Systems · Computer Science 2025-09-30 Yue Wang

Query-Policy Misalignment in Preference-Based Reinforcement Learning

Preference-based reinforcement learning (PbRL) provides a natural way to align RL agents' behavior with human desired outcomes, but is often restrained by costly human feedback. To improve feedback efficiency, most existing PbRL methods…

Machine Learning · Computer Science 2024-07-08 Xiao Hu, Jianxiong Li, Xianyuan Zhan, Qing-Shan Jia, Ya-Qin Zhang

Preference-based Reinforcement Learning with Finite-Time Guarantees

Preference-based Reinforcement Learning (PbRL) replaces reward values in traditional reinforcement learning by preferences to better elicit human opinion on the target objective, especially when numerical reward values are hard to design or…

Machine Learning · Computer Science 2020-10-27 Yichong Xu, Ruosong Wang, Lin F. Yang, Aarti Singh, Artur Dubrawski

On the Complexity of Multi-Agent Decision Making: From Learning in Games to Partial Monitoring

A central problem in the theory of multi-agent reinforcement learning (MARL) is to understand what structural conditions and algorithmic principles lead to sample-efficient learning guarantees, and how these considerations change as we move…

Machine Learning · Computer Science 2023-05-02 Dylan J. Foster, Dean P. Foster, Noah Golowich, Alexander Rakhlin

O-MAPL: Offline Multi-agent Preference Learning

Inferring reward functions from demonstrations is a key challenge in reinforcement learning (RL), particularly in multi-agent RL (MARL), where large joint state-action spaces and complex inter-agent interactions complicate the task. While…

Machine Learning · Computer Science 2025-02-03 The Viet Bui, Tien Mai, Hong Thanh Nguyen

Towards Multi-Agent Reinforcement Learning using Quantum Boltzmann Machines

Reinforcement learning has driven impressive advances in machine learning. Simultaneously, quantum-enhanced machine learning algorithms using quantum annealing underlie heavy developments. Recently, a multi-agent reinforcement learning…

Artificial Intelligence · Computer Science 2021-11-23 Tobias Müller, Christoph Roch, Kyrill Schmid, Philipp Altmann

Learning Closed-Loop Parametric Nash Equilibria of Multi-Agent Collaborative Field Coverage

Multi-agent reinforcement learning is a challenging and active field of research due to the inherent nonstationary property and coupling between agents. A popular approach to modeling the multi-agent interactions underlying the multi-agent…

Multiagent Systems · Computer Science 2025-10-07 Jushan Chen, Santiago Paternain

Equilibria for Games with Combined Qualitative and Quantitative Objectives

The overall aim of our research is to develop techniques to reason about the equilibrium properties of multi-agent systems. We model multi-agent systems as concurrent games, in which each player is a process that is assumed to act…

Logic in Computer Science · Computer Science 2020-08-14 Julian Gutierrez, Aniello Murano, Giuseppe Perelli, Sasha Rubin, Thomas Steeples, Michael Wooldridge

A Generalized Algorithm for Multi-Objective Reinforcement Learning and Policy Adaptation

We introduce a new algorithm for multi-objective reinforcement learning (MORL) with linear preferences, with the goal of enabling few-shot adaptation to new tasks. In MORL, the aim is to learn policies over multiple competing objectives…

Machine Learning · Computer Science 2019-11-07 Runzhe Yang, Xingyuan Sun, Karthik Narasimhan

Prioritized League Reinforcement Learning for Large-Scale Heterogeneous Multiagent Systems

Large-scale heterogeneous multiagent systems feature various realistic factors in the real world, such as agents with diverse abilities and overall system cost. In comparison to homogeneous systems, heterogeneous systems offer significant…

Artificial Intelligence · Computer Science 2024-03-28 Qingxu Fu, Zhiqiang Pu, Min Chen, Tenghai Qiu, Jianqiang Yi

Provable Offline Preference-Based Reinforcement Learning

In this paper, we investigate the problem of offline Preference-based Reinforcement Learning (PbRL) with human feedback where feedback is available in the form of preference between trajectory pairs rather than explicit rewards. Our…

Machine Learning · Computer Science 2023-10-03 Wenhao Zhan, Masatoshi Uehara, Nathan Kallus, Jason D. Lee, Wen Sun

A Polynomial-Time Algorithm for 1/2-Well-Supported Nash Equilibria in Bimatrix Games

Since the seminal PPAD-completeness result for computing a Nash equilibrium even in two-player games, an important line of research has focused on relaxations achievable in polynomial time. In this paper, we consider the notion of…

Computer Science and Game Theory · Computer Science 2022-07-15 Argyrios Deligkas, Michail Fasoulakis, Evangelos Markakis

Equilibrium Selection for Multi-agent Reinforcement Learning: A Unified Framework

While multi-agent reinforcement learning (MARL) has produced numerous algorithms that converge to Nash or related equilibria, such equilibria are often non-unique and can exhibit widely varying efficiency. This raises a fundamental…

Computer Science and Game Theory · Computer Science 2026-01-29 Runyu Zhang, Gioele Zardini, Asuman Ozdaglar, Jeff Shamma, Na Li

Efficient Model-based Multi-agent Reinforcement Learning via Optimistic Equilibrium Computation

We consider model-based multi-agent reinforcement learning, where the environment transition model is unknown and can only be learned via expensive interactions with the environment. We propose H-MARL (Hallucinated Multi-Agent Reinforcement…

Machine Learning · Computer Science 2022-07-12 Pier Giuseppe Sessa, Maryam Kamgarpour, Andreas Krause

Advances in Preference-based Reinforcement Learning: A Review

Reinforcement Learning (RL) algorithms suffer from the dependency on accurately engineered reward functions to properly guide the learning agents to do the required tasks. Preference-based reinforcement learning (PbRL) addresses that by…

Artificial Intelligence · Computer Science 2024-08-23 Youssef Abdelkareem, Shady Shehata, Fakhri Karray