Related papers: Multi-Agent Constrained Policy Optimisation

Multi-Agent Trust Region Policy Optimization

We extend trust region policy optimization (TRPO) to multi-agent reinforcement learning (MARL) problems. We show that the policy update of TRPO can be transformed into a distributed consensus optimization problem for multi-agent cases. By…

Artificial Intelligence · Computer Science 2023-08-08 Hepeng Li, Haibo He

MAC-PO: Multi-Agent Experience Replay via Collective Priority Optimization

Experience replay is crucial for off-policy reinforcement learning (RL) methods. By remembering and reusing the experiences from past different policies, experience replay significantly improves the training efficiency and stability of RL…

Machine Learning · Computer Science 2023-03-01 Yongsheng Mei, Hanhan Zhou, Tian Lan, Guru Venkataramani, Peng Wei

Constrained Markov Decision Processes via Backward Value Functions

Although Reinforcement Learning (RL) algorithms have found tremendous success in simulated domains, they often cannot directly be applied to physical systems, especially in cases where there are hard constraints to satisfy (e.g. on safety…

Machine Learning · Computer Science 2020-08-28 Harsh Satija, Philip Amortila, Joelle Pineau

End-to-End Optimization of LLM-Driven Multi-Agent Search Systems via Heterogeneous-Group-Based Reinforcement Learning

Large language models (LLMs) are versatile, yet their deployment in complex real-world settings is limited by static knowledge cutoffs and the difficulty of producing controllable behavior within a single inference. Multi-agent search…

Machine Learning · Computer Science 2026-04-21 Guanzhong Chen, Shaoxiong Yang, Chao Li, Wei Liu, Jian Luan, Zenglin Xu

DePAint: A Decentralized Safe Multi-Agent Reinforcement Learning Algorithm considering Peak and Average Constraints

The domain of safe multi-agent reinforcement learning (MARL), despite its potential applications in areas ranging from drone delivery and vehicle automation to the development of zero-energy communities, remains relatively unexplored. The…

Multiagent Systems · Computer Science 2024-04-05 Raheeb Hassan, K. M. Shadman Wadith, Md. Mamun or Rashid, Md. Mosaddek Khan

Safety Constrained Multi-Agent Reinforcement Learning for Active Voltage Control

Active voltage control presents a promising avenue for relieving power congestion and enhancing voltage quality, taking advantage of the distributed controllable generators in the power network, such as roof-top photovoltaics. While…

Machine Learning · Computer Science 2024-09-04 Yang Qu, Jinming Ma, Feng Wu

Provably Efficient Generalized Lagrangian Policy Optimization for Safe Multi-Agent Reinforcement Learning

We examine online safe multi-agent reinforcement learning using constrained Markov games in which agents compete by maximizing their expected total rewards under a constraint on expected total utilities. Our focus is confined to an episodic…

Machine Learning · Computer Science 2023-06-02 Dongsheng Ding, Xiaohan Wei, Zhuoran Yang, Zhaoran Wang, Mihailo R. Jovanović

Co2PO: Coordinated Constrained Policy Optimization for Multi-Agent RL

Constrained multi-agent reinforcement learning (MARL) faces a fundamental tension between exploration and safety-constrained optimization. Existing leading approaches, such as Lagrangian methods, typically rely on global penalties or…

Machine Learning · Computer Science 2026-02-04 Shrenik Patel, Christine Truong

DeepSafeMPC: Deep Learning-Based Model Predictive Control for Safe Multi-Agent Reinforcement Learning

Safe Multi-agent reinforcement learning (safe MARL) has increasingly gained attention in recent years, emphasizing the need for agents to not only optimize the global return but also adhere to safety requirements through behavioral…

Machine Learning · Computer Science 2024-03-13 Xuefeng Wang, Henglin Pu, Hyung Jun Kim, Husheng Li

Safe Multi-Agent Reinforcement Learning with Convergence to Generalized Nash Equilibrium

Multi-agent reinforcement learning (MARL) has achieved notable success in cooperative tasks, demonstrating impressive performance and scalability. However, deploying MARL agents in real-world applications presents critical safety…

Machine Learning · Computer Science 2024-11-25 Zeyang Li, Navid Azizan

Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning

Trust region methods rigorously enabled reinforcement learning (RL) agents to learn monotonically improving policies, leading to superior performance on a variety of tasks. Unfortunately, when it comes to multi-agent reinforcement learning…

Artificial Intelligence · Computer Science 2022-04-05 Jakub Grudzien Kuba, Ruiqing Chen, Muning Wen, Ying Wen, Fanglei Sun, Jun Wang, Yaodong Yang

LLM Collaboration With Multi-Agent Reinforcement Learning

A large amount of work has been done in Multi-Agent Systems (MAS) for modeling and solving problems with multiple interacting agents. However, most LLMs are pretrained independently and not specifically optimized for coordination. Existing…

Artificial Intelligence · Computer Science 2025-12-10 Shuo Liu, Tianle Chen, Zeyu Liang, Xueguang Lyu, Christopher Amato

Risk Sensitivity in Markov Games and Multi-Agent Reinforcement Learning: A Systematic Review

Markov games (MGs) and multi-agent reinforcement learning (MARL) are studied to model decision making in multi-agent systems. Traditionally, the objective in MG and MARL has been risk-neutral, i.e., agents are assumed to optimize a…

Computer Science and Game Theory · Computer Science 2024-06-11 Hafez Ghaemi, Shirin Jamshidi, Mohammad Mashreghi, Majid Nili Ahmadabadi, Hamed Kebriaei

Safe and Policy-Compliant Multi-Agent Orchestration for Enterprise AI

Enterprise AI systems increasingly deploy multiple intelligent agents across mission-critical workflows that must satisfy hard policy constraints, bounded risk exposure, and comprehensive auditability (SOX, HIPAA, GDPR). Existing…

Artificial Intelligence · Computer Science 2026-04-21 Vinil Pasupuleti, Shyalendar Reddy Allala, Siva Rama Krishna Varma Bayyavarapu, Shrey Tyagi

Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning

Policy optimization methods with function approximation are widely used in multi-agent reinforcement learning. However, it remains elusive how to design such algorithms with statistical guarantees. Leveraging a multi-agent performance…

Machine Learning · Computer Science 2023-05-09 Yulai Zhao, Zhuoran Yang, Zhaoran Wang, Jason D. Lee

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

Recent years have witnessed significant advances in reinforcement learning (RL), which has registered great success in solving various sequential decision-making problems in machine learning. Most of the successful RL applications, e.g.,…

Machine Learning · Computer Science 2021-04-30 Kaiqing Zhang, Zhuoran Yang, Tamer Başar

Provably Learning Nash Policies in Constrained Markov Potential Games

Multi-agent reinforcement learning (MARL) addresses sequential decision-making problems with multiple agents, where each agent optimizes its own objective. In many real-world instances, the agents may not only want to optimize their…

Machine Learning · Computer Science 2023-06-14 Pragnya Alatur, Giorgia Ramponi, Niao He, Andreas Krause

Multi-Agent Reinforcement Learning for Intraday Operating Rooms Scheduling under Uncertainty

Intraday surgical scheduling is a multi-objective decision problem under uncertainty, balancing elective throughput, urgent and emergency demand, delays, sequence-dependent setups, and overtime. We formulate the problem as a cooperative…

Machine Learning · Computer Science 2025-12-05 Kailiang Liu, Ying Chen, Ralf Borndörfer, Thorsten Koch

Multi-Agent Trust Region Policy Optimisation: A Joint Constraint Approach

Multi-agent reinforcement learning (MARL) requires coordinated and stable policy updates among interacting agents. Heterogeneous-Agent Trust Region Policy Optimization (HATRPO) enforces per-agent trust region constraints using…

Artificial Intelligence · Computer Science 2025-08-15 Chak Lam Shek, Guangyao Shi, Pratap Tokekar

Decentralized Multi-Agent Reinforcement Learning: An Off-Policy Method

We discuss the problem of decentralized multi-agent reinforcement learning (MARL) in this work. In our setting, the global state, action, and reward are assumed to be fully observable, while the local policy is protected as privacy by each…

Multiagent Systems · Computer Science 2021-11-02 Kuo Li, Qing-Shan Jia