Related papers: Representation Learning For Efficient Deep Multi-A…

MARPO: A Reflective Policy Optimization for Multi Agent Reinforcement Learning

We propose Multi Agent Reflective Policy Optimization (MARPO) to alleviate the issue of sample inefficiency in multi agent reinforcement learning. MARPO consists of two key components: a reflection mechanism that leverages subsequent…

Multiagent Systems · Computer Science 2025-12-30 Cuiling Wu, Yaozhong Gan, Junliang Xing, Ying Fu

Multi-Agent Reinforcement Learning for Sample-Efficient Deep Neural Network Mapping

Mapping deep neural networks (DNNs) to hardware is critical for optimizing latency, energy consumption, and resource utilization, making it a cornerstone of high-performance accelerator design. Due to the vast and complex mapping space,…

Machine Learning · Computer Science 2025-07-23 Srivatsan Krishnan, Jason Jabbour, Dan Zhang, Natasha Jaques, Aleksandra Faust, Shayegan Omidshafiei, Vijay Janapa Reddi

MAC-PO: Multi-Agent Experience Replay via Collective Priority Optimization

Experience replay is crucial for off-policy reinforcement learning (RL) methods. By remembering and reusing the experiences from past different policies, experience replay significantly improves the training efficiency and stability of RL…

Machine Learning · Computer Science 2023-03-01 Yongsheng Mei, Hanhan Zhou, Tian Lan, Guru Venkataramani, Peng Wei

Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization

Offline Multi-Agent Reinforcement Learning (MARL) is an emerging field that aims to learn optimal multi-agent policies from pre-collected datasets. Compared to single-agent case, multi-agent setting involves a large joint state-action space…

Artificial Intelligence · Computer Science 2024-12-19 Zongkai Liu, Qian Lin, Chao Yu, Xiawei Wu, Yile Liang, Donghui Li, Xuetao Ding

Multi-Agent Reinforcement Learning for Resources Allocation Optimization: A Survey

Multi-Agent Reinforcement Learning (MARL) has become a powerful framework for numerous real-world applications, modeling distributed decision-making and learning from interactions with complex environments. Resource Allocation Optimization…

Multiagent Systems · Computer Science 2025-05-01 Mohamad A. Hady, Siyi Hu, Mahardhika Pratama, Jimmy Cao, Ryszard Kowalczyk

Multi-agent Reinforcement Learning for Energy Saving in Multi-Cell Massive MIMO Systems

We develop a multi-agent reinforcement learning (MARL) algorithm to minimize the total energy consumption of multiple massive MIMO (multiple-input multiple-output) base stations (BSs) in a multi-cell network while preserving the overall…

Information Theory · Computer Science 2024-02-06 Tianzhang Cai, Qichen Wang, Shuai Zhang, Özlem Tuğfe Demir, Cicek Cavdar

Model-based Multi-agent Reinforcement Learning: Recent Progress and Prospects

Significant advances have recently been achieved in Multi-Agent Reinforcement Learning (MARL) which tackles sequential decision-making problems involving multiple participants. However, MARL requires a tremendous number of samples for…

Multiagent Systems · Computer Science 2024-12-30 Xihuai Wang, Zhicheng Zhang, Weinan Zhang

Multi-Agent Reinforcement Learning Scheduling to Support Low Latency in Teleoperated Driving

The teleoperated driving (TD) scenario comes with stringent Quality of Service (QoS) communication constraints, especially in terms of end-to-end (E2E) latency and reliability. In this context, Predictive Quality of Service (PQoS), possibly…

Networking and Internet Architecture · Computer Science 2025-05-07 Giacomo Avanzi, Marco Giordani, Michele Zorzi

Towards Efficient Multi-Agent Learning Systems

Multi-Agent Reinforcement Learning (MARL) is an increasingly important research field that can model and control multiple large-scale autonomous systems. Despite its achievements, existing multi-agent learning methods typically involve…

Multiagent Systems · Computer Science 2023-05-25 Kailash Gogineni, Peng Wei, Tian Lan, Guru Venkataramani

Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts

This paper investigates the model-based methods in multi-agent reinforcement learning (MARL). We specify the dynamics sample complexity and the opponent sample complexity in MARL, and conduct a theoretic analysis of return discrepancy upper…

Machine Learning · Computer Science 2022-03-18 Weinan Zhang, Xihuai Wang, Jian Shen, Ming Zhou

Mars-PO: Multi-Agent Reasoning System Preference Optimization

Mathematical reasoning is a fundamental capability for large language models (LLMs), yet achieving high performance in this domain remains a significant challenge. The auto-regressive generation process often makes LLMs susceptible to…

Artificial Intelligence · Computer Science 2024-12-02 Xiaoxuan Lou, Chaojie Wang, Bo An

Multi-Agent Constrained Policy Optimisation

Developing reinforcement learning algorithms that satisfy safety constraints is becoming increasingly important in real-world applications. In multi-agent reinforcement learning (MARL) settings, policy optimisation with safety awareness is…

Artificial Intelligence · Computer Science 2022-02-11 Shangding Gu, Jakub Grudzien Kuba, Munning Wen, Ruiqing Chen, Ziyan Wang, Zheng Tian, Jun Wang, Alois Knoll, Yaodong Yang

When Do Multi-Agent Systems Outperform? Analysing the Learning Efficiency of Agentic Systems

Reinforcement Learning (RL) has emerged as a crucial method for training or fine-tuning large language models (LLMs), enabling adaptive, task-specific optimizations through interactive feedback. Multi-Agent Reinforcement Learning (MARL), in…

Machine Learning · Computer Science 2026-02-10 Junwei Su, Chuan Wu

MA2CL:Masked Attentive Contrastive Learning for Multi-Agent Reinforcement Learning

Recent approaches have utilized self-supervised auxiliary tasks as representation learning to improve the performance and sample efficiency of vision-based reinforcement learning algorithms in single-agent settings. However, in multi-agent…

Machine Learning · Computer Science 2023-06-06 Haolin Song, Mingxiao Feng, Wengang Zhou, Houqiang Li

Mask-based Latent Reconstruction for Reinforcement Learning

For deep reinforcement learning (RL) from pixels, learning effective state representations is crucial for achieving high performance. However, in practice, limited experience and high-dimensional inputs prevent effective representation…

Machine Learning · Computer Science 2022-10-11 Tao Yu, Zhizheng Zhang, Cuiling Lan, Yan Lu, Zhibo Chen

Permutation Invariant Policy Optimization for Mean-Field Multi-Agent Reinforcement Learning: A Principled Approach

Multi-agent reinforcement learning (MARL) becomes more challenging in the presence of more agents, as the capacity of the joint state and action spaces grows exponentially in the number of agents. To address such a challenge of scale, we…

Machine Learning · Computer Science 2021-05-19 Yan Li, Lingxiao Wang, Jiachen Yang, Ethan Wang, Zhaoran Wang, Tuo Zhao, Hongyuan Zha

Efficient Multi-agent Reinforcement Learning by Planning

Multi-agent reinforcement learning (MARL) algorithms have accomplished remarkable breakthroughs in solving large-scale decision-making tasks. Nonetheless, most existing MARL algorithms are model-free, limiting sample efficiency and…

Machine Learning · Computer Science 2024-05-21 Qihan Liu, Jianing Ye, Xiaoteng Ma, Jun Yang, Bin Liang, Chongjie Zhang

MABL: Bi-Level Latent-Variable World Model for Sample-Efficient Multi-Agent Reinforcement Learning

Multi-agent reinforcement learning (MARL) methods often suffer from high sample complexity, limiting their use in real-world problems where data is sparse or expensive to collect. Although latent-variable world models have been employed to…

Machine Learning · Computer Science 2024-02-15 Aravind Venugopal, Stephanie Milani, Fei Fang, Balaraman Ravindran

End-to-End Optimization of LLM-Driven Multi-Agent Search Systems via Heterogeneous-Group-Based Reinforcement Learning

Large language models (LLMs) are versatile, yet their deployment in complex real-world settings is limited by static knowledge cutoffs and the difficulty of producing controllable behavior within a single inference. Multi-agent search…

Machine Learning · Computer Science 2026-04-21 Guanzhong Chen, Shaoxiong Yang, Chao Li, Wei Liu, Jian Luan, Zenglin Xu

A Review of Cooperative Multi-Agent Deep Reinforcement Learning

Deep Reinforcement Learning has made significant progress in multi-agent systems in recent years. In this review article, we have focused on presenting recent approaches on Multi-Agent Reinforcement Learning (MARL) algorithms. In…

Machine Learning · Computer Science 2021-05-03 Afshin OroojlooyJadid, Davood Hajinezhad