Sampling-Based Coordination-Informed Multi-Objective Multi-Robot Reinforcement Learning

Antonio Marino; Esteban Restrepo; Soon-jo Chung; Paolo Robuffo Giordano; Claudio Pacchierotti

Sampling-Based Coordination-Informed Multi-Objective Multi-Robot Reinforcement Learning

机器人学 2026-06-29 v1 多智能体系统

作者: Antonio Marino , Esteban Restrepo , Soon-jo Chung , Paolo Robuffo Giordano , Claudio Pacchierotti

摘要

Multi-robot systems must simultaneously optimize competing objectives while maintaining coordinated behavior. Existing multi-agent reinforcement learning approaches often rely on fixed or centralized coordination, which limits adaptability and violates distributed constraints. This work introduces the Coordination-Informed Multi-Objective Reinforcement Learning (CIMORL) framework, integrating a distributed weight prediction mechanism, a privileged expert training strategy, and theoretical guarantees for Pareto-optimal solutions. We present the base CIMORL method alongside two sampling-based variants, CIMORL-TS (Tree Search) and CIMORL-MPPI (MPPI), which leverage privileged global information during training to enable fully decentralized deployment. Experimental validation in cooperative and adversarial scenarios demonstrates a $21.2\%$ hypervolume improvement and superior policy stability compared to state-of-the-art baselines. Real-world experiments with Crazyflie drones further validate the framework's robustness in resource allocation and multi-attacker multi-defend scenarios under partial observability.

引用

@article{arxiv.2606.30893,
  title  = {Sampling-Based Coordination-Informed Multi-Objective Multi-Robot Reinforcement Learning},
  author = {Antonio Marino and Esteban Restrepo and Soon-jo Chung and Paolo Robuffo Giordano and Claudio Pacchierotti},
  journal= {arXiv preprint arXiv:2606.30893},
  year   = {2026}
}

备注

20 pages, 11 figures, 4 tables