Related papers: PARCO: Parallel AutoRegressive Models for Multi-Ag…

Multi-Action Self-Improvement for Neural Combinatorial Optimization

Self-improvement has emerged as a state-of-the-art paradigm in Neural Combinatorial Optimization (NCO), where models iteratively refine their policies by generating and imitating high-quality solutions. Despite strong empirical performance,…

Machine Learning · Computer Science 2025-10-15 Laurin Luttmann, Lin Xie

Improving Generalization of Neural Combinatorial Optimization for Vehicle Routing Problems via Test-Time Projection Learning

Neural Combinatorial Optimization (NCO) has emerged as a promising learning-based paradigm for addressing Vehicle Routing Problems (VRPs) by minimizing the need for extensive manual engineering. While existing NCO methods, trained on…

Machine Learning · Computer Science 2025-11-24 Yuanyao Chen, Rongsheng Chen, Fu Luo, Zhenkun Wang

A Cooperative Multi-Agent Reinforcement Learning Framework for Resource Balancing in Complex Logistics Network

Resource balancing within complex transportation networks is one of the most important problems in the real logistics domain. Traditional solutions to these problems leverage combinatorial optimization with demand and supply forecasting.…

Multiagent Systems · Computer Science 2019-03-05 Xihan Li, Jia Zhang, Jiang Bian, Yunhai Tong, Tie-Yan Liu

Multi-Agent Path Finding with Prioritized Communication Learning

Multi-agent pathfinding (MAPF) has been widely used to solve large-scale real-world problems, e.g., automation warehouses. The learning-based, fully decentralized framework has been introduced to alleviate real-time problems and…

Robotics · Computer Science 2022-02-11 Wenhao Li, Hongjun Chen, Bo Jin, Wenzhe Tan, Hongyuan Zha, Xiangfeng Wang

Recursive Agent Optimization

We introduce Recursive Agent Optimization (RAO), a reinforcement learning approach for training recursive agents: agents that can spawn and delegate sub-tasks to new instantiations of themselves recursively. Recursive agents implement an…

Machine Learning · Computer Science 2026-05-08 Apurva Gandhi, Satyaki Chakraborty, Xiangjun Wang, Aviral Kumar, Graham Neubig

UniCO: Towards a Unified Model for Combinatorial Optimization Problems

Combinatorial Optimization (CO) encompasses a wide range of problems that arise in many real-world scenarios. While significant progress has been made in developing learning-based methods for specialized CO problems, a unified model with a…

Machine Learning · Computer Science 2025-05-13 Zefang Zong, Xiaochen Wei, Guozhen Zhang, Chen Gao, Huandong Wang, Yong Li

Safe and Policy-Compliant Multi-Agent Orchestration for Enterprise AI

Enterprise AI systems increasingly deploy multiple intelligent agents across mission-critical workflows that must satisfy hard policy constraints, bounded risk exposure, and comprehensive auditability (SOX, HIPAA, GDPR). Existing…

Artificial Intelligence · Computer Science 2026-04-21 Vinil Pasupuleti, Shyalendar Reddy Allala, Siva Rama Krishna Varma Bayyavarapu, Shrey Tyagi

Autono: A ReAct-Based Highly Robust Autonomous Agent Framework

This paper proposes a highly robust autonomous agent framework based on the ReAct paradigm, designed to solve complex tasks through adaptive decision making and multi-agent collaboration. Unlike traditional frameworks that rely on fixed…

Multiagent Systems · Computer Science 2025-04-09 Zihao Wu

Learn to Solve Vehicle Routing Problems ASAP: A Neural Optimization Approach for Time-Constrained Vehicle Routing Problems with Finite Vehicle Fleet

Finding a feasible and prompt solution to the Vehicle Routing Problem (VRP) is a prerequisite for efficient freight transportation, seamless logistics, and sustainable mobility. Traditional optimization methods reach their limits when…

Machine Learning · Computer Science 2024-11-08 Elija Deineko, Carina Kehrt

Evaluating Curriculum Learning Strategies in Neural Combinatorial Optimization

Neural combinatorial optimization (NCO) aims at designing problem-independent and efficient neural network-based strategies for solving combinatorial problems. The field recently experienced growth by successfully adapting architectures…

Machine Learning · Computer Science 2020-11-13 Michal Lisicki, Arash Afkanpour, Graham W. Taylor

Reinforced Collaboration in Multi-Agent Flow Networks

Multi-agent systems provide a powerful way to extend large language models (LLMs) by decomposing a complex task into specialized subtasks handled by different agents. However, their performance is often hindered by error propagation,…

Machine Learning · Computer Science 2026-05-14 Zheng Wang, Yuang Liu, Yangkai Ding

Toward Finding Strong Pareto Optimal Policies in Multi-Agent Reinforcement Learning

In this work, we study the problem of finding Pareto optimal policies in multi-agent reinforcement learning problems with cooperative reward structures. We show that any algorithm where each agent only optimizes their reward is subject to…

Machine Learning · Computer Science 2024-10-28 Bang Giang Le, Viet Cuong Ta

A Multi-Objective Optimization framework for Decentralized Learning with coordination constraints

This article introduces a generalized framework for Decentralized Learning formulated as a Multi-Objective Optimization problem, in which both distributed agents and a central coordinator contribute independent, potentially conflicting…

Optimization and Control · Mathematics 2025-07-21 Roberto Morales, Umberto Biccari

Safe Reinforcement Learning for Autonomous Vehicles through Parallel Constrained Policy Optimization

Reinforcement learning (RL) is attracting increasing interest in autonomous driving due to its potential to solve complex classification and control problems. However, existing RL algorithms are rarely applied to real vehicles for two…

Machine Learning · Computer Science 2020-03-04 Lu Wen, Jingliang Duan, Shengbo Eben Li, Shaobing Xu, Huei Peng

MARPO: A Reflective Policy Optimization for Multi Agent Reinforcement Learning

We propose Multi Agent Reflective Policy Optimization (MARPO) to alleviate the issue of sample inefficiency in multi agent reinforcement learning. MARPO consists of two key components: a reflection mechanism that leverages subsequent…

Multiagent Systems · Computer Science 2025-12-30 Cuiling Wu, Yaozhong Gan, Junliang Xing, Ying Fu

Combinatorial-hybrid Optimization for Multi-agent Systems under Collaborative Tasks

Multi-agent systems can be extremely efficient when working concurrently and collaboratively, e.g., for transportation, maintenance, search and rescue. Coordination of such teams often involves two aspects: (i) selecting appropriate…

Robotics · Computer Science 2023-08-29 Zili Tang, Junfeng Chen, Meng Guo

Learning for Robust Combinatorial Optimization: Algorithm and Application

Learning to optimize (L2O) has recently emerged as a promising approach to solving optimization problems by exploiting the strong prediction power of neural networks and offering lower runtime complexity than conventional solvers. While L2O…

Machine Learning · Computer Science 2021-12-21 Zhihui Shao, Jianyi Yang, Cong Shen, Shaolei Ren

Multi-Objective Task Assignment and Multiagent Planning with Hybrid GPU-CPU Acceleration

Allocation and planning with a collection of tasks and a group of agents is an important problem in multiagent systems. One commonly faced bottleneck is scalability, as in general the multiagent model increases exponentially in size with…

Multiagent Systems · Computer Science 2023-05-09 Thomas Robinson, Guoxin Su

ARMATA: Auto-Regressive Multi-Agent Task Assignment

Coordinating multi-agent systems over spatially distributed areas requires solving a complex hierarchical problem: first distributing areas among agents (allocation) and subsequently determining the optimal visitation order (routing).…

Multiagent Systems · Computer Science 2026-05-07 Yazan Youssef, Aboelmagd Noureldin, Sidney Givigi

MARCO: A Memory-Augmented Reinforcement Framework for Combinatorial Optimization

Neural Combinatorial Optimization (NCO) is an emerging domain where deep learning techniques are employed to address combinatorial optimization problems as a standalone solver. Despite their potential, existing NCO methods often suffer from…

Neural and Evolutionary Computing · Computer Science 2024-08-06 Andoni I. Garmendia, Quentin Cappart, Josu Ceberio, Alexander Mendiburu