Related papers: MAT-Agent: Adaptive Multi-Agent Training Optimizat…

Visual Document Understanding and Reasoning: A Multi-Agent Collaboration Framework with Agent-Wise Adaptive Test-Time Scaling

The dominant paradigm of monolithic scaling in Vision-Language Models (VLMs) is failing for understanding and reasoning in documents, yielding diminishing returns as it struggles with the inherent need of this domain for document-based…

Computer Vision and Pattern Recognition · Computer Science 2025-11-17 Xinlei Yu , Chengming Xu , Zhangquan Chen , Yudong Zhang , Shilin Lu , Cheng Yang , Jiangning Zhang , Shuicheng Yan , Xiaobin Hu

Multi-Agent Reinforcement Learning is a Sequence Modeling Problem

Large sequence model (SM) such as GPT series and BERT has displayed outstanding performance and generalization capabilities on vision, language, and recently reinforcement learning tasks. A natural follow-up question is how to abstract…

Multiagent Systems · Computer Science 2022-10-31 Muning Wen , Jakub Grudzien Kuba , Runji Lin , Weinan Zhang , Ying Wen , Jun Wang , Yaodong Yang

AOAD-MAT: Transformer-based multi-agent deep reinforcement learning model considering agents' order of action decisions

Multi-agent reinforcement learning focuses on training the behaviors of multiple learning agents that coexist in a shared environment. Recently, MARL models, such as the Multi-Agent Transformer (MAT) and ACtion dEpendent deep Q-learning…

Multiagent Systems · Computer Science 2025-12-30 Shota Takayama , Katsuhide Fujita

Multi-Agent Cooperative Learning for Robust Vision-Language Alignment under OOD Concepts

This paper introduces a novel Multi-Agent Cooperative Learning (MACL) framework to address cross-modal alignment collapse in vision-language models when handling out-of-distribution (OOD) concepts. Four core agents, including image, text,…

Multiagent Systems · Computer Science 2026-04-08 Philip Xu

Multi-Agent Bandit Learning through Heterogeneous Action Erasure Channels

Multi-Armed Bandit (MAB) systems are witnessing an upswing in applications within multi-agent distributed environments, leading to the advancement of collaborative MAB algorithms. In such settings, communication between agents executing…

Machine Learning · Computer Science 2024-04-30 Osama A. Hanna , Merve Karakas , Lin F. Yang , Christina Fragouli

MaAST: Map Attention with Semantic Transformersfor Efficient Visual Navigation

Visual navigation for autonomous agents is a core task in the fields of computer vision and robotics. Learning-based methods, such as deep reinforcement learning, have the potential to outperform the classical solutions developed for this…

Computer Vision and Pattern Recognition · Computer Science 2021-03-23 Zachary Seymour , Kowshik Thopalli , Niluthpol Mithun , Han-Pang Chiu , Supun Samarasekera , Rakesh Kumar

MAPPER: Multi-Agent Path Planning with Evolutionary Reinforcement Learning in Mixed Dynamic Environments

Multi-agent navigation in dynamic environments is of great industrial value when deploying a large scale fleet of robot to real-world applications. This paper proposes a decentralized partially observable multi-agent path planning with…

Robotics · Computer Science 2020-08-03 Zuxin Liu , Baiming Chen , Hongyi Zhou , Guru Koushik , Martial Hebert , Ding Zhao

A Visual Communication Map for Multi-Agent Deep Reinforcement Learning

Deep reinforcement learning has been applied successfully to solve various real-world problems and the number of its applications in the multi-agent settings has been increasing. Multi-agent learning distinctly poses significant challenges…

Machine Learning · Computer Science 2021-02-24 Ngoc Duy Nguyen , Thanh Thi Nguyen , Doug Creighton , Saeid Nahavandi

OVOD-Agent: A Markov-Bandit Framework for Proactive Visual Reasoning and Self-Evolving Detection

Open-Vocabulary Object Detection (OVOD) aims to enable detectors to generalize across categories by leveraging semantic information. Although existing methods are pretrained on large vision-language datasets, their inference is still…

Artificial Intelligence · Computer Science 2026-04-21 Chujie Wang , Jianyu Lu , Zhiyuan Luo , Xi Chen , Chu He

Adaptability in Multi-Agent Reinforcement Learning: A Framework and Unified Review

Multi-Agent Reinforcement Learning (MARL) has shown clear effectiveness in coordinating multiple agents across simulated benchmarks and constrained scenarios. However, its deployment in real-world multi-agent systems (MAS) remains limited,…

Artificial Intelligence · Computer Science 2025-07-15 Siyi Hu , Mohamad A Hady , Jianglin Qiao , Jimmy Cao , Mahardhika Pratama , Ryszard Kowalczyk

PAT: Position-Aware Transformer for Dense Multi-Label Action Detection

We present PAT, a transformer-based network that learns complex temporal co-occurrence action dependencies in a video by exploiting multi-scale temporal features. In existing methods, the self-attention mechanism in transformers loses the…

Computer Vision and Pattern Recognition · Computer Science 2023-08-10 Faegheh Sardari , Armin Mustafa , Philip J. B. Jackson , Adrian Hilton

CMAT: A Multi-Agent Collaboration Tuning Framework for Enhancing Small Language Models

Open large language models (LLMs) have significantly advanced the field of natural language processing, showcasing impressive performance across various tasks.Despite the significant advancements in LLMs, their effective operation still…

Computation and Language · Computer Science 2025-04-16 Xuechen Liang , Yangfan He , Meiling Tao , Yinghui Xia , Jianhui Wang , Tianyu Shi , Jun Wang , JingSong Yang

Multi-Agent Path Finding via Offline RL and LLM Collaboration

Multi-Agent Path Finding (MAPF) poses a significant and challenging problem critical for applications in robotics and logistics, particularly due to its combinatorial complexity and the partial observability inherent in realistic…

Multiagent Systems · Computer Science 2025-09-29 Merve Atasever , Matthew Hong , Mihir Nitin Kulkarni , Qingpei Li , Jyotirmoy V. Deshmukh

MARIC: Multi-Agent Reasoning for Image Classification

Image classification has traditionally relied on parameter-intensive model training, requiring large-scale annotated datasets and extensive fine tuning to achieve competitive performance. While recent vision language models (VLMs) alleviate…

Computer Vision and Pattern Recognition · Computer Science 2025-09-19 Wonduk Seo , Minhyeong Yu , Hyunjin An , Seunghyun Lee

Multi-Agent Guided Policy Optimization

Due to practical constraints such as partial observability and limited communication, Centralized Training with Decentralized Execution (CTDE) has become the dominant paradigm in cooperative Multi-Agent Reinforcement Learning (MARL).…

Artificial Intelligence · Computer Science 2026-03-16 Yueheng Li , Guangming Xie , Zongqing Lu

Bridging the Capability Gap: Joint Alignment Tuning for Harmonizing LLM-based Multi-Agent Systems

The advancement of large language models (LLMs) has enabled the construction of multi-agent systems to solve complex tasks by dividing responsibilities among specialized agents, such as a planning agent for subgoal generation and a…

Computation and Language · Computer Science 2025-09-12 Minghang Zhu , Zhengliang Shi , Zhiwei Xu , Shiguang Wu , Lingjie Wang , Pengjie Ren , Zhaochun Ren , Zhumin Chen

MetaAgent-X : Breaking the Ceiling of Automatic Multi-Agent Systems via End-to-End Reinforcement Learning

Automatic multi-agent systems aim to instantiate agent workflows without relying on manually designed or fixed orchestration. However, existing automatic MAS approaches remain only partially adaptive: they either perform training-free…

Artificial Intelligence · Computer Science 2026-05-15 Yaolun Zhang , Yujie Zhao , Nan Wang , Yiran Wu , Jiayu Chang , Yizhao Chen , Qingyun Wu , Jishen Zhao , Huazheng Wang

Multi-Agent Tool-Integrated Policy Optimization

Large language models (LLMs) increasingly rely on multi-turn tool-integrated planning for knowledge-intensive and complex reasoning tasks. Existing implementations typically rely on a single agent, but they suffer from limited context…

Computation and Language · Computer Science 2025-10-07 Zhanfeng Mo , Xingxuan Li , Yuntao Chen , Lidong Bing

Training One Model to Master Cross-Level Agentic Actions via Reinforcement Learning

The paradigm of agentic AI is shifting from engineered complex workflows to post-training native models. However, existing agents are typically confined to static, predefined action spaces--such as exclusively using APIs, GUI events, or…

Machine Learning · Computer Science 2025-12-11 Kaichen He , Zihao Wang , Muyao Li , Anji Liu , Yitao Liang

Data Agent: Learning to Select Data via End-to-End Dynamic Optimization

Dynamic Data selection aims to accelerate training by prioritizing informative samples during online training. However, existing methods typically rely on task-specific handcrafted metrics or static/snapshot-based criteria to estimate…

Machine Learning · Computer Science 2026-05-14 Suorong Yang , Fangjian Su , Hai Gan , Ziqi Ye , Jie Li , Baile Xu , Furao Shen , Soujanya Poria