Related papers: Learning to Share: Selective Memory for Efficient …

Learning Latency-Aware Orchestration for Parallel Multi-Agent Systems

Multi-agent systems (MAS) enable complex reasoning by coordinating multiple agents, but often incur high inference latency due to multi-step execution and repeated model invocations, severely limiting their scalability and usability in…

Multiagent Systems · Computer Science 2026-01-16 Xi Shi , Mengxin Zheng , Qian Lou

AgentTTS: Large Language Model Agent for Test-time Compute-optimal Scaling Strategy in Complex Tasks

Test-time scaling (TTS) enhances the performance of large language models (LLMs) by allocating additional compute resources during inference. However, existing research primarily investigates TTS in single-stage tasks; while many real-world…

Artificial Intelligence · Computer Science 2025-10-23 Fali Wang , Hui Liu , Zhenwei Dai , Jingying Zeng , Zhiwei Zhang , Zongyu Wu , Chen Luo , Zhen Li , Xianfeng Tang , Qi He , Suhang Wang

Learning to Share in Multi-Agent Reinforcement Learning

In this paper, we study the problem of networked multi-agent reinforcement learning (MARL), where a number of agents are deployed as a partially connected network and each interacts only with nearby agents. Networked MARL requires all…

Machine Learning · Computer Science 2022-06-22 Yuxuan Yi , Ge Li , Yaowei Wang , Zongqing Lu

Lightweight Learner for Shared Knowledge Lifelong Learning

In Lifelong Learning (LL), agents continually learn as they encounter new conditions and tasks. Most current LL is limited to a single agent that learns tasks sequentially. Dedicated LL machinery is then deployed to mitigate the forgetting…

Machine Learning · Computer Science 2023-05-26 Yunhao Ge , Yuecheng Li , Di Wu , Ao Xu , Adam M. Jones , Amanda Sofie Rios , Iordanis Fostiropoulos , Shixian Wen , Po-Hsuan Huang , Zachary William Murdock , Gozde Sahin , Shuo Ni , Kiran Lekkala , Sumedh Anand Sontakke , Laurent Itti

ARTIS: Agentic Risk-Aware Test-Time Scaling via Iterative Simulation

Current test-time scaling (TTS) techniques enhance large language model (LLM) performance by allocating additional computation at inference time, yet they remain insufficient for agentic settings, where actions directly interact with…

Computation and Language · Computer Science 2026-02-04 Xingshan Zeng , Lingzhi Wang , Weiwen Liu , Liangyou Li , Yasheng Wang , Lifeng Shang , Xin Jiang , Qun Liu

INMS: Memory Sharing for Large Language Model based Agents

While Large Language Model (LLM) based agents excel at complex tasks, their performance in open-ended scenarios is often constrained by isolated operation and reliance on static databases, missing the dynamic knowledge exchange of human…

Computation and Language · Computer Science 2026-03-06 Hang Gao , Yongfeng Zhang

Collaborative Memory: Multi-User Memory Sharing in LLM Agents with Dynamic Access Control

Complex tasks are increasingly delegated to ensembles of specialized LLM-based agents that reason, communicate, and coordinate actions-both among themselves and through interactions with external tools, APIs, and databases. While persistent…

Multiagent Systems · Computer Science 2025-05-27 Alireza Rezazadeh , Zichao Li , Ange Lou , Yuying Zhao , Wei Wei , Yujia Bao

Agentic Memory: Learning Unified Long-Term and Short-Term Memory Management for Large Language Model Agents

Large language model (LLM) agents face fundamental limitations in long-horizon reasoning due to finite context windows, making effective memory management critical. Existing methods typically handle long-term memory (LTM) and short-term…

Computation and Language · Computer Science 2026-05-01 Yi Yu , Liuyi Yao , Yuexiang Xie , Qingquan Tan , Jiaqi Feng , Yaliang Li , Libing Wu

Context-Based Concurrent Experience Sharing in Multiagent Systems

One of the key challenges for multi-agent learning is scalability. In this paper, we introduce a technique for speeding up multi-agent learning by exploiting concurrent and incremental experience sharing. This solution adaptively identifies…

Multiagent Systems · Computer Science 2017-03-07 Dan Garant , Bruno da Silva , Victor Lesser , Chongjie Zhang

Parallel Knowledge Transfer in Multi-Agent Reinforcement Learning

Multi-agent reinforcement learning is a standard framework for modeling multi-agent interactions applied in real-world scenarios. Inspired by experience sharing in human groups, learning knowledge parallel reusing between agents can…

Artificial Intelligence · Computer Science 2020-04-01 Yongyuan Liang , Bangwei Li

Group Think: Multiple Concurrent Reasoning Agents Collaborating at Token Level Granularity

Recent advances in large language models (LLMs) have demonstrated the power of reasoning through self-generated chains of thought. Multiple reasoning agents can collaborate to raise joint reasoning quality above individual outcomes.…

Artificial Intelligence · Computer Science 2025-05-19 Chan-Jan Hsu , Davide Buffelli , Jamie McGowan , Feng-Ting Liao , Yi-Chang Chen , Sattar Vakili , Da-shan Shiu

Share More, Search Less: Collaborative Parallel Thinking for Efficient Test-Time Scaling

Test-Time Scaling (TTS) enhances the reasoning capabilities of large language models by allocating additional inference compute to explore the solution space. However, existing parallel TTS methods typically keep branches isolated during…

Computation and Language · Computer Science 2026-05-27 Xinglin Wang , Hao Lin , Shaoxiong Feng , Peiwen Yuan , Yiwei Li , Jiayi Shi , Yueqi Zhang , Chuyi Tan , Ji Zhang , Boyuan Pan , Yao Hu , Kan Li

Multi-Task Multi-Agent Shared Layers are Universal Cognition of Multi-Agent Coordination

Multi-agent reinforcement learning shines as the pinnacle of multi-agent systems, conquering intricate real-world challenges, fostering collaboration and coordination among agents, and unleashing the potential for intelligent…

Multiagent Systems · Computer Science 2023-12-27 Jiawei Wang , Jian Zhao , Zhengtao Cao , Ruili Feng , Rongjun Qin , Yang Yu

Selectively Sharing Experiences Improves Multi-Agent Reinforcement Learning

We present a novel multi-agent RL approach, Selective Multi-Agent Prioritized Experience Relay, in which agents share with other agents a limited number of transitions they observe during training. The intuition behind this is that even a…

Machine Learning · Computer Science 2024-04-25 Matthias Gerstgrasser , Tom Danino , Sarah Keren

Efficient Agentic Reasoning Through Self-Regulated Simulative Planning

How should an agent decide when and how to plan? A dominant approach builds agents as reactive policies with adaptive computation (e.g., chain-of-thought), trained end-to-end expecting planning to emerge implicitly. Without control over the…

Artificial Intelligence · Computer Science 2026-05-22 Mingkai Deng , Jinyu Hou , Lara Sá Neves , Varad Pimpalkhute , Taylor W. Killian , Zhengzhong Liu , Eric P. Xing

EARL: Efficient Agentic Reinforcement Learning Systems for Large Language Models

Reinforcement learning (RL) has become a pivotal component of large language model (LLM) post-training, and agentic RL extends this paradigm to operate as agents through multi-turn interaction and tool use. Scaling such systems exposes two…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-10-08 Zheyue Tan , Mustapha Abdullahi , Tuo Shi , Huining Yuan , Zelai Xu , Chao Yu , Boxun Li , Bo Zhao

Enhancing Reasoning with Collaboration and Memory

We envision a continuous collaborative learning system where groups of LLM agents work together to solve reasoning problems, drawing on memory they collectively build to improve performance as they gain experience. This work establishes the…

Artificial Intelligence · Computer Science 2025-03-11 Julie Michelman , Nasrin Baratalipour , Matthew Abueg

Continual Learning, Not Training: Online Adaptation For Agents

Continual Learning (CL) methods have traditionally focused on mitigating catastrophic forgetting through gradient-based retraining, an approach ill-suited for deployed agents that must adapt in real time. We introduce our Adaptive Teaching…

Machine Learning · Computer Science 2025-11-04 Aman Jaglan , Jarrod Barnes

Experience Sharing Between Cooperative Reinforcement Learning Agents

The idea of experience sharing between cooperative agents naturally emerges from our understanding of how humans learn. Our evolution as a species is tightly linked to the ability to exchange learned knowledge with one another. It follows…

Machine Learning · Computer Science 2019-11-07 Lucas Oliveira Souza , Gabriel de Oliveira Ramos , Celia Ghedini Ralha

LeTS: Learning to Think-and-Search via Process-and-Outcome Reward Hybridization

Large language models (LLMs) have demonstrated impressive capabilities in reasoning with the emergence of reasoning models like OpenAI-o1 and DeepSeek-R1. Recent research focuses on integrating reasoning capabilities into the realm of…

Computation and Language · Computer Science 2025-05-26 Qi Zhang , Shouqing Yang , Lirong Gao , Hao Chen , Xiaomeng Hu , Jinglei Chen , Jiexiang Wang , Sheng Guo , Bo Zheng , Haobo Wang , Junbo Zhao