English
Related papers

Related papers: Learning to Share: Selective Memory for Efficient …

200 papers

Multi-agent systems (MAS) enable complex reasoning by coordinating multiple agents, but often incur high inference latency due to multi-step execution and repeated model invocations, severely limiting their scalability and usability in…

Multiagent Systems · Computer Science 2026-01-16 Xi Shi , Mengxin Zheng , Qian Lou

Test-time scaling (TTS) enhances the performance of large language models (LLMs) by allocating additional compute resources during inference. However, existing research primarily investigates TTS in single-stage tasks; while many real-world…

Artificial Intelligence · Computer Science 2025-10-23 Fali Wang , Hui Liu , Zhenwei Dai , Jingying Zeng , Zhiwei Zhang , Zongyu Wu , Chen Luo , Zhen Li , Xianfeng Tang , Qi He , Suhang Wang

In this paper, we study the problem of networked multi-agent reinforcement learning (MARL), where a number of agents are deployed as a partially connected network and each interacts only with nearby agents. Networked MARL requires all…

Machine Learning · Computer Science 2022-06-22 Yuxuan Yi , Ge Li , Yaowei Wang , Zongqing Lu

In Lifelong Learning (LL), agents continually learn as they encounter new conditions and tasks. Most current LL is limited to a single agent that learns tasks sequentially. Dedicated LL machinery is then deployed to mitigate the forgetting…

Current test-time scaling (TTS) techniques enhance large language model (LLM) performance by allocating additional computation at inference time, yet they remain insufficient for agentic settings, where actions directly interact with…

Computation and Language · Computer Science 2026-02-04 Xingshan Zeng , Lingzhi Wang , Weiwen Liu , Liangyou Li , Yasheng Wang , Lifeng Shang , Xin Jiang , Qun Liu

While Large Language Model (LLM) based agents excel at complex tasks, their performance in open-ended scenarios is often constrained by isolated operation and reliance on static databases, missing the dynamic knowledge exchange of human…

Computation and Language · Computer Science 2026-03-06 Hang Gao , Yongfeng Zhang

Complex tasks are increasingly delegated to ensembles of specialized LLM-based agents that reason, communicate, and coordinate actions-both among themselves and through interactions with external tools, APIs, and databases. While persistent…

Multiagent Systems · Computer Science 2025-05-27 Alireza Rezazadeh , Zichao Li , Ange Lou , Yuying Zhao , Wei Wei , Yujia Bao

Large language model (LLM) agents face fundamental limitations in long-horizon reasoning due to finite context windows, making effective memory management critical. Existing methods typically handle long-term memory (LTM) and short-term…

Computation and Language · Computer Science 2026-05-01 Yi Yu , Liuyi Yao , Yuexiang Xie , Qingquan Tan , Jiaqi Feng , Yaliang Li , Libing Wu

One of the key challenges for multi-agent learning is scalability. In this paper, we introduce a technique for speeding up multi-agent learning by exploiting concurrent and incremental experience sharing. This solution adaptively identifies…

Multiagent Systems · Computer Science 2017-03-07 Dan Garant , Bruno da Silva , Victor Lesser , Chongjie Zhang

Multi-agent reinforcement learning is a standard framework for modeling multi-agent interactions applied in real-world scenarios. Inspired by experience sharing in human groups, learning knowledge parallel reusing between agents can…

Artificial Intelligence · Computer Science 2020-04-01 Yongyuan Liang , Bangwei Li

Recent advances in large language models (LLMs) have demonstrated the power of reasoning through self-generated chains of thought. Multiple reasoning agents can collaborate to raise joint reasoning quality above individual outcomes.…

Artificial Intelligence · Computer Science 2025-05-19 Chan-Jan Hsu , Davide Buffelli , Jamie McGowan , Feng-Ting Liao , Yi-Chang Chen , Sattar Vakili , Da-shan Shiu

Test-Time Scaling (TTS) enhances the reasoning capabilities of large language models by allocating additional inference compute to explore the solution space. However, existing parallel TTS methods typically keep branches isolated during…

Computation and Language · Computer Science 2026-05-27 Xinglin Wang , Hao Lin , Shaoxiong Feng , Peiwen Yuan , Yiwei Li , Jiayi Shi , Yueqi Zhang , Chuyi Tan , Ji Zhang , Boyuan Pan , Yao Hu , Kan Li

Multi-agent reinforcement learning shines as the pinnacle of multi-agent systems, conquering intricate real-world challenges, fostering collaboration and coordination among agents, and unleashing the potential for intelligent…

Multiagent Systems · Computer Science 2023-12-27 Jiawei Wang , Jian Zhao , Zhengtao Cao , Ruili Feng , Rongjun Qin , Yang Yu

We present a novel multi-agent RL approach, Selective Multi-Agent Prioritized Experience Relay, in which agents share with other agents a limited number of transitions they observe during training. The intuition behind this is that even a…

Machine Learning · Computer Science 2024-04-25 Matthias Gerstgrasser , Tom Danino , Sarah Keren

How should an agent decide when and how to plan? A dominant approach builds agents as reactive policies with adaptive computation (e.g., chain-of-thought), trained end-to-end expecting planning to emerge implicitly. Without control over the…

Artificial Intelligence · Computer Science 2026-05-22 Mingkai Deng , Jinyu Hou , Lara Sá Neves , Varad Pimpalkhute , Taylor W. Killian , Zhengzhong Liu , Eric P. Xing

Reinforcement learning (RL) has become a pivotal component of large language model (LLM) post-training, and agentic RL extends this paradigm to operate as agents through multi-turn interaction and tool use. Scaling such systems exposes two…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-10-08 Zheyue Tan , Mustapha Abdullahi , Tuo Shi , Huining Yuan , Zelai Xu , Chao Yu , Boxun Li , Bo Zhao

We envision a continuous collaborative learning system where groups of LLM agents work together to solve reasoning problems, drawing on memory they collectively build to improve performance as they gain experience. This work establishes the…

Artificial Intelligence · Computer Science 2025-03-11 Julie Michelman , Nasrin Baratalipour , Matthew Abueg

Continual Learning (CL) methods have traditionally focused on mitigating catastrophic forgetting through gradient-based retraining, an approach ill-suited for deployed agents that must adapt in real time. We introduce our Adaptive Teaching…

Machine Learning · Computer Science 2025-11-04 Aman Jaglan , Jarrod Barnes

The idea of experience sharing between cooperative agents naturally emerges from our understanding of how humans learn. Our evolution as a species is tightly linked to the ability to exchange learned knowledge with one another. It follows…

Machine Learning · Computer Science 2019-11-07 Lucas Oliveira Souza , Gabriel de Oliveira Ramos , Celia Ghedini Ralha

Large language models (LLMs) have demonstrated impressive capabilities in reasoning with the emergence of reasoning models like OpenAI-o1 and DeepSeek-R1. Recent research focuses on integrating reasoning capabilities into the realm of…

Computation and Language · Computer Science 2025-05-26 Qi Zhang , Shouqing Yang , Lirong Gao , Hao Chen , Xiaomeng Hu , Jinglei Chen , Jiexiang Wang , Sheng Guo , Bo Zheng , Haobo Wang , Junbo Zhao
‹ Prev 1 2 3 10 Next ›