Related papers: What Limits Agentic Systems Efficiency?

Don't Break the Cache: An Evaluation of Prompt Caching for Long-Horizon Agentic Tasks

Recent advancements in Large Language Model (LLM) agents have enabled complex multi-turn agentic tasks requiring extensive tool calling, where conversations can span dozens of API calls with increasingly large context windows. However,…

Computation and Language · Computer Science 2026-02-03 Elias Lumer , Faheem Nizar , Akshaya Jangiti , Kevin Frank , Anmol Gulati , Mandar Phadate , Vamse Kumar Subbiah

Agentic Plan Caching: Test-Time Memory for Fast and Cost-Efficient LLM Agents

LLM-based agent applications have shown increasingly remarkable capabilities in complex workflows but incur substantial costs and latency due to extensive planning and reasoning requirements. Existing LLM caching techniques (like context…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-01-28 Qizheng Zhang , Michael Wornow , Gerry Wan , Kunle Olukotun

AgentBench: Evaluating LLMs as Agents

The potential of Large Language Model (LLM) as agents has been widely acknowledged recently. Thus, there is an urgent need to quantitatively \textit{evaluate LLMs as agents} on challenging tasks in interactive environments. We present…

Artificial Intelligence · Computer Science 2025-10-07 Xiao Liu , Hao Yu , Hanchen Zhang , Yifan Xu , Xuanyu Lei , Hanyu Lai , Yu Gu , Hangliang Ding , Kaiwen Men , Kejuan Yang , Shudan Zhang , Xiang Deng , Aohan Zeng , Zhengxiao Du , Chenhui Zhang , Sheng Shen , Tianjun Zhang , Yu Su , Huan Sun , Minlie Huang , Yuxiao Dong , Jie Tang

Efficient LLM Serving for Agentic Workflows: A Data Systems Perspective

Agentic workflows are composed of sequences of interdependent Large Language Model (LLM) calls, and they have become a dominant workload in modern AI systems. These workflows exhibit extensive redundancy from overlapping prompts and…

Multiagent Systems · Computer Science 2026-03-18 Noppanat Wadlom , Junyi Shen , Yao Lu

Efficient Agents: Building Effective Agents While Reducing Cost

The remarkable capabilities of Large Language Model (LLM)-driven agents have enabled sophisticated systems to tackle complex, multi-step tasks, but their escalating costs threaten scalability and accessibility. This work presents the first…

Artificial Intelligence · Computer Science 2025-08-06 Ningning Wang , Xavier Hu , Pai Liu , He Zhu , Yue Hou , Heyuan Huang , Shengyu Zhang , Jian Yang , Jiaheng Liu , Ge Zhang , Changwang Zhang , Jun Wang , Yuchen Eleanor Jiang , Wangchunshu Zhou

Towards Efficient Agents: A Co-Design of Inference Architecture and System

The rapid development of large language model (LLM)-based agents has unlocked new possibilities for autonomous multi-turn reasoning and tool-augmented decision-making. However, their real-world deployment is hindered by severe…

Computation and Language · Computer Science 2026-02-25 Weizhe Lin , Hui-Ling Zhen , Shuai Yang , Xian Wang , Renxi Liu , Hanting Chen , Wangze Zhang , Chuansai Zhou , Yiming Li , Chen Chen , Xing Li , Zhiyuan Yang , Xiaosong Li , Xianzhi Yu , Zhenhua Dong , Mingxuan Yuan , Yunhe Wang

AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts

Large Language Models (LLMs) based autonomous agents demonstrate multifaceted capabilities to contribute substantially to economic production. However, existing benchmarks remain focused on single agentic capability, failing to capture…

Artificial Intelligence · Computer Science 2026-04-24 Keyu Li , Junhao Shi , Yang Xiao , Mohan Jiang , Jie Sun , Yunze Wu , Dayuan Fu , Shijie Xia , Xiaojie Cai , Tianze Xu , Weiye Si , Wenjie Li , Dequan Wang , Pengfei Liu

Agentic Reasoning: A Streamlined Framework for Enhancing LLM Reasoning with Agentic Tools

We introduce Agentic Reasoning, a framework that enhances large language model (LLM) reasoning by integrating external tool-using agents. Agentic Reasoning dynamically leverages web search, code execution, and structured memory to address…

Artificial Intelligence · Computer Science 2025-07-16 Junde Wu , Jiayuan Zhu , Yuyuan Liu , Min Xu , Yueming Jin

Optimizing Agentic Language Model Inference via Speculative Tool Calls

Language models (LMs) are becoming increasingly dependent on external tools. LM-based agentic frameworks frequently interact with their environment via such tools to search files, run code, call APIs, etc. Further, modern reasoning-based…

Programming Languages · Computer Science 2025-12-19 Daniel Nichols , Prajwal Singhania , Charles Jekel , Abhinav Bhatele , Harshitha Menon

The Cost of Dynamic Reasoning: Demystifying AI Agents and Test-Time Scaling from an AI Infrastructure Perspective

Large-language-model (LLM)-based AI agents have recently showcased impressive versatility by employing dynamic reasoning, an adaptive, multi-step process that coordinates with external tools. This shift from static, single-turn inference to…

Machine Learning · Computer Science 2026-01-08 Jiin Kim , Byeongjun Shin , Jinha Chung , Minsoo Rhu

Understanding Multi-Agent LLM Frameworks: A Unified Benchmark and Experimental Analysis

Multi-agent LLM frameworks are widely used to accelerate the development of agent systems powered by large language models (LLMs). These frameworks impose distinct architectural structures that govern how agents interact, store information,…

Artificial Intelligence · Computer Science 2026-02-04 Abdelghny Orogat , Ana Rostam , Essam Mansour

Improving Coherence and Persistence in Agentic AI for System Optimization

Designing high-performance system heuristics is a creative, iterative process requiring experts to form hypotheses and execute multi-step conceptual shifts. While Large Language Models (LLMs) show promise in automating this loop, they…

Artificial Intelligence · Computer Science 2026-03-24 Pantea Karimi , Kimia Noorbakhsh , Mohammad Alizadeh , Hari Balakrishnan

Agentic Architect: An Agentic AI Framework for Architecture Design Exploration and Optimization

Rapid advances in Large Language Models (LLMs) create new opportunities by enabling efficient exploration of broad, complex design spaces. This is particularly valuable in computer architecture, where performance depends on…

Artificial Intelligence · Computer Science 2026-04-29 Alexander Blasberg , Vasilis Kypriotis , Dimitrios Skarlatos

Anatomy of Agentic Memory: Taxonomy and Empirical Analysis of Evaluation and System Limitations

Agentic memory systems enable large language model (LLM) agents to maintain state across long interactions, supporting long-horizon reasoning and personalization beyond fixed context windows. Despite rapid architectural development, the…

Computation and Language · Computer Science 2026-05-21 Dongming Jiang , Yi Li , Songtao Wei , Jinxin Yang , Ayushi Kishore , Alysa Zhao , Dingyi Kang , Xu Hu , Feng Chen , Qiannan Li , Bingzhe Li

LLM-based Agentic Reasoning Frameworks: A Survey from Methods to Scenarios

Recent advances in the intrinsic reasoning capabilities of large language models (LLMs) have given rise to LLM-based agent systems that exhibit near-human performance on a variety of automated tasks. However, although these systems share…

Artificial Intelligence · Computer Science 2025-08-26 Bingxi Zhao , Lin Geng Foo , Ping Hu , Christian Theobalt , Hossein Rahmani , Jun Liu

AgenticCache: Cache-Driven Asynchronous Planning for Embodied AI Agents

Embodied AI agents increasingly rely on large language models (LLMs) for planning, yet per-step LLM calls impose severe latency and cost. In this paper, we show that embodied tasks exhibit strong plan locality, where the next plan is…

Machine Learning · Computer Science 2026-04-28 Hojoon Kim , Yuheng Wu , Thierry Tambe

Latent Collaboration in Multi-Agent Systems

Multi-agent systems (MAS) extend large language models (LLMs) from independent single-model reasoning to coordinative system-level intelligence. While existing LLM agents depend on text-based mediation for reasoning and communication, we…

Computation and Language · Computer Science 2025-12-09 Jiaru Zou , Xiyuan Yang , Ruizhong Qiu , Gaotang Li , Katherine Tieu , Pan Lu , Ke Shen , Hanghang Tong , Yejin Choi , Jingrui He , James Zou , Mengdi Wang , Ling Yang

Multi-agent Architecture Search via Agentic Supernet

Large Language Model (LLM)-empowered multi-agent systems extend the cognitive boundaries of individual agents through disciplined collaboration and interaction, while constructing these systems often requires labor-intensive manual designs.…

Machine Learning · Computer Science 2025-06-10 Guibin Zhang , Luyang Niu , Junfeng Fang , Kun Wang , Lei Bai , Xiang Wang

The Path Ahead for Agentic AI: Challenges and Opportunities

The evolution of Large Language Models (LLMs) from passive text generators to autonomous, goal-driven systems represents a fundamental shift in artificial intelligence. This chapter examines the emergence of agentic AI systems that…

Artificial Intelligence · Computer Science 2026-01-07 Nadia Sibai , Yara Ahmed , Serry Sibaee , Sawsan AlHalawani , Adel Ammar , Wadii Boulila

RAGCap-Bench: Benchmarking Capabilities of LLMs in Agentic Retrieval Augmented Generation Systems

Retrieval-Augmented Generation (RAG) mitigates key limitations of Large Language Models (LLMs)-such as factual errors, outdated knowledge, and hallucinations-by dynamically retrieving external information. Recent work extends this paradigm…

Computation and Language · Computer Science 2026-05-22 Jingru Lin , Chen Zhang , Stephen Y. Liu , Haizhou Li