Related papers: ThunderAgent: A Simple, Fast and Program-Aware Age…

Towards Efficient Agents: A Co-Design of Inference Architecture and System

The rapid development of large language model (LLM)-based agents has unlocked new possibilities for autonomous multi-turn reasoning and tool-augmented decision-making. However, their real-world deployment is hindered by severe…

Computation and Language · Computer Science 2026-02-25 Weizhe Lin , Hui-Ling Zhen , Shuai Yang , Xian Wang , Renxi Liu , Hanting Chen , Wangze Zhang , Chuansai Zhou , Yiming Li , Chen Chen , Xing Li , Zhiyuan Yang , Xiaosong Li , Xianzhi Yu , Zhenhua Dong , Mingxuan Yuan , Yunhe Wang

Efficient LLM Serving for Agentic Workflows: A Data Systems Perspective

Agentic workflows are composed of sequences of interdependent Large Language Model (LLM) calls, and they have become a dominant workload in modern AI systems. These workflows exhibit extensive redundancy from overlapping prompts and…

Multiagent Systems · Computer Science 2026-03-18 Noppanat Wadlom , Junyi Shen , Yao Lu

Towards General Computer Control with Hierarchical Agents and Multi-Level Action Spaces

Controlling desktop applications via software remains a fundamental yet under-served problem. Existing multi-modal large language models (MLLMs) ingest screenshots and task instructions to generate keystrokes and mouse events, but they…

Artificial Intelligence · Computer Science 2025-09-24 Zihan Dong , Xinyu Fan , Zixiang Tang , Yunqing Li

TS-Agent: Understanding and Reasoning Over Raw Time Series via Iterative Insight Gathering

Large language models (LLMs) exhibit strong symbolic and compositional reasoning, yet they struggle with time series question answering as the data is typically transformed into an LLM-compatible modality, e.g., serialized text, plotted…

Artificial Intelligence · Computer Science 2026-04-08 Penghang Liu , Elizabeth Fons , Annita Vapsi , Mohsen Ghassemi , Svitlana Vyetrenko , Daniel Borrajo , Vamsi K. Potluru , Manuela Veloso

TurboAgent: An LLM-Driven Autonomous Multi-Agent Framework for Turbomachinery Aerodynamic Design

The aerodynamic design of turbomachinery is a complex and tightly coupled multi-stage process involving geometry generation, performance prediction, optimization, and high-fidelity physical validation. Existing intelligent design approaches…

Artificial Intelligence · Computer Science 2026-04-10 Juan Du , Yueteng Wu , Pan Zhao , Yuze Liu , Min Zhang , Xiaobin Xu , Xinglong Zhang

HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language Model

Large Language Model (LLM)-based agents exhibit significant potential across various domains, operating as interactive systems that process environmental observations to generate executable actions for target tasks. The effectiveness of…

Computation and Language · Computer Science 2024-08-20 Mengkang Hu , Tianxing Chen , Qiguang Chen , Yao Mu , Wenqi Shao , Ping Luo

MapAgent: A Hierarchical Agent for Geospatial Reasoning with Dynamic Map Tool Integration

Agentic AI has significantly extended the capabilities of large language models (LLMs) by enabling complex reasoning and tool use. However, most existing frameworks are tailored to domains such as mathematics, coding, or web automation, and…

Artificial Intelligence · Computer Science 2025-10-15 Md Hasebul Hasan , Mahir Labib Dihan , Tanzima Hashem , Mohammed Eunus Ali , Md Rizwan Parvez

Sutradhara: An Intelligent Orchestrator-Engine Co-design for Tool-based Agentic Inference

Agentic applications are LLMs that iteratively invoke external tools to accomplish complex tasks. Such tool-based agents are rapidly becoming the dominant paradigm for deploying language models in production. Unlike traditional single-turn…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-04-23 Anish Biswas , Kanishk Goel , Srivarshinee S , Jayashree Mohan , Alind Khare , Anjaly Parayil , Ramachandran Ramjee , Chetan Bansal

Astraea: A State-Aware Scheduling Engine for LLM-Powered Agents

Large Language Models (LLMs) are increasingly being deployed as intelligent agents. Their multi-stage workflows, which alternate between local computation and calls to external network services like Web APIs, introduce a mismatch in their…

Computation and Language · Computer Science 2025-12-17 Hongqiu Ni , Jiabao Zhang , Guopeng Li , Zilong Wang , Ruiqi Wu , Chi Zhang , Haisheng Tan

TimeART: Towards Agentic Time Series Reasoning via Tool-Augmentation

Time series data widely exist in real-world cyber-physical systems. Though analyzing and interpreting them contributes to significant values, e.g, disaster prediction and financial risk control, current workflows mainly rely on human data…

Machine Learning · Computer Science 2026-01-21 Xingjian Wu , Junkai Lu , Zhengyu Li , Xiangfei Qiu , Jilin Hu , Chenjuan Guo , Christian S. Jensen , Bin Yang

HexAGenT: Efficient Agentic LLM Serving via Workflow- and Heterogeneity-Aware Scheduling

Agentic LLM applications increasingly execute user requests as multi-step workflows involving planning, tool use, branching, refinement, and synthesis. In such settings, users experience the end-to-end latency of an entire workflow, not the…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-05-19 You Peng , Youhe Jiang , Wenshuang Li , Xu Xu , Ke Zhou , Jiawei Jiang , Chen Wang , Binhang Yuan

CaveAgent: Transforming LLMs into Stateful Runtime Operators

LLM-based agents are increasingly capable of complex task execution, yet current agentic systems remain constrained by text-centric paradigms that struggle with long-horizon tasks due to fragile multi-turn dependencies and context drift. We…

Artificial Intelligence · Computer Science 2026-02-20 Maohao Ran , Zhenglin Wan , Cooper Lin , Yanting Zhang , Hongyu Xin , Hongwei Fan , Yibo Xu , Beier Luo , Yaxin Zhou , Wangbo Zhao , Lijie Yang , Lang Feng , Fuchao Yang , Jingxuan Wu , Yiqiao Huang , Chendong Ma , Dailing Jiang , Jianbo Deng , Sirui Han , Yang You , Bo An , Yike Guo , Jun Song

Demystifying and Enhancing the Efficiency of Large Language Model Based Search Agents

Large Language Model (LLM)-based search agents have shown remarkable capabilities in solving complex tasks by dynamically decomposing problems and addressing them through interleaved reasoning and retrieval. However, this interleaved…

Artificial Intelligence · Computer Science 2025-05-20 Tiannuo Yang , Zebin Yao , Bowen Jin , Lixiao Cui , Yusen Li , Gang Wang , Xiaoguang Liu

TrainerAgent: Customizable and Efficient Model Training through LLM-Powered Multi-Agent System

Training AI models has always been challenging, especially when there is a need for custom models to provide personalized services. Algorithm engineers often face a lengthy process to iteratively develop models tailored to specific business…

Artificial Intelligence · Computer Science 2023-11-27 Haoyuan Li , Hao Jiang , Tianke Zhang , Zhelun Yu , Aoxiong Yin , Hao Cheng , Siming Fu , Yuhao Zhang , Wanggui He

SheetAgent: Towards A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models

Spreadsheets are ubiquitous across the World Wide Web, playing a critical role in enhancing work efficiency across various domains. Large language model (LLM) has been recently attempted for automatic spreadsheet manipulation but has not…

Artificial Intelligence · Computer Science 2025-03-04 Yibin Chen , Yifu Yuan , Zeyu Zhang , Yan Zheng , Jinyi Liu , Fei Ni , Jianye Hao , Hangyu Mao , Fuzheng Zhang

A Survey of LLM Inference Systems

The past few years has witnessed specialized large language model (LLM) inference systems, such as vLLM, SGLang, Mooncake, and DeepFlow, alongside rapid LLM adoption via services like ChatGPT. Driving these system design efforts is the…

Databases · Computer Science 2025-06-30 James Pan , Guoliang Li

A Cloud-based Multi-Agentic Workflow for Science

As Large Language Models (LLMs) become ubiquitous across various scientific domains, their lack of ability to perform complex tasks like running simulations or to make complex decisions limits their utility. LLM-based agents bridge this gap…

Computation and Language · Computer Science 2026-01-21 Anurag Acharya , Timothy Vega , Rizwan A. Ashraf , Anshu Sharma , Derek Parker , Robert Rallo

TabAgent: A Framework for Replacing Agentic Generative Components with Tabular-Textual Classifiers

Agentic systems, AI architectures that autonomously execute multi-step workflows to achieve complex goals, are often built using repeated large language model (LLM) calls for closed-set decision tasks such as routing, shortlisting, gating,…

Computation and Language · Computer Science 2026-02-19 Ido Levy , Eilam Shapira , Yinon Goldshtein , Avi Yaeli , Nir Mashkif , Segev Shlomov

RefAgent: A Multi-agent LLM-based Framework for Automatic Software Refactoring

Large Language Models (LLMs) have substantially influenced various software engineering tasks. Indeed, in the case of software refactoring, traditional LLMs have shown the ability to reduce development time and enhance code quality.…

Software Engineering · Computer Science 2026-03-06 Khouloud Oueslati , Maxime Lamothe , Foutse Khomh

Chimera: Latency- and Performance-Aware Multi-agent Serving for Heterogeneous LLMs

Multi-agent applications often execute complex tasks as multi-stage workflows, where each stage is an LLM call whose output becomes part of context for subsequent steps. Existing LLM serving systems largely assume homogeneous clusters with…

Machine Learning · Computer Science 2026-03-24 Kangqi Ni , Wenyue Hua , Xiaoxiang Shi , Jiang Guo , Shiyu Chang , Tianlong Chen