Related papers: Pythia: Exploiting Workflow Predictability for Eff…

Efficient LLM Serving for Agentic Workflows: A Data Systems Perspective

Agentic workflows are composed of sequences of interdependent Large Language Model (LLM) calls, and they have become a dominant workload in modern AI systems. These workflows exhibit extensive redundancy from overlapping prompts and…

Multiagent Systems · Computer Science 2026-03-18 Noppanat Wadlom , Junyi Shen , Yao Lu

Query Optimization Beyond Data Systems: The Case for Multi-Agent Systems

The proliferation of large language models (LLMs) has accelerated the adoption of agent-based workflows, where multiple autonomous agents reason, invoke functions, and collaborate to compose complex data pipelines. However, current…

Databases · Computer Science 2025-12-15 Zoi Kaoudi , Ioana Giurgiu

Lita: Light Agent Uncovers the Agentic Coding Capabilities of LLMs

Large language models (LLMs) are increasingly being applied to programming tasks, ranging from single-turn code completion to autonomous agents. Current code agent designs frequently depend on complex, hand-crafted workflows and tool sets.…

Artificial Intelligence · Computer Science 2025-10-01 Hankun Dai , Maoquan Wang , Mengnan Qi , Yikai Zhang , Zijian Jin , Yongqiang Yao , Yufan Huang , Shengyu Fu , Elsie Nallipogu

Chimera: Latency- and Performance-Aware Multi-agent Serving for Heterogeneous LLMs

Multi-agent applications often execute complex tasks as multi-stage workflows, where each stage is an LLM call whose output becomes part of context for subsequent steps. Existing LLM serving systems largely assume homogeneous clusters with…

Machine Learning · Computer Science 2026-03-24 Kangqi Ni , Wenyue Hua , Xiaoxiang Shi , Jiang Guo , Shiyu Chang , Tianlong Chen

Supporting Our AI Overlords: Redesigning Data Systems to be Agent-First

Large Language Model (LLM) agents, acting on their users' behalf to manipulate and analyze data, are likely to become the dominant workload for data systems in the future. When working with data, agents employ a high-throughput process of…

Artificial Intelligence · Computer Science 2025-12-09 Shu Liu , Soujanya Ponnapalli , Shreya Shankar , Sepanta Zeighami , Alan Zhu , Shubham Agarwal , Ruiqi Chen , Samion Suwito , Shuo Yuan , Ion Stoica , Matei Zaharia , Alvin Cheung , Natacha Crooks , Joseph E. Gonzalez , Aditya G. Parameswaran

Astraea: A State-Aware Scheduling Engine for LLM-Powered Agents

Large Language Models (LLMs) are increasingly being deployed as intelligent agents. Their multi-stage workflows, which alternate between local computation and calls to external network services like Web APIs, introduce a mismatch in their…

Computation and Language · Computer Science 2025-12-17 Hongqiu Ni , Jiabao Zhang , Guopeng Li , Zilong Wang , Ruiqi Wu , Chi Zhang , Haisheng Tan

Autellix: An Efficient Serving Engine for LLM Agents as General Programs

Large language model (LLM) applications are evolving beyond simple chatbots into dynamic, general-purpose agentic programs, which scale LLM calls and output tokens to help AI agents reason, explore, and solve complex tasks. However,…

Machine Learning · Computer Science 2025-02-20 Michael Luo , Xiaoxiang Shi , Colin Cai , Tianjun Zhang , Justin Wong , Yichuan Wang , Chi Wang , Yanping Huang , Zhifeng Chen , Joseph E. Gonzalez , Ion Stoica

stratum: A System Infrastructure for Massive Agent-Centric ML Workloads

Recent advances in large language models (LLMs) transform how machine learning (ML) pipelines are developed and evaluated. LLMs enable a new type of workload, agentic pipeline search, in which autonomous or semi-autonomous agents generate,…

Databases · Computer Science 2026-03-06 Arnab Phani , Elias Strauss , Sebastian Schelter

Scepsy: Serving Agentic Workflows Using Aggregate LLM Pipelines

Agentic workflows carry out complex tasks by orchestrating multiple large language models (LLMs) and tools. Serving such workflows at a target throughput with low latency is challenging because they can be defined using arbitrary agentic…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-04-17 Marcel Wagenländer , Otto White , Britannio Jarrett , Pedro Silvestre , Yanda Tao , Guo Li , Huanzhou Zhu , Llúis Vilanova , Peter Pietzuch

A Survey on Agent Workflow -- Status and Future

In the age of large language models (LLMs), autonomous agents have emerged as a powerful paradigm for achieving general intelligence. These agents dynamically leverage tools, memory, and reasoning capabilities to accomplish user-defined…

Artificial Intelligence · Computer Science 2025-08-05 Chaojia Yu , Zihan Cheng , Hanwen Cui , Yishuo Gao , Zexu Luo , Yijin Wang , Hangbin Zheng , Yong Zhao

Learning to Choose: An Empowerment-Guided Multi-Agent System with semantic communication for Adaptive Method Selection

Automating scientific computing workflows requires more than generating executable code: autonomous systems must also select appropriate computational strategies, implement them faithfully, and ensure that the resulting outcomes remain…

Artificial Intelligence · Computer Science 2026-05-29 Geremy Loachamín-Suntaxi , Robert Lazar , Dimitrios G. Giovanis , Ioannis G. Kevrekidis , Eleni D. Koronaki

AutoFlow: Automated Workflow Generation for Large Language Model Agents

Recent advancements in Large Language Models (LLMs) have shown significant progress in understanding complex natural language. One important application of LLM is LLM-based AI Agent, which leverages the ability of LLM as well as external…

Computation and Language · Computer Science 2024-07-19 Zelong Li , Shuyuan Xu , Kai Mei , Wenyue Hua , Balaji Rama , Om Raheja , Hao Wang , He Zhu , Yongfeng Zhang

LLM-Powered AI Agent Systems and Their Applications in Industry

The emergence of Large Language Models (LLMs) has reshaped agent systems. Unlike traditional rule-based agents with limited task scope, LLM-powered agents offer greater flexibility, cross-domain reasoning, and natural language interaction.…

Artificial Intelligence · Computer Science 2026-05-05 Guannan Liang , Qianqian Tong

Agentic AI Workload Characteristics

Agentic AI shifts LLM serving from isolated prompt-generation requests to stateful, multi-turn executions that repeatedly invoke the model, call tools, and grow context over time. This paper characterizes ReAct-style agents from both the…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-05-27 Yichao Yuan , Ankita Nayak , Souvik Kundu , Nishil Talati

PROTEA: Offline Evaluation and Iterative Refinement for Multi-Agent LLM Workflows

Multi-agent LLM workflows -- systems composed of multiple role-specific LLM calls -- often outperform single-prompt baselines, but they remain difficult to debug and refine. Failures can originate from subtle errors in intermediate outputs…

Computation and Language · Computer Science 2026-05-19 Kazuki Kawamura , Satoshi Waki , Kei Tateno

KnowThyself: An Agentic Assistant for LLM Interpretability

We develop KnowThyself, an agentic assistant that advances large language model (LLM) interpretability. Existing tools provide useful insights but remain fragmented and code-intensive. KnowThyself consolidates these capabilities into a…

Artificial Intelligence · Computer Science 2025-11-07 Suraj Prasai , Mengnan Du , Ying Zhang , Fan Yang

Asynchronous Tool Usage for Real-Time Agents

While frontier large language models (LLMs) are capable tool-using agents, current AI systems still operate in a strict turn-based fashion, oblivious to passage of time. This synchronous design forces user queries and tool-use to occur…

Artificial Intelligence · Computer Science 2024-10-30 Antonio A. Ginart , Naveen Kodali , Jason Lee , Caiming Xiong , Silvio Savarese , John Emmons

HEXGEN-FLOW: Optimizing LLM Inference Request Scheduling for Agentic Text-to-SQL

Recent advances in agentic large language models (LLMs) have substantially improved Text-to-SQL, enabling users without database expertise to query databases intuitively. However, deploying agentic LLM-based Text-to-SQL systems in…

Databases · Computer Science 2026-03-10 You Peng , Youhe Jiang , Wenqi Jiang , Chen Wang , Binhang Yuan

Lemon Agent Technical Report

Recent advanced LLM-powered agent systems have exhibited their remarkable capabilities in tackling complex, long-horizon tasks. Nevertheless, they still suffer from inherent limitations in resource efficiency, context management, and…

Multiagent Systems · Computer Science 2026-02-10 Haipeng Jiang , Kailong Ren , Zimo Yin , Zhetao Sun , Xin Gan , Guangyi Lv , Ming He , Peng Wang , Congli Yin , Hong Pan , Changwen Zhang , Shan Tong , Zhengyu Xu , Zeping Chen , Yubin Huangfu , Yanzhi Xu , Xing Su , Qin Feng , Dong An , Jianping Fan

SYMPHONY: Synergistic Multi-agent Planning with Heterogeneous Language Model Assembly

Recent advancements have increasingly focused on leveraging large language models (LLMs) to construct autonomous agents for complex problem-solving tasks. However, existing approaches predominantly employ a single-agent framework to…

Artificial Intelligence · Computer Science 2026-02-02 Wei Zhu , Zhiwen Tang , Kun Yue