Related papers: FlowCompile: An Optimizing Compiler for Structured…

An LLM Compiler for Parallel Function Calling

The reasoning capabilities of the recent LLMs enable them to execute external function calls to overcome their inherent limitations, such as knowledge cutoffs, poor arithmetic skills, or lack of access to private data. This development has…

Computation and Language · Computer Science 2024-06-06 Sehoon Kim , Suhong Moon , Ryan Tabrizi , Nicholas Lee , Michael W. Mahoney , Kurt Keutzer , Amir Gholami

PlanCompiler: A Deterministic Compilation Architecture for Structured Multi-Step LLM Pipelines

Large language models (LLMs) remain brittle in multi-step structured workflows, where errors compound across sequential transformations, validation stages, and stateful operations such as SQL persistence. We present PlanCompiler, a…

Software Engineering · Computer Science 2026-04-16 Pranav Harikumar

REASONING COMPILER: LLM-Guided Optimizations for Efficient Model Serving

While model serving has unlocked unprecedented capabilities, the high cost of serving large-scale models continues to be a significant barrier to widespread accessibility and rapid innovation. Compiler optimizations have long driven…

Machine Learning · Computer Science 2026-02-05 Annabelle Sujun Tang , Christopher Priebe , Rohan Mahapatra , Lianhui Qin , Hadi Esmaeilzadeh

From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents

Large language model (LLM)-based systems are becoming increasingly popular for solving tasks by constructing executable workflows that interleave LLM calls, information retrieval, tool use, code execution, memory updates, and verification.…

Artificial Intelligence · Computer Science 2026-03-25 Ling Yue , Kushal Raj Bhandari , Ching-Yun Ko , Dhaval Patel , Shuxin Lin , Nianjun Zhou , Jianxi Gao , Pin-Yu Chen , Shaowu Pan

FlowBot: Inducing LLM Workflows with Bilevel Optimization and Textual Gradients

LLM workflows, which coordinate structured calls to individual LLMs/agents to achieve a particular goal, offer a promising path towards building powerful AI systems that can tackle diverse tasks. However, existing approaches for building…

Computation and Language · Computer Science 2026-05-04 Hongyeon Yu , Young-Bum Kim , Yoon Kim

FlowMind: Execute-Summarize for Structured Workflow Generation from LLM Reasoning

LLMs can solve complex tasks through reasoning and tool use, but accurately translating these solutions into structured workflows remains challenging. We model workflows as sequences of tool use and reformulate the problem as designing a…

Artificial Intelligence · Computer Science 2026-02-13 Yihao Liu , Ziyun Zhang , Zile He , Huaqian Cai

Agentic Code Optimization via Compiler-LLM Cooperation

Generating performant executables from high level languages is critical to software performance across a wide range of domains. Modern compilers perform this task by passing code through a series of well-studied optimizations at…

Programming Languages · Computer Science 2026-04-07 Benjamin Mikek , Danylo Vashchilenko , Bryan Lu , Panpan Xu

LiteCoOp: Lightweight Multi-LLM Shared-Tree Reasoning for Model-Serving Compiler Optimizations

LLM-guided compiler optimization has recently shown promise, but existing approaches rely on a single large LLM throughout search, making them expensive and excluding smaller models. We pose the research question: whether heterogeneous LLMs…

Machine Learning · Computer Science 2026-05-22 Annabelle Sujun Tang , Christopher Priebe , Lianhui Qin , Hadi Esmaeilzadeh

CompileAgent: Automated Real-World Repo-Level Compilation with Tool-Integrated LLM-based Agent System

With open-source projects growing in size and complexity, manual compilation becomes tedious and error-prone, highlighting the need for automation to improve efficiency and accuracy. However, the complexity of compilation instruction search…

Software Engineering · Computer Science 2025-05-08 Li Hu , Guoqiang Chen , Xiuwei Shang , Shaoyin Cheng , Benlong Wu , Gangyang Li , Xu Zhu , Weiming Zhang , Nenghai Yu

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

The rapidly growing demand for high-quality data in Large Language Models (LLMs) has intensified the need for scalable, reliable, and semantically rich data preparation pipelines. However, current practices remain dominated by ad-hoc…

Machine Learning · Computer Science 2025-12-19 Hao Liang , Xiaochen Ma , Zhou Liu , Zhen Hao Wong , Zhengyang Zhao , Zimo Meng , Runming He , Chengyu Shen , Qifeng Cai , Zhaoyang Han , Meiyi Qiang , Yalin Feng , Tianyi Bai , Zewei Pan , Ziyi Guo , Yizhen Jiang , Jingwen Deng , Qijie You , Peichao Lai , Tianyu Guo , Chi Hsu Tsai , Hengyi Feng , Rui Hu , Wenkai Yu , Junbo Niu , Bohan Zeng , Ruichuan An , Lu Ma , Jihao Huang , Yaowei Zheng , Conghui He , Linpeng Tang , Bin Cui , Weinan E , Wentao Zhang

StreamTensor: Make Tensors Stream in Dataflow Accelerators for LLMs

Efficient execution of deep learning workloads on dataflow architectures is crucial for overcoming memory bottlenecks and maximizing performance. While streaming intermediate results between computation kernels can significantly improve…

Hardware Architecture · Computer Science 2025-09-24 Hanchen Ye , Deming Chen

CompILE: Compositional Imitation Learning and Execution

We introduce Compositional Imitation Learning and Execution (CompILE): a framework for learning reusable, variable-length segments of hierarchically-structured behavior from demonstration data. CompILE uses a novel unsupervised,…

Machine Learning · Statistics 2019-05-15 Thomas Kipf , Yujia Li , Hanjun Dai , Vinicius Zambaldi , Alvaro Sanchez-Gonzalez , Edward Grefenstette , Pushmeet Kohli , Peter Battaglia

The New Compiler Stack: A Survey on the Synergy of LLMs and Compilers

This survey has provided a systematic overview of the emerging field of LLM-enabled compilation by addressing several key research questions. We first answered how LLMs are being integrated by proposing a comprehensive, multi-dimensional…

Programming Languages · Computer Science 2026-01-06 Shuoming Zhang , Jiacheng Zhao , Qiuchu Yu , Chunwei Xia , Zheng Wang , Xiaobing Feng , Huimin Cui

AwareCompiler: Agentic Context-Aware Compiler Optimization via a Synergistic Knowledge-Data Driven Framework

Compiler optimization is crucial for enhancing program performance by transforming the sequence of optimization passes while maintaining correctness. Despite the promising potential of large language models (LLMs)-based agent for software…

Programming Languages · Computer Science 2025-10-15 Hongyu Lin , Haolin Pan , Haoran Luo , Yuchen Li , Kaichun Yao , Libo Zhang , Mingjie Xing , Yanjun Wu

An LLM-Tool Compiler for Fused Parallel Function Calling

State-of-the-art sequential reasoning in Large Language Models (LLMs) has expanded the capabilities of Copilots beyond conversational tasks to complex function calling, managing thousands of API calls. However, the tendency of compositional…

Programming Languages · Computer Science 2024-05-29 Simranjit Singh , Andreas Karatzas , Michael Fore , Iraklis Anagnostopoulos , Dimitrios Stamoulis

FlowAgent: Achieving Compliance and Flexibility for Workflow Agents

The integration of workflows with large language models (LLMs) enables LLM-based agents to execute predefined procedures, enhancing automation in real-world applications. Traditional rule-based methods tend to limit the inherent flexibility…

Artificial Intelligence · Computer Science 2025-02-21 Yuchen Shi , Siqi Cai , Zihan Xu , Yuei Qin , Gang Li , Hang Shao , Jiawei Chen , Deqing Yang , Ke Li , Xing Sun

Query Optimization Beyond Data Systems: The Case for Multi-Agent Systems

The proliferation of large language models (LLMs) has accelerated the adoption of agent-based workflows, where multiple autonomous agents reason, invoke functions, and collaborate to compose complex data pipelines. However, current…

Databases · Computer Science 2025-12-15 Zoi Kaoudi , Ioana Giurgiu

nncase: An End-to-End Compiler for Efficient LLM Deployment on Heterogeneous Storage Architectures

The efficient deployment of large language models (LLMs) is hindered by memory architecture heterogeneity, where traditional compilers suffer from fragmented workflows and high adaptation costs. We present nncase, an open-source, end-to-end…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-12-29 Hui Guo , Qihang Zheng , Chenghai Huo , Dongliang Guo , Haoqi Yang , Yang Zhang

Compiled AI: Deterministic Code Generation for LLM-Based Workflow Automation

We study compiled AI, a paradigm in which large language models generate executable code artifacts during a compilation phase, after which workflows execute deterministically without further model invocation. This paradigm has antecedents…

Software Engineering · Computer Science 2026-04-08 Geert Trooskens , Aaron Karlsberg , Anmol Sharma , Lamara De Brouwer , Max Van Puyvelde , Matthew Young , John Thickstun , Gil Alterovitz , Walter A. De Brouwer

CompilerGPT: Leveraging Large Language Models for Analyzing and Acting on Compiler Optimization Reports

Current compiler optimization reports often present complex, technical information that is difficult for programmers to interpret and act upon effectively. This paper assesses the capability of large language models (LLM) to understand…

Programming Languages · Computer Science 2025-06-16 Peter Pirkelbauer , Chunhua Liao