Related papers: RuleFlow : Generating Reusable Program Optimizatio…

Learning Performance-Improving Code Edits

With the decline of Moore's law, optimizing program performance has become a major focus of software research. However, high-level optimizations such as API and algorithm changes remain elusive due to the difficulty of understanding the…

Software Engineering · Computer Science 2024-04-29 Alexander Shypula , Aman Madaan , Yimeng Zeng , Uri Alon , Jacob Gardner , Milad Hashemi , Graham Neubig , Parthasarathy Ranganathan , Osbert Bastani , Amir Yazdanbakhsh

SysLLMatic: Large Language Models are Software System Optimizers

Automatic software system optimization can improve software speed, reduce operating costs, and save energy. Traditional approaches to optimization rely on manual tuning and compiler heuristics, limiting their ability to generalize across…

Software Engineering · Computer Science 2026-05-07 Huiyun Peng , Arjun Gupte , Ryan Hasler , Nicholas John Eliopoulos , Chien-Chou Ho , Rishi Mantri , Leo Deng , Konstantin Läufer , George K. Thiruvathukal , James C. Davis

FasterPy: An LLM-based Code Execution Efficiency Optimization Framework

Code often suffers from performance bugs. These bugs necessitate the research and practice of code optimization. Traditional rule-based methods rely on manually designing and maintaining rules for specific performance bugs (e.g., redundant…

Software Engineering · Computer Science 2025-12-30 Yue Wu , Minghao Han , Ruiyin Li , Peng Liang , Amjed Tahir , Zengyang Li , Qiong Feng , Mojtaba Shahin

Optimas: An Intelligent Analytics-Informed Generative AI Framework for Performance Optimization

Large language models (LLMs) show promise for automated code optimization. However, without performance context, they struggle to produce correct and effective code transformations. Existing performance tools can identify bottlenecks but…

Performance · Computer Science 2026-04-28 Mohammad Zaeed , Tanzima Z. Islam , Vladimir Indic

Drawing Pandas: A Benchmark for LLMs in Generating Plotting Code

This paper introduces the human-curated PandasPlotBench dataset, designed to evaluate language models' effectiveness as assistants in visual data exploration. Our benchmark focuses on generating code for visualizing tabular data - such as a…

Software Engineering · Computer Science 2025-02-27 Timur Galimzyanov , Sergey Titov , Yaroslav Golubev , Egor Bogomolov

From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs

Large Language Models (LLMs) have significantly advanced artificial intelligence by optimizing traditional Natural Language Processing (NLP) workflows, facilitating their integration into various systems. Many such NLP systems, including…

Computation and Language · Computer Science 2025-05-13 Jiliang Ni , Jiachen Pu , Zhongyi Yang , Kun Zhou , Hui Wang , Xiaoliang Xiao , Dakui Wang , Xin Li , Jingfeng Luo , Conggang Hu

Rethinking LLM-Based RTL Code Optimization Via Timing Logic Metamorphosis

Register Transfer Level(RTL) code optimization is crucial for achieving high performance and low power consumption in digital circuit design. However, traditional optimization methods often rely on manual tuning and heuristics, which can be…

Software Engineering · Computer Science 2025-07-23 Zhihao Xu , Bixin Li , Lulu Wang

Performance-Aligned LLMs for Generating Fast Code

Optimizing scientific software is a difficult task because codebases are often large and complex, and performance can depend upon several factors including the algorithm, its implementation, and hardware among others. Causes of poor…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-04-30 Daniel Nichols , Pranav Polasam , Harshitha Menon , Aniruddha Marathe , Todd Gamblin , Abhinav Bhatele

Search-Based LLMs for Code Optimization

The code written by developers usually suffers from efficiency problems and contain various performance bugs. These inefficiencies necessitate the research of automated refactoring methods for code optimization. Early research in code…

Software Engineering · Computer Science 2024-08-23 Shuzheng Gao , Cuiyun Gao , Wenchao Gu , Michael Lyu

LOOPRAG: Enhancing Loop Transformation Optimization with Retrieval-Augmented Large Language Models

Loop transformations are semantics-preserving optimization techniques, widely used to maximize objectives such as parallelism. Despite decades of research, applying the optimal composition of loop transformations remains challenging due to…

Programming Languages · Computer Science 2025-12-19 Yijie Zhi , Yayu Cao , Jianhua Dai , Xiaoyang Han , Jingwen Pu , Qingran Wu , Sheng Cheng , Ming Cai

From Large to Small: Transferring CUDA Optimization Expertise via Reasoning Graph

Despite significant evolution of CUDA programming and domain-specific libraries, effectively utilizing GPUs with massively parallel engines remains difficult. Large language models (LLMs) show strong potential in generating optimized CUDA…

Machine Learning · Computer Science 2025-10-24 Junfeng Gong , Zhiyi Wei , Junying Chen , Cheng Liu , Huawei Li

EditFlow: Benchmarking and Optimizing Code Edit Recommendation Systems via Reconstruction of Developer Flows

Large language models (LLMs) for code editing have achieved remarkable progress, yet recent empirical studies reveal a fundamental disconnect between technical accuracy and developer productivity. Despite their strong benchmark performance,…

Software Engineering · Computer Science 2026-04-09 Chenyan Liu , Yun Lin , Jiaxin Chang , Jiawei Liu , Binhang Qi , Bo Jiang , Zhiyong Huang , Jin Song Dong

Query Rewriting via LLMs

When complex SQL queries suffer slow executions despite query optimization, DBAs typically invoke automated query rewriting tools to recommend ``lean'' equivalents that are conducive to faster execution. The rewritings are usually achieved…

Databases · Computer Science 2025-09-03 Sriram Dharwada , Himanshu Devrani , Jayant Haritsa , Harish Doraiswamy

Agentic Code Optimization via Compiler-LLM Cooperation

Generating performant executables from high level languages is critical to software performance across a wide range of domains. Modern compilers perform this task by passing code through a series of well-studied optimizations at…

Programming Languages · Computer Science 2026-04-07 Benjamin Mikek , Danylo Vashchilenko , Bryan Lu , Panpan Xu

LLMDFA: Analyzing Dataflow in Code with Large Language Models

Dataflow analysis is a fundamental code analysis technique that identifies dependencies between program values. Traditional approaches typically necessitate successful compilation and expert customization, hindering their applicability and…

Programming Languages · Computer Science 2024-11-26 Chengpeng Wang , Wuqi Zhang , Zian Su , Xiangzhe Xu , Xiaoheng Xie , Xiangyu Zhang

Dias: Dynamic Rewriting of Pandas Code

In recent years, dataframe libraries, such as pandas have exploded in popularity. Due to their flexibility, they are increasingly used in ad-hoc exploratory data analysis (EDA) workloads. These workloads are diverse, including custom…

Databases · Computer Science 2024-06-12 Stefanos Baziotis , Daniel Kang , Charith Mendis

Automatic Optimizations for Stream-based Monitoring Languages

Runtime monitors that are specified in a stream-based monitoring language tend to be easier to understand, maintain, and reuse than those written in a standard programming language. Because of their formal semantics, such specification…

Formal Languages and Automata Theory · Computer Science 2020-11-30 Jan Baumeister , Bernd Finkbeiner , Matthis Kruse , Maximilian Schwenger

HDFlow: Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows

Despite recent advancements in large language models (LLMs), their performance on complex reasoning problems requiring multi-step thinking and combining various skills is still limited. To address this, we propose a novel framework HDFlow…

Computation and Language · Computer Science 2024-09-27 Wenlin Yao , Haitao Mi , Dong Yu

SemOpt: LLM-Driven Code Optimization via Rule-Based Analysis

Automated code optimization aims to improve performance in programs by refactoring code, and recent studies focus on utilizing LLMs for the optimization. Typical existing approaches mine optimization commits from open-source codebases to…

Software Engineering · Computer Science 2025-10-21 Yuwei Zhao , Yuan-An Xiao , Qianyu Xiao , Zhao Zhang , Yingfei Xiong

stratum: A System Infrastructure for Massive Agent-Centric ML Workloads

Recent advances in large language models (LLMs) transform how machine learning (ML) pipelines are developed and evaluated. LLMs enable a new type of workload, agentic pipeline search, in which autonomous or semi-autonomous agents generate,…

Databases · Computer Science 2026-03-06 Arnab Phani , Elias Strauss , Sebastian Schelter