Related papers: ECO: Enhanced Code Optimization via Performance-Aw…

ECO: An LLM-Driven Efficient Code Optimizer for Warehouse Scale Computers

With the end of Moore's Law, optimizing code for performance has become paramount for meeting ever-increasing compute demands, particularly in hyperscale data centers where even small efficiency gains translate to significant resource and…

Software Engineering · Computer Science 2025-03-21 Hannah Lin, Martin Maas, Maximilian Roquemore, Arman Hasanzadeh, Fred Lewis, Yusuf Simonson, Tzu-Wei Yang, Amir Yazdanbakhsh, Deniz Altinbüken, Florin Papa, Maggie Nolan Edmonds, Aditya Patil, Don Schwarz, Satish Chandra, Chris Kennelly, Milad Hashemi, Parthasarathy Ranganathan

Tuning LLM-based Code Optimization via Meta-Prompting: An Industrial Perspective

There is a growing interest in leveraging multiple large language models (LLMs) for automated code optimization. However, industrial platforms deploying multiple LLMs face a critical challenge: prompts optimized for one LLM often fail with…

Software Engineering · Computer Science 2025-10-06 Jingzhi Gong, Rafail Giavrimis, Paul Brookes, Vardan Voskanyan, Fan Wu, Mari Ashiga, Matthew Truscott, Mike Basios, Leslie Kanthan, Jie Xu, Zheng Wang

How to Auto-optimize Prompts for Domain Tasks? Adaptive Prompting and Reasoning through Evolutionary Domain Knowledge Adaptation

Designing optimal prompts and reasoning processes for large language models (LLMs) on domain-specific tasks is both necessary and challenging in real-world applications. Determining how to integrate domain knowledge, enhance reasoning…

Artificial Intelligence · Computer Science 2025-10-27 Yang Zhao, Pu Wang, Hao Frank Yang

CAPO: Cost-Aware Prompt Optimization

Large language models (LLMs) have revolutionized natural language processing by solving a wide range of tasks simply guided by a prompt. Yet their performance is highly sensitive to prompt formulation. While automatic prompt optimization…

Computation and Language · Computer Science 2025-06-18 Tom Zehle, Moritz Schlager, Timo Heiß, Matthias Feurer

ECCO: Can We Improve Model-Generated Code Efficiency Without Sacrificing Functional Correctness?

Although large language models (LLMs) have been largely successful in generating functionally correct programs, conditioning models to produce efficient solutions while ensuring correctness remains a challenge. Further, unreliability in…

Computation and Language · Computer Science 2024-10-11 Siddhant Waghjale, Vishruth Veerendranath, Zora Zhiruo Wang, Daniel Fried

Optimizing Code Runtime Performance through Context-Aware Retrieval-Augmented Generation

Optimizing software performance through automated code refinement offers a promising avenue for enhancing execution speed and efficiency. Despite recent advancements in LLMs, a significant gap remains in their ability to perform in-depth…

Software Engineering · Computer Science 2025-01-30 Manish Acharya, Yifan Zhang, Kevin Leach, Yu Huang

Self-Supervised Prompt Optimization

Well-designed prompts are crucial for enhancing Large language models' (LLMs) reasoning capabilities while aligning their outputs with task requirements across diverse domains. However, manually designed prompts require expertise and…

Computation and Language · Computer Science 2025-08-22 Jinyu Xiang, Jiayi Zhang, Zhaoyang Yu, Xinbing Liang, Fengwei Teng, Jinhao Tu, Fashen Ren, Xiangru Tang, Sirui Hong, Chenglin Wu, Yuyu Luo

Optimizing Prompts for Large Language Models: A Causal Approach

Large Language Models (LLMs) are increasingly embedded in enterprise workflows, yet their performance remains highly sensitive to prompt design. Automatic Prompt Optimization (APO) seeks to mitigate this instability, but existing approaches…

Artificial Intelligence · Computer Science 2026-02-03 Wei Chen, Yanbin Fang, Shuran Fu, Fasheng Xu, Xuan Wei

Prompt Codebooks: Discrete Compositional Optimization for Language Model Instruction Refinement

Automatic prompt optimization (APO) has driven significant gains in LLM-based agentic workflows. However, existing methods treat each task's prompt as a monolithic, instance-blind string optimized through global edits, producing brittle…

Artificial Intelligence · Computer Science 2026-05-28 Jyotirmoy Nath, Neeraj Kumar, Brejesh Lall

iPOE: Interpretable Prompt Optimization via Explanations

Prompt optimization has often been framed as a discrete search problem to find high-performing and robust instructions for an LLM. However, the search result might not make it transparent why and where specific prompt changes lead to…

Computation and Language · Computer Science 2026-05-28 Jiahui Li, Yarik Menchaca Resendiz, Sean Papay, Roman Klinger

Orchestration for Domain-specific Edge-Cloud Language Models

The remarkable performance of Large Language Models (LLMs) has inspired many applications, which often necessitate edge-cloud collaboration due to connectivity, privacy, and cost considerations. Traditional methods primarily focus on…

Databases · Computer Science 2025-07-15 Prasoon Patidar, Alex Crown, Kevin Hsieh, Yifei Xu, Tusher Chakraborty, Ranveer Chandra, Yuvraj Agarwal

Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs

Reasoning is a fundamental component of language understanding. Recent prompting techniques, such as chain of thought, have consistently improved LLMs' performance on various reasoning tasks. Nevertheless, there is still little…

Computation and Language · Computer Science 2024-10-01 Haritz Puerto, Martin Tutek, Somak Aditya, Xiaodan Zhu, Iryna Gurevych

ELPO: Ensemble Learning Based Prompt Optimization for Large Language Models

The remarkable performance of Large Language Models (LLMs) highly relies on crafted prompts. However, manual prompt engineering is a laborious process, creating a core bottleneck for practical application of LLMs. This phenomenon has led to…

Computation and Language · Computer Science 2025-11-21 Qing Zhang, Bing Xu, Xudong Zhang, Yifan Shi, Yang Li, Chen Zhang, Yik Chung Wu, Ngai Wong, Yijie Chen, Hong Dai, Xiansen Chen, Mian Zhang

MaxCode: A Max-Reward Reinforcement Learning Framework for Automated Code Optimization

Large Language Models (LLMs) demonstrate strong capabilities in general coding tasks but encounter two key challenges when optimizing code: (i) the complexity of writing optimized code (such as performant CUDA kernels and competition-level…

Machine Learning · Computer Science 2026-01-12 Jiefu Ou, Sapana Chaudhary, Kaj Bostrom, Nathaniel Weir, Shuai Zhang, Huzefa Rangwala, George Karypis

MARCO: Multi-Agent Code Optimization with Real-Time Knowledge Integration for High-Performance Computing

Large language models (LLMs) have transformed software development through code generation capabilities, yet their effectiveness for high-performance computing (HPC) remains limited. HPC code requires specialized optimizations for…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-06-26 Asif Rahman, Veljko Cvetkovic, Kathleen Reece, Aidan Walters, Yasir Hassan, Aneesh Tummeti, Bryan Torres, Denise Cooney, Margaret Ellis, Dimitrios S. Nikolopoulos

In-context Demonstration Matters: On Prompt Optimization for Pseudo-Supervision Refinement

Large language models (LLMs) have achieved great success across diverse tasks, and fine-tuning is sometimes needed to further enhance generation quality. Most existing methods rely on human supervision or parameter retraining, both of which…

Computation and Language · Computer Science 2025-05-27 Zhen-Yu Zhang, Jiandong Zhang, Huaxiu Yao, Gang Niu, Masashi Sugiyama

ECO: Energy-Constrained Optimization with Reinforcement Learning for Humanoid Walking

Achieving stable and energy-efficient locomotion is essential for humanoid robots to operate continuously in real-world applications. Existing MPC and RL approaches often rely on energy-related metrics embedded within a multi-objective…

Robotics · Computer Science 2026-02-09 Weidong Huang, Jingwen Zhang, Jiongye Li, Shibowen Zhang, Jiayang Wu, Jiayi Wang, Hangxin Liu, Yaodong Yang, Yao Su

LLM-Guided Runtime Parameter Optimization for Energy-Efficient Model Inference

Large Language Models (LLMs) have become an integral part of many real-world workflows. However, LLMs consume a lot of energy, which becomes a large concern in the scale of the demand for these tools. As LLMs become integrated into…

Software Engineering · Computer Science 2026-05-01 Katelyn Crumpacker, Dimitrios Nikolopoulos

Towards Effective Code-Integrated Reasoning

In this paper, we investigate code-integrated reasoning, where models generate code when necessary and integrate feedback by executing it through a code interpreter. To acquire this capability, models must learn when and how to use external…

Computation and Language · Computer Science 2025-06-02 Fei Bai, Yingqian Min, Beichen Zhang, Zhipeng Chen, Wayne Xin Zhao, Lei Fang, Zheng Liu, Zhongyuan Wang, Ji-Rong Wen

COPO: Consistency-Aware Policy Optimization

Reinforcement learning has significantly enhanced the reasoning capabilities of Large Language Models (LLMs) in complex problem-solving tasks. Recently, the introduction of DeepSeek R1 has inspired a surge of interest in leveraging…

Machine Learning · Computer Science 2025-08-07 Jinghang Han, Jiawei Chen, Hang Shao, Hao Ma, Mingcheng Li, Xintian Shen, Lihao Zheng, Wei Chen, Tao Wei, Lihua Zhang