Related papers: ECO: An LLM-Driven Efficient Code Optimizer for Wa…

ECO: Enhanced Code Optimization via Performance-Aware Prompting for Code-LLMs

Code runtime optimization-the task of rewriting a given code to a faster one-remains challenging, as it requires reasoning about performance trade-offs involving algorithmic and structural choices. Recent approaches employ code-LLMs with…

Programming Languages · Computer Science 2025-10-14 Su-Hyeon Kim , Joonghyuk Hahn , Sooyoung Cha , Yo-Sub Han

MARCO: Multi-Agent Code Optimization with Real-Time Knowledge Integration for High-Performance Computing

Large language models (LLMs) have transformed software development through code generation capabilities, yet their effectiveness for high-performance computing (HPC) remains limited. HPC code requires specialized optimizations for…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-06-26 Asif Rahman , Veljko Cvetkovic , Kathleen Reece , Aidan Walters , Yasir Hassan , Aneesh Tummeti , Bryan Torres , Denise Cooney , Margaret Ellis , Dimitrios S. Nikolopoulos

E-code: Mastering Efficient Code Generation through Pretrained Models and Expert Encoder Group

Context: With the waning of Moore's Law, the software industry is placing increasing importance on finding alternative solutions for continuous performance enhancement. The significance and research results of software performance…

Software Engineering · Computer Science 2024-08-26 Yue Pan , Chen Lyu , Zhenyu Yang , Lantian Li , Qi Liu , Xiuting Shao

Learning Performance-Improving Code Edits

With the decline of Moore's law, optimizing program performance has become a major focus of software research. However, high-level optimizations such as API and algorithm changes remain elusive due to the difficulty of understanding the…

Software Engineering · Computer Science 2024-04-29 Alexander Shypula , Aman Madaan , Yimeng Zeng , Uri Alon , Jacob Gardner , Milad Hashemi , Graham Neubig , Parthasarathy Ranganathan , Osbert Bastani , Amir Yazdanbakhsh

ECCO: Can We Improve Model-Generated Code Efficiency Without Sacrificing Functional Correctness?

Although large language models (LLMs) have been largely successful in generating functionally correct programs, conditioning models to produce efficient solutions while ensuring correctness remains a challenge. Further, unreliability in…

Computation and Language · Computer Science 2024-10-11 Siddhant Waghjale , Vishruth Veerendranath , Zora Zhiruo Wang , Daniel Fried

Learning to Improve Code Efficiency

Improvements in the performance of computing systems, driven by Moore's Law, have transformed society. As such hardware-driven gains slow down, it becomes even more important for software developers to focus on performance and efficiency…

Software Engineering · Computer Science 2022-08-11 Binghong Chen , Daniel Tarlow , Kevin Swersky , Martin Maas , Pablo Heiber , Ashish Naik , Milad Hashemi , Parthasarathy Ranganathan

Large Language Models for Energy-Efficient Code: Emerging Results and Future Directions

Energy-efficient software helps improve mobile device experiences and reduce the carbon footprint of data centers. However, energy goals are often de-prioritized in order to meet other requirements. We take inspiration from recent work…

Software Engineering · Computer Science 2024-10-15 Huiyun Peng , Arjun Gupte , Nicholas John Eliopoulos , Chien Chou Ho , Rishi Mantri , Leo Deng , Wenxin Jiang , Yung-Hsiang Lu , Konstantin Läufer , George K. Thiruvathukal , James C. Davis

ECO: Quantized Training without Full-Precision Master Weights

Quantization has significantly improved the compute and memory efficiency of Large Language Model (LLM) training. However, existing approaches still rely on accumulating their updates in high-precision: concretely, gradient updates must be…

Computation and Language · Computer Science 2026-01-30 Mahdi Nikdan , Amir Zandieh , Dan Alistarh , Vahab Mirrokni

Can We Make Code Green? Understanding Trade-Offs in LLMs vs. Human Code Optimizations

The rapid technological evolution has accelerated software development for various domains and use cases, contributing to a growing share of global carbon emissions. While recent large language models (LLMs) claim to assist developers in…

Software Engineering · Computer Science 2025-03-27 Pooja Rani , Jan-Andrea Bard , June Sallou , Alexander Boll , Timo Kehrer , Alberto Bacchelli

Orchestration for Domain-specific Edge-Cloud Language Models

The remarkable performance of Large Language Models (LLMs) has inspired many applications, which often necessitate edge-cloud collaboration due to connectivity, privacy, and cost considerations. Traditional methods primarily focus on…

Databases · Computer Science 2025-07-15 Prasoon Patidar , Alex Crown , Kevin Hsieh , Yifei Xu , Tusher Chakraborty , Ranveer Chandra , Yuvraj Agarwal

GREEN-CODE: Learning to Optimize Energy Efficiency in LLM-based Code Generation

Large Language Models (LLMs) are becoming integral to daily life, showcasing their vast potential across various Natural Language Processing (NLP) tasks. Beyond NLP, LLMs are increasingly used in software development tasks, such as code…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-03-24 Shashikant Ilager , Lukas Florian Briem , Ivona Brandic

MaxCode: A Max-Reward Reinforcement Learning Framework for Automated Code Optimization

Large Language Models (LLMs) demonstrate strong capabilities in general coding tasks but encounter two key challenges when optimizing code: (i) the complexity of writing optimized code (such as performant CUDA kernels and competition-level…

Machine Learning · Computer Science 2026-01-12 Jiefu Ou , Sapana Chaudhary , Kaj Bostrom , Nathaniel Weir , Shuai Zhang , Huzefa Rangwala , George Karypis

Ecco: Improving Memory Bandwidth and Capacity for LLMs via Entropy-aware Cache Compression

Large language models (LLMs) have demonstrated transformative capabilities across diverse artificial intelligence applications, yet their deployment is hindered by substantial memory and computational demands, especially in…

Hardware Architecture · Computer Science 2025-05-13 Feng Cheng , Cong Guo , Chiyue Wei , Junyao Zhang , Changchun Zhou , Edward Hanson , Jiaqi Zhang , Xiaoxiao Liu , Hai "Helen" Li , Yiran Chen

LLM4EFFI: Leveraging Large Language Models to Enhance Code Efficiency and Correctness

Large Language Models (LLMs), particularly Code LLMs, have demonstrated impressive performance in code generation. Current research primarily focuses on the correctness of generated code, while efficiency remains less explored. Recent works…

Software Engineering · Computer Science 2025-02-27 Tong Ye , Weigang Huang , Xuhong Zhang , Tengfei Ma , Peiyu Liu , Jianwei Yin , Wenhai Wang

Tuning LLM-based Code Optimization via Meta-Prompting: An Industrial Perspective

There is a growing interest in leveraging multiple large language models (LLMs) for automated code optimization. However, industrial platforms deploying multiple LLMs face a critical challenge: prompts optimized for one LLM often fail with…

Software Engineering · Computer Science 2025-10-06 Jingzhi Gong , Rafail Giavrimis , Paul Brookes , Vardan Voskanyan , Fan Wu , Mari Ashiga , Matthew Truscott , Mike Basios , Leslie Kanthan , Jie Xu , Zheng Wang

PEACE: Towards Efficient Project-Level Efficiency Optimization via Hybrid Code Editing

Large Language Models (LLMs) have demonstrated significant capability in code generation, but their potential in code efficiency optimization remains underexplored. Previous LLM-based code efficiency optimization approaches exclusively…

Software Engineering · Computer Science 2025-10-22 Xiaoxue Ren , Jun Wan , Yun Peng , Zhongxin Liu , Ming Liang , Dajun Chen , Wei Jiang , Yong Li

Leveraging LLMs to Automate Energy-Aware Refactoring of Parallel Scientific Codes

Large language models (LLMs) are increasingly used for generating parallel scientific codes, with a primary focus on generating functionally correct code. Recent work has focused on generating performant code, with an emphasis on its…

Artificial Intelligence · Computer Science 2026-05-12 Matthew T. Dearing , Yiheng Tao , Xingfu Wu , Zhiling Lan , Valerie Taylor

ELO: Efficient Layer-Specific Optimization for Continual Pretraining of Multilingual LLMs

We propose an efficient layer-specific optimization (ELO) method designed to enhance continual pretraining (CP) for specific languages in multilingual large language models (MLLMs). This approach addresses the common challenges of high…

Computation and Language · Computer Science 2026-01-21 HanGyeol Yoo , ChangSu Choi , Minjun Kim , Seohyun Song , SeungWoo Song , Inho Won , Jongyoul Park , Cheoneum Park , KyungTae Lim

ECO: Energy-Constrained Optimization with Reinforcement Learning for Humanoid Walking

Achieving stable and energy-efficient locomotion is essential for humanoid robots to operate continuously in real-world applications. Existing MPC and RL approaches often rely on energy-related metrics embedded within a multi-objective…

Robotics · Computer Science 2026-02-09 Weidong Huang , Jingwen Zhang , Jiongye Li , Shibowen Zhang , Jiayang Wu , Jiayi Wang , Hangxin Liu , Yaodong Yang , Yao Su

Performance-Aligned LLMs for Generating Fast Code

Optimizing scientific software is a difficult task because codebases are often large and complex, and performance can depend upon several factors including the algorithm, its implementation, and hardware among others. Causes of poor…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-04-30 Daniel Nichols , Pranav Polasam , Harshitha Menon , Aniruddha Marathe , Todd Gamblin , Abhinav Bhatele