Related papers: Code-Optimise: Self-Generated Preference Data for …

CodeDPO: Aligning Code Models with Self Generated and Verified Source Code

Code generation models have shown significant potential for programming tasks. However, existing training methods like supervised fine-tuning face key limitations: they do not effectively teach models to prioritize correct over incorrect…

Software Engineering · Computer Science 2025-06-04 Kechi Zhang , Ge Li , Yihong Dong , Jingjing Xu , Jun Zhang , Jing Su , Yongfei Liu , Zhi Jin

Focused-DPO: Enhancing Code Generation Through Focused Preference Optimization on Error-Prone Points

Code generation models have shown significant potential for automating programming tasks. However, the challenge of generating accurate and reliable code persists due to the highly complex and long-reasoning nature of the task. Even…

Software Engineering · Computer Science 2025-06-04 Kechi Zhang , Ge Li , Jia Li , Yihong Dong , Jia Li , Zhi Jin

Learning to superoptimize programs

Code super-optimization is the task of transforming any given program to a more efficient version while preserving its input-output behaviour. In some sense, it is similar to the paraphrase problem from natural language processing where the…

Machine Learning · Computer Science 2017-06-29 Rudy Bunel , Alban Desmaison , M. Pawan Kumar , Philip H. S. Torr , Pushmeet Kohli

Optimizing Code Runtime Performance through Context-Aware Retrieval-Augmented Generation

Optimizing software performance through automated code refinement offers a promising avenue for enhancing execution speed and efficiency. Despite recent advancements in LLMs, a significant gap remains in their ability to perform in-depth…

Software Engineering · Computer Science 2025-01-30 Manish Acharya , Yifan Zhang , Kevin Leach , Yu Huang

A Data-driven Analysis of Code Optimizations

As the demand for computational power grows, optimizing code through compilers becomes increasingly crucial. In this context, we focus on fully automatic code optimization techniques that automate the process of selecting and applying code…

Programming Languages · Computer Science 2025-11-11 Yacine Hakimi , Riyadh Baghdadi

Thinking Before Running! Efficient Code Generation with Thorough Exploration and Optimal Refinement

Code generation is crucial in software engineering for automating the coding process efficiently. While test-time computation methods show promise, they suffer from high latency due to multiple computation rounds. To overcome this, we…

Software Engineering · Computer Science 2025-05-28 Xiaoqing Zhang , Yuhan Liu , Flood Sung , Xiuying Chen , Shuo Shang , Rui Yan

Towards Better Correctness and Efficiency in Code Generation

While code large language models have demonstrated remarkable progress in code generation, the generated code often exhibits poor runtime efficiency, limiting its practical application in performance-sensitive scenarios. To address this…

Software Engineering · Computer Science 2025-08-29 Yunlong Feng , Yang Xu , Xiao Xu , Binyuan Hui , Junyang Lin

Teaching Your Models to Understand Code via Focal Preference Alignment

Preference learning extends the performance of Code LLMs beyond traditional supervised fine-tuning by leveraging relative quality comparisons. In existing approaches, a set of n candidate solutions is evaluated based on test case success…

Computation and Language · Computer Science 2025-10-10 Jie Wu , Haoling Li , Xin Zhang , Xiao Liu , Yangyu Huang , Jianwen Luo , Yizhen Zhang , Zuchao Li , Ruihang Chu , Yujiu Yang , Scarlett Li

CodeT: Code Generation with Generated Tests

The task of generating code solutions for a given programming problem can benefit from the use of pre-trained language models such as Codex, which can produce multiple diverse samples. However, a major challenge for this task is to select…

Computation and Language · Computer Science 2022-11-24 Bei Chen , Fengji Zhang , Anh Nguyen , Daoguang Zan , Zeqi Lin , Jian-Guang Lou , Weizhu Chen

Optimas: An Intelligent Analytics-Informed Generative AI Framework for Performance Optimization

Large language models (LLMs) show promise for automated code optimization. However, without performance context, they struggle to produce correct and effective code transformations. Existing performance tools can identify bottlenecks but…

Performance · Computer Science 2026-04-28 Mohammad Zaeed , Tanzima Z. Islam , Vladimir Indic

AP2O-Coder: Adaptively Progressive Preference Optimization for Reducing Compilation and Runtime Errors in LLM-Generated Code

LLMs' code generation capabilities have yielded substantial improvements in the effectiveness of programming tasks. However, LLM-generated code still suffers from compilation and runtime errors. Existing offline preference optimization…

Software Engineering · Computer Science 2026-01-09 Jianqing Zhang , Wei Xia , Hande Dong , Qiang Lin , Jian Cao

MaxCode: A Max-Reward Reinforcement Learning Framework for Automated Code Optimization

Large Language Models (LLMs) demonstrate strong capabilities in general coding tasks but encounter two key challenges when optimizing code: (i) the complexity of writing optimized code (such as performant CUDA kernels and competition-level…

Machine Learning · Computer Science 2026-01-12 Jiefu Ou , Sapana Chaudhary , Kaj Bostrom , Nathaniel Weir , Shuai Zhang , Huzefa Rangwala , George Karypis

ECO: Enhanced Code Optimization via Performance-Aware Prompting for Code-LLMs

Code runtime optimization-the task of rewriting a given code to a faster one-remains challenging, as it requires reasoning about performance trade-offs involving algorithmic and structural choices. Recent approaches employ code-LLMs with…

Programming Languages · Computer Science 2025-10-14 Su-Hyeon Kim , Joonghyuk Hahn , Sooyoung Cha , Yo-Sub Han

Top Pass: Improve Code Generation by Pass@k-Maximized Code Ranking

Code generation has been greatly enhanced by the profound advancements in Large Language Models (LLMs) recently. Nevertheless, such LLM-based code generation approaches still struggle to generate error-free code in a few tries when faced…

Artificial Intelligence · Computer Science 2024-08-13 Zhi-Cun Lyu , Xin-Ye Li , Zheng Xie , Ming Li

Learning Code Preference via Synthetic Evolution

Large Language Models (LLMs) have recently demonstrated remarkable coding capabilities. However, assessing code generation based on well-formed properties and aligning it with developer preferences remains challenging. In this paper, we…

Machine Learning · Computer Science 2024-10-25 Jiawei Liu , Thanh Nguyen , Mingyue Shang , Hantian Ding , Xiaopeng Li , Yu Yu , Varun Kumar , Zijian Wang

Bias and Error Mitigation in Software-Generated Data: An Advanced Search and Optimization Framework Leveraging Generative Code Models

Data generation and analysis is a fundamental aspect of many industries and disciplines, from strategic decision making in business to research in the physical and social sciences. However, data generated using software and algorithms can…

Software Engineering · Computer Science 2023-10-19 Ernesto Giralt Hernández

Investigating Execution-Aware Language Models for Code Optimization

Code optimization is the process of enhancing code efficiency, while preserving its intended functionality. This process often requires a deep understanding of the code execution behavior at run-time to identify and address inefficiencies…

Software Engineering · Computer Science 2026-04-02 Federico Di Menna , Luca Traini , Gabriele Bavota , Vittorio Cortellessa

CYCLE: Learning to Self-Refine the Code Generation

Pre-trained code language models have achieved promising performance in code generation and improved the programming efficiency of human developers. However, their self-refinement capability is typically overlooked by the existing…

Software Engineering · Computer Science 2024-03-28 Yangruibo Ding , Marcus J. Min , Gail Kaiser , Baishakhi Ray

Self-Improving Code Generation via Semantic Entropy and Behavioral Consensus

Improving the code generation capabilities of large language models (LLMs) typically relies on supervised fine-tuning or preference optimization, both of which require costly external resources such as powerful teacher models or reliable…

Software Engineering · Computer Science 2026-04-01 Huan Zhang , Wei Cheng , Wei Hu

LLM4EFFI: Leveraging Large Language Models to Enhance Code Efficiency and Correctness

Large Language Models (LLMs), particularly Code LLMs, have demonstrated impressive performance in code generation. Current research primarily focuses on the correctness of generated code, while efficiency remains less explored. Recent works…

Software Engineering · Computer Science 2025-02-27 Tong Ye , Weigang Huang , Xuhong Zhang , Tengfei Ma , Peiyu Liu , Jianwei Yin , Wenhai Wang