PerfRL: A Small Language Model Framework for Efficient Code Optimization

Shukai Duan; Nikos Kanakaris; Xiongye Xiao; Heng Ping; Chenyu Zhou; Nesreen K. Ahmed; Guixiang Ma; Mihai Capota; Theodore L. Willke; Shahin Nazarian; Paul Bogdan

PerfRL: A Small Language Model Framework for Efficient Code Optimization

Machine Learning 2025-03-11 v2 Artificial Intelligence Programming Languages Software Engineering

Authors: Shukai Duan , Nikos Kanakaris , Xiongye Xiao , Heng Ping , Chenyu Zhou , Nesreen K. Ahmed , Guixiang Ma , Mihai Capota , Theodore L. Willke , Shahin Nazarian , Paul Bogdan

View on arXiv ↗ PDF ↗

Abstract

Code optimization is a challenging task requiring a substantial level of expertise from developers. Nonetheless, this level of human capacity is not sufficient considering the rapid evolution of new hardware architectures and software environments. In light of this, recent research proposes adopting machine learning and artificial intelligence techniques to automate the code optimization process. In this paper, we introduce PerfRL, an innovative framework designed to tackle the problem of code optimization. Our framework leverages the capabilities of small language models (SLMs) and reinforcement learning (RL), facilitating a system where SLMs can assimilate feedback from their environment during the fine-tuning phase, notably through unit tests. When benchmarked against existing models, PerfRL demonstrates superior efficiency in terms of speed and computational resource usage, attributed to its reduced need for training steps and its compatibility with SLMs. Furthermore, it substantially diminishes the risk of logical and syntactical errors. To evaluate our framework, we conduct experiments on the PIE dataset using a lightweight large language model (i.e., CodeT5) and a new reinforcement learning algorithm, namely RRHF. For evaluation purposes, we use a list of evaluation metrics related to optimization quality and speedup. The evaluation results show that our approach achieves similar or better results compared to state-of-the-art models using shorter training times and smaller pre-trained models.

Keywords

code generation large language model training parameter-efficient fine-tuning

Cite

@article{arxiv.2312.05657,
  title  = {PerfRL: A Small Language Model Framework for Efficient Code Optimization},
  author = {Shukai Duan and Nikos Kanakaris and Xiongye Xiao and Heng Ping and Chenyu Zhou and Nesreen K. Ahmed and Guixiang Ma and Mihai Capota and Theodore L. Willke and Shahin Nazarian and Paul Bogdan},
  journal= {arXiv preprint arXiv:2312.05657},
  year   = {2025}
}

PerfRL: A Small Language Model Framework for Efficient Code Optimization

Abstract

Keywords

Cite

Related papers