Related papers: ACPO: AI-Enabled Compiler Framework

MLGO: a Machine Learning Guided Compiler Optimizations Framework

Leveraging machine-learning (ML) techniques for compiler optimizations has been widely studied and explored in academia. However, the adoption of ML in general-purpose, industry strength compilers has yet to happen. We propose MLGO, a…

Programming Languages · Computer Science 2021-01-14 Mircea Trofin , Yundi Qian , Eugene Brevdo , Zinan Lin , Krzysztof Choromanski , David Li

ACPO: Adaptive Curriculum Policy Optimization for Aligning Vision-Language Models in Complex Reasoning

Aligning large-scale vision-language models (VLMs) for complex reasoning via reinforcement learning is often hampered by the limitations of existing policy optimization algorithms, such as static training schedules and the rigid, uniform…

Artificial Intelligence · Computer Science 2025-10-02 Yunhao Wang , Ziting Li , Shuai Chen , Tao Liu , Chao Song , Junjie Jiang , Jian Zhu , Peng Gao , Bin Qin

MLGOPerf: An ML Guided Inliner to Optimize Performance

For the past 25 years, we have witnessed an extensive application of Machine Learning to the Compiler space; the selection and the phase-ordering problem. However, limited works have been upstreamed into the state-of-the-art compilers,…

Programming Languages · Computer Science 2023-01-18 Amir H. Ashouri , Mostafa Elhoushi , Yuzhe Hua , Xiang Wang , Muhammad Asif Manzoor , Bryan Chan , Yaoqing Gao

Agentic Auto-Scheduling: An Experimental Study of LLM-Guided Loop Optimization

Automatic code optimization remains a difficult challenge, particularly for complex loop nests on modern hardware. This paper investigates a novel approach to code optimization where Large Language Models (LLMs) guide the process through a…

Programming Languages · Computer Science 2025-12-30 Massinissa Merouani , Islem Kara Bernou , Riyadh Baghdadi

LOOPer: A Learned Automatic Code Optimizer For Polyhedral Compilers

While polyhedral compilers have shown success in implementing advanced code transformations, they still face challenges in selecting the ones that lead to the most profitable speedups. This has motivated the use of machine learning based…

Programming Languages · Computer Science 2025-12-30 Massinissa Merouani , Afif Boudaoud , Iheb Nassim Aouadj , Nassim Tchoulak , Islem Kara Bernou , Hamza Benyamina , Fatima Benbouzid-Si Tayeb , Karima Benatchba , Hugh Leather , Riyadh Baghdadi

AceGRPO: Adaptive Curriculum Enhanced Group Relative Policy Optimization for Autonomous Machine Learning Engineering

Autonomous Machine Learning Engineering (MLE) requires agents to perform sustained, iterative optimization over long horizons. While recent LLM-based agents show promise, current prompt-based agents for MLE suffer from behavioral stagnation…

Machine Learning · Computer Science 2026-05-08 Yuzhu Cai , Zexi Liu , Xinyu Zhu , Cheng Wang , Yanfeng Wang , Siheng Chen

Incentivizing Dual Process Thinking for Efficient Large Language Model Reasoning

Large reasoning models (LRMs) have demonstrated strong performance on complex reasoning tasks, but often suffer from overthinking, generating redundant content regardless of task difficulty. Inspired by the dual process theory in cognitive…

Artificial Intelligence · Computer Science 2025-05-26 Xiaoxue Cheng , Junyi Li , Zhenduo Zhang , Xinyu Tang , Wayne Xin Zhao , Xinyu Kong , Zhiqiang Zhang

Learning to Make Compiler Optimizations More Effective

Because loops execute their body many times, compiler developers place much emphasis on their optimization. Nevertheless, in view of highly diverse source code and hardware, compilers still struggle to produce optimal target code. The sheer…

Programming Languages · Computer Science 2021-03-01 Rahim Mammadli , Marija Selakovic , Felix Wolf , Michael Pradel

Autocomp: A Powerful and Portable Code Optimizer for Tensor Accelerators

Hardware accelerators, especially those designed for tensor processing, have become ubiquitous in today's computing landscape. However, even with significant efforts in building compilers, programming these tensor accelerators remains…

Programming Languages · Computer Science 2025-11-07 Charles Hong , Sahil Bhatia , Alvin Cheung , Yakun Sophia Shao

Loop Unrolling in Multi-pipeline ASIP Design

Application Specific Instruction-set Processor (ASIP) is one of the popular processor design techniques for embedded systems which allows customizability in processor design without overly hindering design flexibility. Multi-pipeline ASIPs…

Programming Languages · Computer Science 2014-02-05 Rajitha Navarathna , Swarnalatha Radhakrishnan , Roshan Ragel

Less is More: Exploiting the Standard Compiler Optimization Levels for Better Performance and Energy Consumption

This paper presents the interesting observation that by performing fewer of the optimizations available in a standard compiler optimization level such as -O2, while preserving their original ordering, significant savings can be achieved in…

Performance · Computer Science 2018-06-19 Kyriakos Georgiou , Craig Blackmore , Samuel Xavier-de-Souza , Kerstin Eder

Lost in translation: Exposing hidden compiler optimization opportunities

Existing iterative compilation and machine-learning-based optimization techniques have been proven very successful in achieving better optimizations than the standard optimization levels of a compiler. However, they were not engineered to…

Programming Languages · Computer Science 2020-08-11 Kyriakos Georgiou , Zbigniew Chamski , Andres Amaya Garcia , David May , Kerstin Eder

ALCOP: Automatic Load-Compute Pipelining in Deep Learning Compiler for AI-GPUs

Pipelining between data loading and computation is a critical tensor program optimization for GPUs. In order to unleash the high performance of latest GPUs, we must perform a synergetic optimization of multi-stage pipelining across the…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-05-09 Guyue Huang , Yang Bai , Liu Liu , Yuke Wang , Bei Yu , Yufei Ding , Yuan Xie

An Optimizing Framework on MLIR for Efficient FPGA-based Accelerator Generation

With the increasing demand for computing capability given limited resource and power budgets, it is crucial to deploy applications to customized accelerators like FPGAs. However, FPGA programming is non-trivial. Although existing high-level…

Hardware Architecture · Computer Science 2024-01-11 Weichuang Zhang , Jieru Zhao , Guan Shen , Quan Chen , Chen Chen , Minyi Guo

Progress Report: A Deep Learning Guided Exploration of Affine Unimodular Loop Transformations

In this paper, we present a work in progress about a deep learning based approach for automatic code optimization in polyhedral compilers. The proposed technique explores combinations of affine and non-affine loop transformations to find…

Programming Languages · Computer Science 2022-06-09 Massinissa Merouani , Khaled Afif Boudaoud , Iheb Nassim Aouadj , Nassim Tchoulak , Fatima Benbouzid-Sitayeb , Karima Benatchba , Hugh Leather , Riyadh Baghdadi

Loop Optimization Framework

The LLVM compiler framework supports a selection of loop transformations such as vectorization, distribution and unrolling. Each transformation is carried-out by specialized passes that have been developed independently. In this paper we…

Programming Languages · Computer Science 2018-11-05 Michael Kruse , Hal Finkel

MLComp: A Methodology for Machine Learning-based Performance Estimation and Adaptive Selection of Pareto-Optimal Compiler Optimization Sequences

Embedded systems have proliferated in various consumer and industrial applications with the evolution of Cyber-Physical Systems and the Internet of Things. These systems are subjected to stringent constraints so that embedded software must…

Machine Learning · Computer Science 2021-10-12 Alessio Colucci , Dávid Juhász , Martin Mosbeck , Alberto Marchisio , Semeen Rehman , Manfred Kreutzer , Guenther Nadbath , Axel Jantsch , Muhammad Shafique

LOOPRAG: Enhancing Loop Transformation Optimization with Retrieval-Augmented Large Language Models

Loop transformations are semantics-preserving optimization techniques, widely used to maximize objectives such as parallelism. Despite decades of research, applying the optimal composition of loop transformations remains challenging due to…

Programming Languages · Computer Science 2025-12-19 Yijie Zhi , Yayu Cao , Jianhua Dai , Xiaoyang Han , Jingwen Pu , Qingran Wu , Sheng Cheng , Ming Cai

PolyBlocks: A Compiler Infrastructure for AI Chips and Programming Frameworks

We present the design and implementation of PolyBlocks, a modular and reusable MLIR-based compiler infrastructure for AI programming frameworks and AI chips. PolyBlocks is based on pass pipelines that compose transformations on loop nests…

Programming Languages · Computer Science 2026-03-11 Uday Bondhugula , Akshay Baviskar , Navdeep Katel , Vimal Patel , Anoop JS , Arnab Dutta

ELPO: Ensemble Learning Based Prompt Optimization for Large Language Models

The remarkable performance of Large Language Models (LLMs) highly relies on crafted prompts. However, manual prompt engineering is a laborious process, creating a core bottleneck for practical application of LLMs. This phenomenon has led to…

Computation and Language · Computer Science 2025-11-21 Qing Zhang , Bing Xu , Xudong Zhang , Yifan Shi , Yang Li , Chen Zhang , Yik Chung Wu , Ngai Wong , Yijie Chen , Hong Dai , Xiansen Chen , Mian Zhang