Related papers: Opal: A Modular Framework for Optimizing Performan…

Optimas: An Intelligent Analytics-Informed Generative AI Framework for Performance Optimization

Large language models (LLMs) show promise for automated code optimization. However, without performance context, they struggle to produce correct and effective code transformations. Existing performance tools can identify bottlenecks but…

Performance · Computer Science 2026-04-28 Mohammad Zaeed , Tanzima Z. Islam , Vladimir Indic

OptiML: An End-to-End Framework for Program Synthesis and CUDA Kernel Optimization

Generating high-performance CUDA kernels remains challenging due to the need to navigate a combinatorial space of low-level transformations under noisy and expensive hardware feedback. Although large language models can synthesize…

Machine Learning · Computer Science 2026-02-16 Arijit Bhattacharjee , Heng Ping , Son Vu Le , Paul Bogdan , Nesreen K. Ahmed , Ali Jannesari

LLMPerf: GPU Performance Modeling meets Large Language Models

Performance modeling, a pivotal domain in program cost analysis, currently relies on manually crafted models constrained by various program and hardware limitations, especially in the intricate landscape of GPGPU. Meanwhile, Large Language…

Performance · Computer Science 2025-03-17 Khoi N. M. Nguyen , Hoang Duy Nguyen Do , Huyen Thao Le , Thanh Tuan Dao

LIMINAL: Exploring The Frontiers of LLM Decode Performance

The rapid advancement of Large Language Models (LLMs) necessitates a deep understanding of their fundamental performance limits. This paper investigates the limits of LLM inference, focusing on hardware-imposed bottlenecks in…

Hardware Architecture · Computer Science 2025-11-17 Michael Davies , Neal Crago , Karthikeyan Sankaralingam , Christos Kozyrakis

OptMATH: A Scalable Bidirectional Data Synthesis Framework for Optimization Modeling

Despite the rapid development of large language models (LLMs), a fundamental challenge persists: the lack of high-quality optimization modeling datasets hampers LLMs' robust modeling of practical optimization problems from natural language…

Artificial Intelligence · Computer Science 2025-02-24 Hongliang Lu , Zhonglin Xie , Yaoyu Wu , Can Ren , Yuxuan Chen , Zaiwen Wen

Solving General Natural-Language-Description Optimization Problems with Large Language Models

Optimization problems seek to find the best solution to an objective under a set of constraints, and have been widely investigated in real-world applications. Modeling and solving optimization problems in a specific domain typically require…

Optimization and Control · Mathematics 2024-07-12 Jihai Zhang , Wei Wang , Siyan Guo , Li Wang , Fangquan Lin , Cheng Yang , Wotao Yin

Autonomous Multi-Objective Optimization Using Large Language Model

Multi-objective optimization problems (MOPs) are ubiquitous in real-world applications, presenting a complex challenge of balancing multiple conflicting objectives. Traditional evolutionary algorithms (EAs), though effective, often rely on…

Neural and Evolutionary Computing · Computer Science 2024-07-29 Yuxiao Huang , Shenghao Wu , Wenjie Zhang , Jibin Wu , Liang Feng , Kay Chen Tan

OptiMUS-0.3: Using Large Language Models to Model and Solve Optimization Problems at Scale

Optimization problems are pervasive in sectors from manufacturing and distribution to healthcare. However, most such problems are still solved heuristically by hand rather than optimally by state-of-the-art solvers because the expertise…

Artificial Intelligence · Computer Science 2025-08-29 Ali AhmadiTeshnizi , Wenzhi Gao , Herman Brunborg , Shayan Talaei , Connor Lawless , Madeleine Udell

OWL: A Large Language Model for IT Operations

With the rapid development of IT operations, it has become increasingly crucial to efficiently manage and analyze large volumes of data for practical applications. The techniques of Natural Language Processing (NLP) have shown remarkable…

Computation and Language · Computer Science 2024-09-30 Hongcheng Guo , Jian Yang , Jiaheng Liu , Liqun Yang , Linzheng Chai , Jiaqi Bai , Junran Peng , Xiaorong Hu , Chao Chen , Dongfeng Zhang , Xu Shi , Tieqiao Zheng , Liangfan Zheng , Bo Zhang , Ke Xu , Zhoujun Li

Do Large Language Models Understand Performance Optimization?

Large Language Models (LLMs) have emerged as powerful tools for software development tasks such as code completion, translation, and optimization. However, their ability to generate efficient and correct code, particularly in complex…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-03-19 Bowen Cui , Tejas Ramesh , Oscar Hernandez , Keren Zhou

Query Performance Explanation through Large Language Model for HTAP Systems

In hybrid transactional and analytical processing (HTAP) systems, users often struggle to understand why query plans from one engine (OLAP or OLTP) perform significantly slower than those from another. Although optimizers provide plan…

Databases · Computer Science 2024-12-03 Haibo Xiu , Li Zhang , Tieying Zhang , Jun Yang , Jianjun Chen

A New Benchmark for the Appropriate Evaluation of RTL Code Optimization

The rapid progress of artificial intelligence increasingly relies on efficient integrated circuit (IC) design. Recent studies have explored the use of large language models (LLMs) for generating Register Transfer Level (RTL) code, but…

Artificial Intelligence · Computer Science 2026-01-06 Yao Lu , Shang Liu , Hangan Zhou , Wenji Fang , Qijun Zhang , Zhiyao Xie

OptiBench Meets ReSocratic: Measure and Improve LLMs for Optimization Modeling

Large language models (LLMs) have exhibited their problem-solving abilities in mathematical reasoning. Solving realistic optimization (OPT) problems in application scenarios requires advanced and applied mathematics ability. However,…

Machine Learning · Computer Science 2025-06-05 Zhicheng Yang , Yiwei Wang , Yinya Huang , Zhijiang Guo , Wei Shi , Xiongwei Han , Liang Feng , Linqi Song , Xiaodan Liang , Jing Tang

Inference Performance Optimization for Large Language Models on CPUs

Large language models (LLMs) have shown exceptional performance and vast potential across diverse tasks. However, the deployment of LLMs with high performance in low-resource environments has garnered significant attention in the industry.…

Artificial Intelligence · Computer Science 2024-07-11 Pujiang He , Shan Zhou , Wenhuan Huang , Changqing Li , Duyi Wang , Bin Guo , Chen Meng , Sheng Gui , Weifei Yu , Yi Xie

OptiMind: Teaching LLMs to Think Like Optimization Experts

Mathematical programming -- the task of expressing operations and decision-making problems in precise mathematical language -- is fundamental across domains, yet remains a skill-intensive process requiring operations research expertise.…

Machine Learning · Computer Science 2026-01-15 Xinzhi Zhang , Zeyi Chen , Humishka Zope , Hugo Barbalho , Konstantina Mellou , Marco Molinaro , Janardhan Kulkarni , Ishai Menache , Sirui Li

Omniwise: Predicting GPU Kernels Performance with LLMs

In recent years, the rapid advancement of deep neural networks (DNNs) has revolutionized artificial intelligence, enabling models with unprecedented capabilities in understanding, generating, and processing complex data. These powerful…

Machine Learning · Computer Science 2025-06-27 Zixian Wang , Cole Ramos , Muhammad A. Awad , Keith Lowery

Dissecting the Runtime Performance of the Training, Fine-tuning, and Inference of Large Language Models

Large Language Models (LLMs) have seen great advance in both academia and industry, and their popularity results in numerous open-source frameworks and techniques in accelerating LLM pre-training, fine-tuning, and inference. Training and…

Performance · Computer Science 2023-12-04 Longteng Zhang , Xiang Liu , Zeyu Li , Xinglin Pan , Peijie Dong , Ruibo Fan , Rui Guo , Xin Wang , Qiong Luo , Shaohuai Shi , Xiaowen Chu

Large-Scale Optimization Model Auto-Formulation: Harnessing LLM Flexibility via Structured Workflow

Large-scale optimization is a key backbone of modern business decision-making. However, building these models is often labor-intensive and time-consuming. We address this by proposing LEAN-LLM-OPT, a LightwEight AgeNtic workflow…

Artificial Intelligence · Computer Science 2026-02-03 Kuo Liang , Yuhang Lu , Jianming Mao , Shuyi Sun , Chunwei Yang , Congcong Zeng , Xiao Jin , Hanzhang Qin , Ruihao Zhu , Chung-Piaw Teo

OptLLM: Optimal Assignment of Queries to Large Language Models

Large Language Models (LLMs) have garnered considerable attention owing to their remarkable capabilities, leading to an increasing number of companies offering LLMs as services. Different LLMs achieve different performance at different…

Software Engineering · Computer Science 2024-05-27 Yueyue Liu , Hongyu Zhang , Yuantian Miao , Van-Hoang Le , Zhiqiang Li

PEAK: A Performance Engineering AI-Assistant for GPU Kernels Powered by Natural Language Transformations

Advancements in large language models (LLMs) are showing promising impact in software development and programming assistance. However, these models struggle when operating on low-level backend code. This challenge is exacerbated in the…

Software Engineering · Computer Science 2025-12-23 Muhammad Usman Tariq , Abhinav Jangda , Angelica Moreira , Madan Musuvathi , Tyler Sorensen