Related papers: Verified Code Transpilation with LLMs

Guided Tensor Lifting

Domain-specific languages (DSLs) for machine learning are revolutionizing the speed and efficiency of machine learning workloads as they enable users easy access to high-performance compiler optimizations and accelerators. However, to take…

Software Engineering · Computer Science 2025-04-29 Yixuan Li , José Wesley de Souza Magalhães , Alexander Brauckmann , Michael F. P. O'Boyle , Elizabeth Polgreen

Exploring and Unleashing the Power of Large Language Models in Automated Code Translation

Code translation tools (transpilers) are developed for automatic source-to-source translation. Although learning-based transpilers have shown impressive enhancement against rule-based counterparts, owing to their task-specific pre-training…

Software Engineering · Computer Science 2024-05-14 Zhen Yang , Fang Liu , Zhongxing Yu , Jacky Wai Keung , Jia Li , Shuo Liu , Yifan Hong , Xiaoxue Ma , Zhi Jin , Ge Li

CodeSift: An LLM-Based Reference-Less Framework for Automatic Code Validation

The advent of large language models (LLMs) has greatly facilitated code generation, but ensuring the functional correctness of generated code remains a challenge. Traditional validation methods are often time-consuming, error-prone, and…

Software Engineering · Computer Science 2024-08-29 Pooja Aggarwal , Oishik Chatterjee , Ting Dai , Prateeti Mohapatra , Brent Paulovicks , Brad Blancett , Arthur De Magalhaes

Building-Blocks for Performance Oriented DSLs

Domain-specific languages raise the level of abstraction in software development. While it is evident that programmers can more easily reason about very high-level programs, the same holds for compilers only if the compiler has an accurate…

Programming Languages · Computer Science 2011-09-06 Tiark Rompf , Arvind K. Sujeeth , HyoukJoong Lee , Kevin J. Brown , Hassan Chafi , Martin Odersky , Kunle Olukotun

CodePivot: Bootstrapping Multilingual Transpilation in LLMs via Reinforcement Learning without Parallel Corpora

Transpilation, or code translation, aims to convert source code from one programming language (PL) to another. It is beneficial for many downstream applications, from modernizing large legacy codebases to augmenting data for low-resource…

Software Engineering · Computer Science 2026-04-21 Shangyu Li , Juyong Jiang , Meibo Ren , Sizhe Zhong , Huiri Tan , Yunhao Gou , Xu Han , Chun Yong Chong , Yun Peng , Jiasi Shen

Evaluating LLM-generated code for domain-specific languages: molecular dynamics with LAMMPS

Large language models (LLMs) are changing the way researchers interact with code and data in scientific computing. While their ability to generate general-purpose code is well established, their effectiveness in producing scientifically…

Software Engineering · Computer Science 2026-05-25 Ethan Holbrook , Juan C. Verduzco , Alejandro Strachan

Leveraging LLMs for Multi-File DSL Code Generation: An Industrial Case Study

Large language models (LLMs) perform strongly on general-purpose code generation, yet their applicability to enterprise domain-specific languages (DSLs) remains underexplored, especially for repository-scale change generation spanning…

Software Engineering · Computer Science 2026-04-28 Sivajeet Chand , Kevin Nguyen , Peter Kuntz , Alexander Pretschner

D-LiFT: Improving LLM-based Decompiler Backend via Code Quality-driven Fine-tuning

As one of the key tools in many security tasks, decompilers reconstruct human-readable source code from binaries. Yet, despite recent advances, their outputs often suffer from syntactic and semantic errors and remain difficult to read.…

Cryptography and Security · Computer Science 2025-08-19 Muqi Zou , Hongyu Cai , Hongwei Wu , Zion Leonahenahe Basque , Arslan Khan , Berkay Celik , Dave , Tian , Antonio Bianchi , Ruoyu , Wang , Dongyan Xu

Leveraging LLMs for Formal Software Requirements -- Challenges and Prospects

Software correctness is ensured mathematically through formal verification, which involves the resources of generating formal requirement specifications and having an implementation that must be verified. Tools such as model-checkers and…

Software Engineering · Computer Science 2025-08-29 Arshad Beg , Diarmuid O'Donoghue , Rosemary Monahan

A Roadmap for Tamed Interactions with Large Language Models

We are witnessing a bloom of AI-powered software driven by Large Language Models (LLMs). Although the applications of these LLMs are impressive and seemingly countless, their unreliability hinders adoption. In fact, the tendency of LLMs to…

Software Engineering · Computer Science 2025-10-30 Vincenzo Scotti , Jan Keim , Tobias Hey , Andreas Metzger , Anne Koziolek , Raffaela Mirandola

PyVeritas: On Verifying Python via LLM-Based Transpilation and Bounded Model Checking for C

Python has become the dominant language for general-purpose programming, yet it lacks robust tools for formal verification. In contrast, programmers working in languages such as C benefit from mature model checkers, for example CBMC, which…

Software Engineering · Computer Science 2025-08-12 Pedro Orvalho , Marta Kwiatkowska

Natural Language based Specification and Verification

Recent frontier large language models (LLMs) have shown strong performance in identifying security vulnerabilities in large, mature open-source systems. As LLM-generated code becomes increasingly common, a natural goal is to prevent such…

Software Engineering · Computer Science 2026-05-13 Zhaorui Li , Chengyu Song

LLM-Powered Quantum Code Transpilation

There exist various Software Development Kits (SDKs) tailored to different quantum computing platforms. These are known as Quantum SDKs (QSDKs). Examples include but are not limited to Qiskit, Cirq, and PennyLane. However, this diversity…

Software Engineering · Computer Science 2025-12-09 Nazanin Siavash , Armin Moin

AutoVeriFix: Automatically Correcting Errors and Enhancing Functional Correctness in LLM-Generated Verilog Code

Large language models (LLMs) have demonstrated impressive capabilities in generating software code for high-level programming languages such as Python and C++. However, their application to hardware description languages, such as Verilog,…

Hardware Architecture · Computer Science 2025-09-11 Yan Tan , Xiangchen Meng , Zijun Jiang , Yangdi Lyu

What a diff makes: automating code migration with large language models

Modern software programs are built on stacks that are often undergoing changes that introduce updates and improvements, but may also break any project that depends upon them. In this paper we explore the use of Large Language Models (LLMs)…

Software Engineering · Computer Science 2025-11-04 Katherine A. Rosenfeld , Cliff C. Kerr , Jessica Lundin

StackPilot: Autonomous Function Agents for Scalable and Environment-Free Code Execution

Recent advances in large language models (LLMs) have substantially enhanced automated code generation across a wide range of programming languages. Nonetheless, verifying the correctness and executability of LLM-generated code remains a…

Programming Languages · Computer Science 2026-01-14 Xinkui Zhao , Yifan Zhang , Zhengyi Zhou , Yueshen Xu

An LLM-powered Natural-to-Robotic Language Translation Framework with Correctness Guarantees

The Large Language Models (LLM) are increasingly being deployed in robotics to generate robot control programs for specific user tasks, enabling embodied intelligence. Existing methods primarily focus on LLM training and prompt design that…

Robotics · Computer Science 2025-08-27 ZhenDong Chen , ZhanShang Nie , ShiXing Wan , JunYi Li , YongTian Cheng , Shuai Zhao

Code Transpilation for Hardware Accelerators

DSLs and hardware accelerators have proven to be very effective in optimizing computationally expensive workloads. In this paper, we propose a solution to the challenge of manually rewriting legacy or unoptimized code in domain-specific…

Programming Languages · Computer Science 2023-08-15 Yuto Nishida , Sahil Bhatia , Shadaj Laddad , Hasan Genc , Yakun Sophia Shao , Alvin Cheung

Towards Semantics Lifting for Scientific Computing: A Case Study on FFT

The rise of automated code generation tools, such as large language models (LLMs), has introduced new challenges in ensuring the correctness and efficiency of scientific software, particularly in complex kernels, where numerical stability,…

Programming Languages · Computer Science 2025-01-17 Naifeng Zhang , Sanil Rao , Mike Franusich , Franz Franchetti

UVLLM: An Automated Universal RTL Verification Framework using LLMs

Verifying hardware designs in embedded systems is crucial but often labor-intensive and time-consuming. While existing solutions have improved automation, they frequently rely on unrealistic assumptions. To address these challenges, we…

Hardware Architecture · Computer Science 2024-11-26 Yuchen Hu , Junhao Ye , Ke Xu , Jialin Sun , Shiyue Zhang , Xinyao Jiao , Dingrong Pan , Jie Zhou , Ning Wang , Weiwei Shan , Xinwei Fang , Xi Wang , Nan Guan , Zhe Jiang