Related papers: Guided Debugging of Auto-Translated Code Using Dif…

Designing for Novice Debuggers: A Pilot Study on an AI-Assisted Debugging Tool

Debugging is a fundamental skill that novice programmers must develop. Numerous tools have been created to assist novice programmers in this process. Recently, large language models (LLMs) have been integrated with automated program repair…

Software Engineering · Computer Science 2025-11-10 Oka Kurniawan , Erick Chandra , Christopher M. Poskitt , Yannic Noller , Kenny Tsu Wei Choo , Cyrille Jegourel

HInter: Exposing Hidden Intersectional Bias in Large Language Models

Large Language Models (LLMs) may portray discrimination towards certain individuals, especially those characterized by multiple attributes (aka intersectional bias). Discovering intersectional bias in LLMs is challenging, as it involves…

Computation and Language · Computer Science 2025-03-18 Badr Souani , Ezekiel Soremekun , Mike Papadakis , Setsuko Yokoyama , Sudipta Chattopadhyay , Yves Le Traon

RulER: Automated Rule-Based Semantic Error Localization and Repair for Code Translation

Automated code translation aims to convert programs between different programming languages while maintaining their functionality. Due to the imperfections of code translation models, the generated translations may contain errors that…

Software Engineering · Computer Science 2025-09-19 Shuo Jin , Songqiang Chen , Xiaoyuan Xie , Shing-Chi Cheung

Lost in Translation: A Study of Bugs Introduced by Large Language Models while Translating Code

Code translation aims to convert source code from one programming language (PL) to another. Given the promising abilities of large language models (LLMs) in code synthesis, researchers are exploring their potential to automate code…

Software Engineering · Computer Science 2024-01-17 Rangeet Pan , Ali Reza Ibrahimzada , Rahul Krishna , Divya Sankar , Lambert Pouguem Wassi , Michele Merler , Boris Sobolev , Raju Pavuluri , Saurabh Sinha , Reyhaneh Jabbarvand

Mutation Testing via Iterative Large Language Model-Driven Scientific Debugging

Large Language Models (LLMs) can generate plausible test code. Intuitively they generate this by imitating tests seen in their training data, rather than reasoning about execution semantics. However, such reasoning is important when…

Software Engineering · Computer Science 2025-03-12 Philipp Straubinger , Marvin Kreis , Stephan Lukasczyk , Gordon Fraser

Teaching Large Language Models to Self-Debug

Large language models (LLMs) have achieved impressive performance on code generation. However, for complex programming tasks, generating the correct solution in one go becomes challenging, thus some prior works have designed program repair…

Computation and Language · Computer Science 2023-10-06 Xinyun Chen , Maxwell Lin , Nathanael Schärli , Denny Zhou

Beyond Translation Accuracy: Addressing False Failures in LLM-Based Code Translation

Large Language Models (LLMs) have achieved remarkable success in automated code translation. While prior work has focused on improving translation accuracy through advanced prompting and iterative repair, the reliability of the underlying…

Software Engineering · Computer Science 2026-05-11 Fazle Rabbi , Soumit Kanti Saha , Jinqiu Yang

TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement

Large Language Models (LLMs) have achieved impressive results in Machine Translation (MT). However, careful evaluations by human reveal that the translations produced by LLMs still contain multiple errors. Importantly, feeding back such…

Computation and Language · Computer Science 2024-06-24 Zhaopeng Feng , Yan Zhang , Hao Li , Bei Wu , Jiayu Liao , Wenqiang Liu , Jun Lang , Yang Feng , Jian Wu , Zuozhu Liu

Effective Large Language Model Debugging with Best-first Tree Search

Large Language Models (LLMs) show promise in code generation tasks. However, their code-writing abilities are often limited in scope: while they can successfully implement simple functions, they struggle with more complex tasks. A…

Software Engineering · Computer Science 2024-07-30 Jialin Song , Jonathan Raiman , Bryan Catanzaro

Code Linting using Language Models

Code linters play a crucial role in developing high-quality software systems by detecting potential problems (e.g., memory leaks) in the source code of systems. Despite their benefits, code linters are often language-specific, focused on…

Software Engineering · Computer Science 2024-07-24 Darren Holden , Nafiseh Kahani

TIM: Teaching Large Language Models to Translate with Comparison

Open-sourced large language models (LLMs) have demonstrated remarkable efficacy in various tasks with instruction tuning. However, these models can sometimes struggle with tasks that require more specialized knowledge such as translation.…

Computation and Language · Computer Science 2024-01-23 Jiali Zeng , Fandong Meng , Yongjing Yin , Jie Zhou

HDLdebugger: Streamlining HDL debugging with Large Language Models

In the domain of chip design, Hardware Description Languages (HDLs) play a pivotal role. However, due to the complex syntax of HDLs and the limited availability of online resources, debugging HDL codes remains a difficult and time-intensive…

Hardware Architecture · Computer Science 2024-03-19 Xufeng Yao , Haoyang Li , Tsz Ho Chan , Wenyi Xiao , Mingxuan Yuan , Yu Huang , Lei Chen , Bei Yu

Exploring and Unleashing the Power of Large Language Models in Automated Code Translation

Code translation tools (transpilers) are developed for automatic source-to-source translation. Although learning-based transpilers have shown impressive enhancement against rule-based counterparts, owing to their task-specific pre-training…

Software Engineering · Computer Science 2024-05-14 Zhen Yang , Fang Liu , Zhongxing Yu , Jacky Wai Keung , Jia Li , Shuo Liu , Yifan Hong , Xiaoxue Ma , Zhi Jin , Ge Li

Towards Translating Real-World Code with LLMs: A Study of Translating to Rust

Large language models (LLMs) show promise in code translation - the task of translating code written in one programming language to another language - due to their ability to write code in most programming languages. However, LLM's…

Software Engineering · Computer Science 2025-04-18 Hasan Ferit Eniser , Hanliang Zhang , Cristina David , Meng Wang , Maria Christakis , Brandon Paulsen , Joey Dodds , Daniel Kroening

xTower: A Multilingual LLM for Explaining and Correcting Translation Errors

While machine translation (MT) systems are achieving increasingly strong performance on benchmarks, they often produce translations with errors and anomalies. Understanding these errors can potentially help improve the translation quality…

Computation and Language · Computer Science 2024-07-01 Marcos Treviso , Nuno M. Guerreiro , Sweta Agrawal , Ricardo Rei , José Pombal , Tania Vaz , Helena Wu , Beatriz Silva , Daan van Stigt , André F. T. Martins

Real, Fake, or Manipulated? Detecting Machine-Influenced Text

Large Language Model (LLMs) can be used to write or modify documents, presenting a challenge for understanding the intent behind their use. For example, benign uses may involve using LLM on a human-written document to improve its grammar or…

Computation and Language · Computer Science 2025-09-22 Yitong Wang , Zhongping Zhang , Margherita Piana , Zheng Zhou , Peter Gerstoft , Bryan A. Plummer

Rectifier: Code Translation with Corrector via LLMs

Software migration is garnering increasing attention with the evolution of software and society. Early studies mainly relied on handcrafted translation rules to translate between two languages, the translation process is error-prone and…

Software Engineering · Computer Science 2024-07-11 Xin Yin , Chao Ni , Tien N. Nguyen , Shaohua Wang , Xiaohu Yang

Finding Missed Code Size Optimizations in Compilers using LLMs

Compilers are complex, and significant effort has been expended on testing them. Techniques such as random program generation and differential testing have proved highly effective and have uncovered thousands of bugs in production…

Software Engineering · Computer Science 2025-01-03 Davide Italiano , Chris Cummins

MEIC: Re-thinking RTL Debug Automation using LLMs

The deployment of Large Language Models (LLMs) for code debugging (e.g., C and Python) is widespread, benefiting from their ability to understand and interpret intricate concepts. However, in the semiconductor industry, utilising LLMs to…

Hardware Architecture · Computer Science 2024-05-14 Ke Xu , Jialin Sun , Yuchen Hu , Xinwei Fang , Weiwei Shan , Xi Wang , Zhe Jiang

TransAgent: Enhancing LLM-Based Code Translation via Fine-Grained Execution Alignment

Code translation transforms code between programming languages while preserving functionality, which is critical in software development and maintenance. While traditional learning-based code translation methods have limited effectiveness…

Software Engineering · Computer Science 2026-04-08 Zhiqiang Yuan , Weitong Chen , Hanlin Wang , Xin Peng , Zhenpeng Chen , Yiling Lou