Related papers: Neuron Patching: Semantic-based Neuron-level Langu…

A Semantic-based Optimization Approach for Repairing LLMs: Case Study on Code Generation

Language Models (LMs) are widely used in software engineering for code generation, but they may produce erroneous code. Rather than repairing outputs, a more thorough remedy is to address underlying model failures. LM repair offers a…

Software Engineering · Computer Science 2025-12-02 Jian Gu , Aldeida Aleti , Chunyang Chen , Hongyu Zhang

Repair Is Nearly Generation: Multilingual Program Repair with LLMs

Most programmers make mistakes when writing code. Some of these mistakes are small and require few edits to the original program -- a class of errors recently termed last mile mistakes. These errors break the flow for experienced developers…

Software Engineering · Computer Science 2022-12-06 Harshit Joshi , José Cambronero , Sumit Gulwani , Vu Le , Ivan Radicek , Gust Verbruggen

Semantic-Based Neural Network Repair

Recently, neural networks have spread into numerous fields including many safety-critical systems. Neural networks are built (and trained) by programming in frameworks such as TensorFlow and PyTorch. Developers apply a rich set of…

Machine Learning · Computer Science 2023-06-16 Richard Schumi , Jun Sun

Neuron-Level Sequential Editing for Large Language Models

This work explores sequential model editing in large language models (LLMs), a critical task that involves modifying internal knowledge within LLMs continuously through multi-round editing, each incorporating updates or corrections to…

Computation and Language · Computer Science 2024-10-08 Houcheng Jiang , Junfeng Fang , Tianyu Zhang , An Zhang , Ruipeng Wang , Tao Liang , Xiang Wang

NLSR: Neuron-Level Safety Realignment of Large Language Models Against Harmful Fine-Tuning

The emergence of finetuning-as-a-service has revealed a new vulnerability in large language models (LLMs). A mere handful of malicious data uploaded by users can subtly manipulate the finetuning process, resulting in an alignment-broken…

Computation and Language · Computer Science 2024-12-18 Xin Yi , Shunfan Zheng , Linlin Wang , Gerard de Melo , Xiaoling Wang , Liang He

SemAgent: A Semantics Aware Program Repair Agent

Large Language Models (LLMs) have shown impressive capabilities in downstream software engineering tasks such as Automated Program Repair (APR). In particular, there has been a lot of research on repository-level issue-resolution benchmarks…

Software Engineering · Computer Science 2025-06-23 Anvith Pabba , Alex Mathai , Anindya Chakraborty , Baishakhi Ray

Semantic Code Repair using Neuro-Symbolic Transformation Networks

We study the problem of semantic code repair, which can be broadly defined as automatically fixing non-syntactic bugs in source code. The majority of past work in semantic code repair assumed access to unit tests against which candidate…

Artificial Intelligence · Computer Science 2017-10-31 Jacob Devlin , Jonathan Uesato , Rishabh Singh , Pushmeet Kohli

Memory-Efficient Large Language Models for Program Repair with Semantic-Guided Patch Generation

In this paper, we first show that increases in beam size, even for small-sized LLMs (1B-7B params), require extensive GPU usage, leading to up to 80% of recurring crashes due to memory overloads in LLM-based APR. Seemingly simple solutions…

Software Engineering · Computer Science 2025-10-20 Thanh Le-Cong , Bach Le , Toby Murray

Code Security Vulnerability Repair Using Reinforcement Learning with Large Language Models

With the recent advancement of Large Language Models (LLMs), generating functionally correct code has become less complicated for a wide array of developers. While using LLMs has sped up the functional development process, it poses a heavy…

Cryptography and Security · Computer Science 2024-02-01 Nafis Tanveer Islam , Mohammad Bahrami Karkevandi , Peyman Najafirad

SLMFix: Leveraging Small Language Models for Error Fixing with Reinforcement Learning

Recent advancements in large language models (LLMs) have shown very impressive capabilities in code generation across many programming languages. However, even state-of-the-art LLMs generate programs that contains syntactic errors and fail…

Software Engineering · Computer Science 2025-11-25 David Jiahao Fu , Aryan Gupta , Aaron Councilman , David Grove , Yu-Xiong Wang , Vikram Adve

Let's Focus on Neuron: Neuron-Level Supervised Fine-tuning for Large Language Model

Large Language Models (LLMs) are composed of neurons that exhibit various behaviors and roles, which become increasingly diversified as models scale. Recent studies have revealed that not all neurons are active across different datasets,…

Computation and Language · Computer Science 2024-03-19 Haoyun Xu , Runzhe Zhan , Derek F. Wong , Lidia S. Chao

Summary-Mediated Repair: Can LLMs use code summarisation as a tool for program repair?

Large Language Models (LLMs) often produce code with subtle implementation-level bugs despite strong benchmark performance. These errors are hard for LLMs to spot and can have large behavioural effects; yet when asked to summarise code,…

Software Engineering · Computer Science 2025-11-25 Lukas Twist

Enhancing Automated Program Repair through Fine-tuning and Prompt Engineering

Sequence-to-sequence models have been used to transform erroneous programs into correct ones when trained with a large enough dataset. Some recent studies also demonstrated strong empirical evidence that code review could improve the…

Machine Learning · Computer Science 2023-07-25 Rishov Paul , Md. Mohib Hossain , Mohammed Latif Siddiq , Masum Hasan , Anindya Iqbal , Joanna C. S. Santos

MELO: Enhancing Model Editing with Neuron-Indexed Dynamic LoRA

Large language models (LLMs) have shown great success in various Natural Language Processing (NLP) tasks, whist they still need updates after deployment to fix errors or keep pace with the changing knowledge in the world. Researchers…

Computation and Language · Computer Science 2023-12-20 Lang Yu , Qin Chen , Jie Zhou , Liang He

Mitigating the Language Mismatch and Repetition Issues in LLM-based Machine Translation via Model Editing

Large Language Models (LLMs) have recently revolutionized the NLP field, while they still fall short in some specific down-stream tasks. In the work, we focus on utilizing LLMs to perform machine translation, where we observe that two…

Computation and Language · Computer Science 2024-10-10 Weichuan Wang , Zhaoyi Li , Defu Lian , Chen Ma , Linqi Song , Ying Wei

MPN: Leveraging Multilingual Patch Neuron for Cross-lingual Model Editing

Large language models are known for encoding a vast amount of factual knowledge, but they often becomes outdated due to the ever-changing nature of external information. A promising solution to this challenge is the utilization of model…

Computation and Language · Computer Science 2024-01-09 Nianwen Si , Hao Zhang , Weiqiang Zhang

Enhancing Code Generation for Low-Resource Languages: No Silver Bullet

The advent of Large Language Models (LLMs) has significantly advanced the field of automated code generation. LLMs rely on large and diverse datasets to learn syntax, semantics, and usage patterns of programming languages. For low-resource…

Software Engineering · Computer Science 2025-02-03 Alessandro Giagnorio , Alberto Martin-Lopez , Gabriele Bavota

The Achilles' Heel of LLMs: How Altering a Handful of Neurons Can Cripple Language Abilities

Large Language Models (LLMs) have become foundational tools in natural language processing, powering a wide range of applications and research. Many studies have shown that LLMs share significant similarities with the human brain. Recent…

Artificial Intelligence · Computer Science 2026-02-10 Zixuan Qin , Qingchen Yu , Kunlin Lyu , Zhaoxin Fan , Yifan Sun

DeepCode AI Fix: Fixing Security Vulnerabilities with Large Language Models

The automated program repair field has attracted substantial interest over the years, but despite significant research efforts, creating a system that works well for complex semantic bugs such as security vulnerabilities has proven…

Cryptography and Security · Computer Science 2024-02-26 Berkay Berabi , Alexey Gronskiy , Veselin Raychev , Gishor Sivanrupan , Victor Chibotaru , Martin Vechev

Mechanistic Understanding and Mitigation of Language Confusion in English-Centric Large Language Models

Language confusion -- where large language models (LLMs) generate unintended languages against the user's need -- remains a critical challenge, especially for English-centric models. We present the first mechanistic interpretability (MI)…

Computation and Language · Computer Science 2025-09-19 Ercong Nie , Helmut Schmid , Hinrich Schütze