Related papers: Past as a Guide: Leveraging Retrospective Learning…

WIP: Leveraging LLMs for Enforcing Design Principles in Student Code: Analysis of Prompting Strategies and RAG

This work-in-progress research-to-practice paper explores the integration of Large Language Models (LLMs) into the code-review process for open-source software projects developed in computer science and software engineering courses. The…

Software Engineering · Computer Science 2025-08-19 Dhruv Kolhatkar , Soubhagya Akkena , Edward F. Gehringer

LLMs are Bug Replicators: An Empirical Study on LLMs' Capability in Completing Bug-prone Code

Large Language Models (LLMs) have demonstrated remarkable performance in code completion. However, the training data used to develop these models often contain a significant amount of buggy code. Yet, it remains unclear to what extent these…

Software Engineering · Computer Science 2025-03-17 Liwei Guo , Sixiang Ye , Zeyu Sun , Xiang Chen , Yuxia Zhang , Bo Wang , Jie M. Zhang , Zheng Li , Yong Liu

Rethinking Code Refinement: Learning to Judge Code Efficiency

Large Language Models (LLMs) have demonstrated impressive capabilities in understanding and generating codes. Due to these capabilities, many recent methods are proposed to automatically refine the codes with LLMs. However, we should…

Software Engineering · Computer Science 2024-10-31 Minju Seo , Jinheon Baek , Sung Ju Hwang

Retrieval-augmented code completion for local projects using large language models

The use of large language models (LLMs) is becoming increasingly widespread among software developers. However, privacy and computational requirements are problematic with commercial solutions and the use of LLMs. In this work, we focus on…

Software Engineering · Computer Science 2025-06-17 Marko Hostnik , Marko Robnik-Šikonja

A Stitch in Time Saves Nine: Proactive Self-Refinement for Language Models

Recent advances in self-refinement have demonstrated significant potential for improving the outputs of large language models (LLMs) through iterative refinement. However, most existing self-refinement methods rely on a reactive process…

Computation and Language · Computer Science 2025-10-07 Jinyi Han , Xinyi Wang , Haiquan Zhao , Tingyun li , Zishang Jiang , Sihang Jiang , Jiaqing Liang , Xin Lin , Weikang Zhou , Zeye Sun , Fei Yu , Yanghua Xiao

PAG: Multi-Turn Reinforced LLM Self-Correction with Policy as Generative Verifier

Large Language Models (LLMs) have demonstrated impressive capabilities in complex reasoning tasks, yet they still struggle to reliably verify the correctness of their own outputs. Existing solutions to this verification challenge often…

Computation and Language · Computer Science 2025-06-13 Yuhua Jiang , Yuwen Xiong , Yufeng Yuan , Chao Xin , Wenyuan Xu , Yu Yue , Qianchuan Zhao , Lin Yan

Empowering Large Language Models to Set up a Knowledge Retrieval Indexer via Self-Learning

Retrieval-Augmented Generation (RAG) offers a cost-effective approach to injecting real-time knowledge into large language models (LLMs). Nevertheless, constructing and validating high-quality knowledge repositories require considerable…

Computation and Language · Computer Science 2024-05-28 Xun Liang , Simin Niu , Zhiyu li , Sensen Zhang , Shichao Song , Hanyu Wang , Jiawei Yang , Feiyu Xiong , Bo Tang , Chenyang Xi

Failure-Aware Enhancements for Large Language Model (LLM) Code Generation: An Empirical Study on Decision Framework

Large language models (LLMs) show promise for automating software development by translating requirements into code. However, even advanced prompting workflows like progressive prompting often leave some requirements unmet. Although methods…

Software Engineering · Computer Science 2026-02-04 Jianru Shen , Zedong Peng , Lucy Owen

Teaching LLMs to Refine with Tools

Large language models (LLMs) can refine their responses based on feedback, enabling self-improvement through iterative training or test-time refinement. However, existing methods predominantly focus on refinement within the same reasoning…

Computation and Language · Computer Science 2024-12-24 Dian Yu , Yuheng Zhang , Jiahao Xu , Tian Liang , Linfeng Song , Zhaopeng Tu , Haitao Mi , Dong Yu

Planning-Driven Programming: A Large Language Model Programming Workflow

The strong performance of large language models (LLMs) raises extensive discussion on their application to code generation. Recent research suggests continuous program refinements through visible tests to improve code generation accuracy in…

Software Engineering · Computer Science 2025-05-26 Chao Lei , Yanchuan Chang , Nir Lipovetzky , Krista A. Ehinger

Prompt engineering and framework: implementation to increase code reliability based guideline for LLMs

In this paper, we propose a novel prompting approach aimed at enhancing the ability of Large Language Models (LLMs) to generate accurate Python code. Specifically, we introduce a prompt template designed to improve the quality and…

Software Engineering · Computer Science 2025-06-16 Rogelio Cruz , Jonatan Contreras , Francisco Guerrero , Ezequiel Rodriguez , Carlos Valdez , Citlali Carrillo

Context-Augmented Code Generation Using Programming Knowledge Graphs

Large Language Models (LLMs) excel at code generation but struggle with complex problems. Retrieval-Augmented Generation (RAG) mitigates this issue by integrating external knowledge, yet retrieval models often miss relevant context, and…

Software Engineering · Computer Science 2026-01-29 Shahd Seddik , Fahd Seddik , Iman Saberi , Fatemeh Fard , Minh Hieu Huynh , Patanamon Thongtanunam

Large Language Models as Code Executors: An Exploratory Study

The capabilities of Large Language Models (LLMs) have significantly evolved, extending from natural language processing to complex tasks like code understanding and generation. We expand the scope of LLMs' capabilities to a broader context,…

Computation and Language · Computer Science 2024-10-11 Chenyang Lyu , Lecheng Yan , Rui Xing , Wenxi Li , Younes Samih , Tianbo Ji , Longyue Wang

Better Python Programming for all: With the focus on Maintainability

This study aims to enhance the maintainability of code generated by Large Language Models (LLMs), with a focus on the Python programming language. As the use of LLMs for coding assistance grows, so do concerns about the maintainability of…

Software Engineering · Computer Science 2024-08-20 Karthik Shivashankar , Antonio Martini

Precedent-Informed Reasoning: Mitigating Overthinking in Large Reasoning Models via Test-Time Precedent Learning

Reasoning in Large Language Models (LLMs) often suffers from inefficient long chain-of-thought traces with redundant self-exploration and validation, which inflate computational costs and even degrade performance. Inspired by human…

Artificial Intelligence · Computer Science 2026-02-17 Qianyue Wang , Jinwu Hu , Huanxiang Lin , Bolin Chen , Zhiquan Wen , Yaofo Chen , Yu Rong , Mingkui Tan

Re-Ranking Step by Step: Investigating Pre-Filtering for Re-Ranking with Large Language Models

Large Language Models (LLMs) have been revolutionizing a myriad of natural language processing tasks with their diverse zero-shot capabilities. Indeed, existing work has shown that LLMs can be used to great effect for many tasks, such as…

Computation and Language · Computer Science 2024-06-28 Baharan Nouriinanloo , Maxime Lamothe

Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation

Retrieval-augmented generation (RAG) enhances large language models (LLMs) by incorporating additional information from retrieval. However, studies have shown that LLMs still face challenges in effectively using the retrieved information,…

Computation and Language · Computer Science 2024-06-13 Shicheng Xu , Liang Pang , Mo Yu , Fandong Meng , Huawei Shen , Xueqi Cheng , Jie Zhou

LLM-as-a-Judge for Reference-less Automatic Code Validation and Refinement for Natural Language to Bash in IT Automation

In an effort to automatically evaluate and select the best model and improve code quality for automatic incident remediation in IT Automation, it is crucial to verify if the generated code for remediation action is syntactically and…

Software Engineering · Computer Science 2025-06-16 Ngoc Phuoc An Vo , Brent Paulovicks , Vadim Sheinin

REINFOREST: Reinforcing Semantic Code Similarity for Cross-Lingual Code Search Models

This paper introduces a novel code-to-code search technique that enhances the performance of Large Language Models (LLMs) by including both static and dynamic features as well as utilizing both similar and dissimilar examples during…

Software Engineering · Computer Science 2024-04-17 Anthony Saieva , Saikat Chakraborty , Gail Kaiser

Leveraging Passage Embeddings for Efficient Listwise Reranking with Large Language Models

Recent studies have demonstrated the effectiveness of using large language language models (LLMs) in passage ranking. The listwise approaches, such as RankGPT, have become new state-of-the-art in this task. However, the efficiency of…

Computation and Language · Computer Science 2025-01-29 Qi Liu , Bo Wang , Nan Wang , Jiaxin Mao