Related papers: Functional Overlap Reranking for Neural Code Gener…

VRank: Enhancing Verilog Code Generation from Large Language Models via Self-Consistency

Large Language Models (LLMs) have demonstrated promising capabilities in generating Verilog code from module specifications. To improve the quality of such generated Verilog codes, previous methods require either time-consuming manual…

Hardware Architecture · Computer Science 2025-02-04 Zhuorui Zhao , Ruidi Qiu , Ing-Chao Lin , Grace Li Zhang , Bing Li , Ulf Schlichtmann

Top Pass: Improve Code Generation by Pass@k-Maximized Code Ranking

Code generation has been greatly enhanced by the profound advancements in Large Language Models (LLMs) recently. Nevertheless, such LLM-based code generation approaches still struggle to generate error-free code in a few tries when faced…

Artificial Intelligence · Computer Science 2024-08-13 Zhi-Cun Lyu , Xin-Ye Li , Zheng Xie , Ming Li

Sifting through the Chaff: On Utilizing Execution Feedback for Ranking the Generated Code Candidates

Large Language Models (LLMs), such as GPT-4, StarCoder, and CodeLlama, are transforming the way developers approach programming by automatically generating code based on given natural language descriptions. Despite advancements, generating…

Software Engineering · Computer Science 2024-09-20 Zhihong Sun , Yao Wan , Jia Li , Hongyu Zhang , Zhi Jin , Ge Li , Chen Lyu

ProRank: Prompt Warmup via Reinforcement Learning for Small Language Models Reranking

Reranking is fundamental to information retrieval and retrieval-augmented generation, with recent Large Language Models (LLMs) significantly advancing reranking quality. Most current works rely on large-scale LLMs (>7B parameters),…

Information Retrieval · Computer Science 2026-04-17 Xianming Li , Aamir Shakir , Rui Huang , Tsz-fung Andrew Lee , Julius Lipp , Benjamin Clavié , Jing Li

Think Like Human Developers: Harnessing Community Knowledge for Structured Code Reasoning

Large Language Models (LLMs) have significantly advanced automated code generation, yet they struggle with complex coding tasks requiring multi-step logical reasoning. High-quality reasoning data is crucial for improving LLMs' reasoning…

Software Engineering · Computer Science 2025-03-20 Chengran Yang , Zhensu Sun , Hong Jin Kang , Jieke Shi , David Lo

ERank: Fusing Supervised Fine-Tuning and Reinforcement Learning for Effective and Efficient Text Reranking

Text reranking models are a crucial component in modern systems like Retrieval-Augmented Generation, tasked with selecting the most relevant documents prior to generation. However, current Large Language Models (LLMs) powered rerankers…

Information Retrieval · Computer Science 2025-09-03 Yuzheng Cai , Yanzhao Zhang , Dingkun Long , Mingxin Li , Pengjun Xie , Weiguo Zheng

Unlocking Reasoning Potential in Large Langauge Models by Scaling Code-form Planning

Despite the remarkable success of large language models (LLMs) on traditional natural language processing tasks, their planning ability remains a critical bottleneck in tackling complex multi-step reasoning tasks. Existing approaches mainly…

Computation and Language · Computer Science 2024-10-07 Jiaxin Wen , Jian Guan , Hongning Wang , Wei Wu , Minlie Huang

ReCode: Improving LLM-based Code Repair with Fine-Grained Retrieval-Augmented Generation

Recent advances in large language models (LLMs) have demonstrated impressive capabilities in code-related tasks, such as code generation and automated program repair. Despite their promising performance, most existing approaches for code…

Software Engineering · Computer Science 2025-09-03 Yicong Zhao , Shisong Chen , Jiacheng Zhang , Zhixu Li

Reranking for Natural Language Generation from Logical Forms: A Study based on Large Language Models

Large language models (LLMs) have demonstrated impressive capabilities in natural language generation. However, their output quality can be inconsistent, posing challenges for generating natural language from logical forms (LFs). This task…

Computation and Language · Computer Science 2023-09-22 Levon Haroutunian , Zhuang Li , Lucian Galescu , Philip Cohen , Raj Tumuluri , Gholamreza Haffari

CodeRL+: Improving Code Generation via Reinforcement with Execution Semantics Alignment

While Large Language Models (LLMs) excel at code generation by learning from vast code corpora, a fundamental semantic gap remains between their training on textual patterns and the goal of functional correctness, which is governed by…

Software Engineering · Computer Science 2026-04-23 Xue Jiang , Yihong Dong , Mengyang Liu , Hongyi Deng , Tian Wang , Yongding Tao , Rongyu Cao , Binhua Li , Zhi Jin , Wenpin Jiao , Fei Huang , Yongbin Li , Ge Li

Fault-Aware Neural Code Rankers

Large language models (LLMs) have demonstrated an impressive ability to generate code for various programming tasks. In many instances, LLMs can generate a correct program for a task when given numerous trials. Consequently, a recent trend…

Programming Languages · Computer Science 2022-12-13 Jeevana Priya Inala , Chenglong Wang , Mei Yang , Andres Codas , Mark Encarnación , Shuvendu K Lahiri , Madanlal Musuvathi , Jianfeng Gao

OpenCoderRank: Personalized Technical Assessments with Generative AI

Organizations and educational institutions use time-bound assessment tasks to evaluate coding and problem-solving skills. These assessments measure not only the correctness of the solutions, but also their efficiency. Problem setters…

Software Engineering · Computer Science 2026-04-07 Hridoy Sankar Dutta , Sana Ansari , Swati Kumari , Shounak Ravi Bhalerao

An Empirical Study on the Code Refactoring Capability of Large Language Models

Large Language Models (LLMs) have shown potential to enhance software development through automated code generation and refactoring, reducing development time and improving code quality. This study empirically evaluates StarCoder2, an LLM…

Software Engineering · Computer Science 2024-11-05 Jonathan Cordeiro , Shayan Noei , Ying Zou

ResRank: Unifying Retrieval and Listwise Reranking via End-to-End Joint Training with Residual Passage Compression

Large language model (LLM) based listwise reranking has emerged as the dominant paradigm for achieving state-of-the-art ranking effectiveness in information retrieval. However, its reliance on feeding full passage texts into the LLM…

Information Retrieval · Computer Science 2026-04-27 Xiaojie Ke , Shuai Zhang , Liansheng Sun , Yongjin Wang , Hengjun Jiang , Xiangkun Liu , Cunxin Gu , Jian Xu , Guanjun Jiang

DiffuRank: Effective Document Reranking with Diffusion Language Models

Recent advances in large language models (LLMs) have inspired new paradigms for document reranking. While this paradigm better exploits the reasoning and contextual understanding capabilities of LLMs, most existing LLM-based rerankers rely…

Information Retrieval · Computer Science 2026-02-16 Qi Liu , Kun Ai , Jiaxin Mao , Yanzhao Zhang , Mingxin Li , Dingkun Long , Pengjun Xie , Fengbin Zhu , Ji-Rong Wen

SACL: Understanding and Combating Textual Bias in Code Retrieval with Semantic-Augmented Reranking and Localization

Retrieval-Augmented Code Generation (RACG) is a critical technique for enhancing code generation by retrieving relevant information. In this work, we conduct an in-depth analysis of code retrieval by systematically masking specific features…

Computation and Language · Computer Science 2025-06-27 Dhruv Gupta , Gayathri Ganesh Lakshmy , Yiqing Xie

GroupRank: A Groupwise Paradigm for Effective and Efficient Passage Reranking with LLMs

Large Language Models (LLMs) have emerged as powerful tools for passage reranking in information retrieval, leveraging their superior reasoning capabilities to address the limitations of conventional models on complex queries. However,…

Information Retrieval · Computer Science 2026-05-01 Meixiu Long , Duolin Sun , Dan Yang , Yihan Jiao , Lei Liu , Jiahai Wang , BinBin Hu , Yue Shen , Jie Feng , Zhehao Tan , Junjie Wang , Lianzhen Zhong , Jian Wang , Peng Wei , Jinjie Gu

The Cream Rises to the Top: Efficient Reranking Method for Verilog Code Generation

LLMs face significant challenges in Verilog generation due to limited domain-specific knowledge. While sampling techniques improve pass@k metrics, hardware engineers need one trustworthy solution rather than uncertain candidates. To bridge…

Hardware Architecture · Computer Science 2025-12-10 Guang Yang , Wei Zheng , Xiang Chen , Yifan Sun , Fengji Zhang , Terry Yue Zhuo

Let's Revise Step-by-Step: A Unified Local Search Framework for Code Generation with LLMs

Large Language Models (LLMs) with inference-time scaling techniques show promise for code generation, yet face notable efficiency and scalability challenges. Construction-based tree-search methods suffer from rapid growth in tree size, high…

Computation and Language · Computer Science 2025-08-12 Zhiyi Lyu , Jianguo Huang , Yanchen Deng , Steven Hoi , Bo An

Lightweight reranking for language model generations

Large Language Models (LLMs) can exhibit considerable variation in the quality of their sampled outputs. Reranking and selecting the best generation from the sampled set is a popular way of obtaining strong gains in generation quality. In…

Artificial Intelligence · Computer Science 2024-01-15 Siddhartha Jain , Xiaofei Ma , Anoop Deoras , Bing Xiang