English

CodeSift: An LLM-Based Reference-Less Framework for Automatic Code Validation

Software Engineering 2024-08-29 v1 Artificial Intelligence

Abstract

The advent of large language models (LLMs) has greatly facilitated code generation, but ensuring the functional correctness of generated code remains a challenge. Traditional validation methods are often time-consuming, error-prone, and impractical for large volumes of code. We introduce CodeSift, a novel framework that leverages LLMs as the first-line filter of code validation without the need for execution, reference code, or human feedback, thereby reducing the validation effort. We assess the effectiveness of our method across three diverse datasets encompassing two programming languages. Our results indicate that CodeSift outperforms state-of-the-art code evaluation methods. Internal testing conducted with subject matter experts reveals that the output generated by CodeSift is in line with human preference, reinforcing its effectiveness as a dependable automated code validation tool.

Keywords

Cite

@article{arxiv.2408.15630,
  title  = {CodeSift: An LLM-Based Reference-Less Framework for Automatic Code Validation},
  author = {Pooja Aggarwal and Oishik Chatterjee and Ting Dai and Prateeti Mohapatra and Brent Paulovicks and Brad Blancett and Arthur De Magalhaes},
  journal= {arXiv preprint arXiv:2408.15630},
  year   = {2024}
}