English

Validating AI-Generated Code with Live Programming

Human-Computer Interaction 2024-02-26 v3 Programming Languages

Abstract

AI-powered programming assistants are increasingly gaining popularity, with GitHub Copilot alone used by over a million developers worldwide. These tools are far from perfect, however, producing code suggestions that may be incorrect in subtle ways. As a result, developers face a new challenge: validating AI's suggestions. This paper explores whether Live Programming (LP), a continuous display of a program's runtime values, can help address this challenge. To answer this question, we built a Python editor that combines an AI-powered programming assistant with an existing LP environment. Using this environment in a between-subjects study (N=17), we found that by lowering the cost of validation by execution, LP can mitigate over- and under-reliance on AI-generated programs and reduce the cognitive load of validation for certain types of tasks.

Keywords

Cite

@article{arxiv.2306.09541,
  title  = {Validating AI-Generated Code with Live Programming},
  author = {Kasra Ferdowsi and Ruanqianqian Huang and Michael B. James and Nadia Polikarpova and Sorin Lerner},
  journal= {arXiv preprint arXiv:2306.09541},
  year   = {2024}
}

Comments

8 pages, 4 figures