English

SampleFix: Learning to Generate Functionally Diverse Fixes

Software Engineering 2021-09-24 v3 Machine Learning Programming Languages Machine Learning

Abstract

Automatic program repair holds the potential of dramatically improving the productivity of programmers during the software development process and correctness of software in general. Recent advances in machine learning, deep learning, and NLP have rekindled the hope to eventually fully automate the process of repairing programs. However, previous approaches that aim to predict a single fix are prone to fail due to uncertainty about the true intend of the programmer. Therefore, we propose a generative model that learns a distribution over potential fixes. Our model is formulated as a deep conditional variational autoencoder that can efficiently sample fixes for a given erroneous program. In order to ensure diverse solutions, we propose a novel regularizer that encourages diversity over a semantic embedding space. Our evaluations on common programming errors show for the first time the generation of diverse fixes and strong improvements over the state-of-the-art approaches by fixing up to 45% of the erroneous programs. We additionally show that for the 65% of the repaired programs, our approach was able to generate multiple programs with diverse functionalities.

Keywords

Cite

@article{arxiv.1906.10502,
  title  = {SampleFix: Learning to Generate Functionally Diverse Fixes},
  author = {Hossein Hajipour and Apratim Bhattacharyya and Cristian-Alexandru Staicu and Mario Fritz},
  journal= {arXiv preprint arXiv:1906.10502},
  year   = {2021}
}

Comments

15 pages, 6 figures

R2 v1 2026-06-23T10:03:01.931Z