English

Syntax-Guided Program Reduction for Understanding Neural Code Intelligence Models

Software Engineering 2022-06-15 v2 Machine Learning Programming Languages

Abstract

Neural code intelligence (CI) models are opaque black-boxes and offer little insight on the features they use in making predictions. This opacity may lead to distrust in their prediction and hamper their wider adoption in safety-critical applications. Recently, input program reduction techniques have been proposed to identify key features in the input programs to improve the transparency of CI models. However, this approach is syntax-unaware and does not consider the grammar of the programming language. In this paper, we apply a syntax-guided program reduction technique that considers the grammar of the input programs during reduction. Our experiments on multiple models across different types of input programs show that the syntax-guided program reduction technique is faster and provides smaller sets of key tokens in reduced programs. We also show that the key tokens could be used in generating adversarial examples for up to 65% of the input programs.

Keywords

Cite

@article{arxiv.2205.14374,
  title  = {Syntax-Guided Program Reduction for Understanding Neural Code Intelligence Models},
  author = {Md Rafiqul Islam Rabin and Aftab Hussain and Mohammad Amin Alipour},
  journal= {arXiv preprint arXiv:2205.14374},
  year   = {2022}
}

Comments

The 6th ACM SIGPLAN International Symposium on Machine Programming (MAPS'22); Related to arXiv:2202.06474

R2 v1 2026-06-24T11:31:44.645Z