Related papers: Syntax-Guided Program Reduction for Understanding …

Extracting Label-specific Key Input Features for Neural Code Intelligence Models

The code intelligence (CI) models are often black-box and do not offer any insights on the input features that they learn for making correct predictions. This opacity may lead to distrust in their prediction and hamper their wider adoption…

Software Engineering · Computer Science · 2022-02-15 · Md Rafiqul Islam Rabin

Understanding Neural Code Intelligence Through Program Simplification

A wide range of code intelligence (CI) tools, powered by deep neural networks, have been developed recently to improve programming productivity and perform program analysis. To reliably use such tools, developers often need to reason about…

Software Engineering · Computer Science · 2021-09-10 · Md Rafiqul Islam Rabin, Vincent J. Hellendoorn, Mohammad Amin Alipour

Inferring Input Grammars from Dynamic Control Flow

A program is characterized by its input model, and a formal input model can be of use in diverse areas including vulnerability analysis, reverse engineering, fuzzing and software testing, clone detection and refactoring. Unfortunately,…

Software Engineering · Computer Science · 2019-12-13 · Rahul Gopinath, Björn Mathis, Andreas Zeller

Programs as Black-Box Explanations

Recent work in model-agnostic explanations of black-box machine learning has demonstrated that interpretability of complex models does not have to come at the cost of accuracy or model flexibility. However, it is not clear what kind of…

Machine Learning · Statistics · 2016-11-24 · Sameer Singh, Marco Tulio Ribeiro, Carlos Guestrin

Benchmarking Language Models for Code Syntax Understanding

Pre-trained language models have demonstrated impressive performance in both natural language processing and program understanding, which represent the input as a token sequence without explicitly modeling its structure. Some prior works…

Computation and Language · Computer Science · 2022-10-27 · Da Shen, Xinyun Chen, Chenguang Wang, Koushik Sen, Dawn Song

Fantastic Features and Where to Find Them: Detecting Cognitive Impairment with a Subsequence Classification Guided Approach

Despite the widely reported success of embedding-based machine learning methods on natural language processing tasks, the use of more easily interpreted engineered features remains common in fields such as cognitive impairment (CI)…

Machine Learning · Computer Science · 2020-10-14 · Benjamin Eyre, Aparna Balagopalan, Jekaterina Novikova

A Survey on Pretrained Language Models for Neural Code Intelligence

As the complexity of modern software continues to escalate, software engineering has become an increasingly daunting and error-prone endeavor. In recent years, the field of Neural Code Intelligence (NCI) has emerged as a promising solution,…

Software Engineering · Computer Science · 2022-12-21 · Yichen Xu, Yanqiao Zhu

Improving Deep Learning Interpretability by Saliency Guided Training

Saliency methods have been widely used to highlight important input features in model predictions. Most existing methods use backpropagation on a modified gradient function to generate saliency maps. Thus, noisy gradients can result in…

Computer Vision and Pattern Recognition · Computer Science · 2021-11-30 · Aya Abdelsalam Ismail, Héctor Corrada Bravo, Soheil Feizi

Leveraging Models to Reduce Test Cases in Software Repositories

Given a failing test case, test case reduction yields a smaller test case that reproduces the failure. This process can be time consuming due to repeated trial and error with smaller test cases. Current techniques speed up reduction by only…

Software Engineering · Computer Science · 2021-03-23 · Golnaz Gharachorlu, Nick Sumner

Study of Distractors in Neural Models of Code

Finding important features that contribute to the prediction of neural models is an active area of research in explainable AI. Neural models are opaque and finding such features sheds light on a better understanding of their predictions. In…

Machine Learning · Computer Science · 2023-08-15 · Md Rafiqul Islam Rabin, Aftab Hussain, Sahil Suneja, Mohammad Amin Alipour

Discovering Continuous-Time Memory-Based Symbolic Policies using Genetic Programming

Artificial intelligence techniques are increasingly being applied to solve control problems, but often rely on black-box methods without transparent output generation. To improve the interpretability and transparency in control systems,…

Neural and Evolutionary Computing · Computer Science · 2025-06-11 · Sigur de Vries, Sander Keemink, Marcel van Gerven

Reductions for Safety Proofs (Extended Version)

Program reductions are used widely to simplify reasoning about the correctness of concurrent and distributed programs. In this paper, we propose a general approach to proof simplification of concurrent programs based on exploring generic…

Programming Languages · Computer Science · 2019-11-01 · Azadeh Farzan, Anthony Vandikas

Diffusion On Syntax Trees For Program Synthesis

Large language models generate code one token at a time. Their autoregressive generation process lacks the feedback of observing the program's output. Training LLMs to suggest edits directly can be challenging due to the scarcity of rich…

Artificial Intelligence · Computer Science · 2024-06-03 · Shreyas Kapur, Erik Jenner, Stuart Russell

Learning Model Agnostic Explanations via Constraint Programming

Interpretable Machine Learning faces a recurring challenge of explaining the predictions made by opaque classifiers such as ensemble models, kernel methods, or neural networks in terms that are understandable to humans. When the model is…

Machine Learning · Computer Science · 2024-11-14 · Frederic Koriche, Jean-Marie Lagniez, Stefan Mengel, Chi Tran

Sensitivity based Neural Networks Explanations

Although neural networks can achieve very high predictive performance on various different tasks such as image recognition or natural language processing, they are often considered as opaque "black boxes". The difficulty of interpreting the…

Machine Learning · Statistics · 2020-01-22 · Enguerrand Horel, Virgile Mison, Tao Xiong, Kay Giesecke, Lidia Mangu

SYNTAX: A computer program to compress a sequence and to estimate its information content

The determination of block-entropies is a well established method for the investigation of discrete data, also called symbols (7). There is a large variety of such symbolic sequences, ranging from texts written in natural languages,…

Disordered Systems and Neural Networks · Physics · 2007-05-23 · Miguel Angel Jimenez-Montano, Werner Ebeling, Thorsten Poeschel

Position: Intelligent Coding Systems Should Write Programs with Justifications

Intelligent coding systems are transforming software development by enabling users to specify code behavior in natural language. However, the opaque decision-making of AI-driven coders raises trust and usability concerns, particularly for…

Software Engineering · Computer Science · 2025-08-11 · Xiangzhe Xu, Shiwei Feng, Zian Su, Chengpeng Wang, Xiangyu Zhang

Redundancy and Concept Analysis for Code-trained Language Models

Code-trained language models have proven to be highly effective for various code intelligence tasks. However, they can be challenging to train and deploy for many software engineering applications due to computational bottlenecks and memory…

Software Engineering · Computer Science · 2024-02-19 · Arushi Sharma, Zefu Hu, Christopher Quinn, Ali Jannesari

What is Learned in Visually Grounded Neural Syntax Acquisition

Visual features are a promising signal for learning bootstrap textual models. However, blackbox learning models make it difficult to isolate the specific contribution of visual components. In this analysis, we consider the case study of the…

Computation and Language · Computer Science · 2020-05-20 · Noriyuki Kojima, Hadar Averbuch-Elor, Alexander M. Rush, Yoav Artzi

Program Synthesis with Pragmatic Communication

Program synthesis techniques construct or infer programs from user-provided specifications, such as input-output examples. Yet most specifications, especially those given by end-users, leave the synthesis problem radically ill-posed,…

Artificial Intelligence · Computer Science · 2020-10-22 · Yewen Pu, Kevin Ellis, Marta Kryven, Josh Tenenbaum, Armando Solar-Lezama