Related papers: A Robust Prototype-Based Network with Interpretabl…

Robust Text Classification: Analyzing Prototype-Based Networks

Downstream applications often require text classification models to be accurate and robust. While the accuracy of the state-of-the-art Language Models (LMs) approximates human performance, they often exhibit a drop in performance on noisy…

Computation and Language · Computer Science 2024-10-29 Zhivar Sourati , Darshan Deshpande , Filip Ilievski , Kiril Gashteovski , Sascha Saralajew

Neural Probabilistic Circuits: Enabling Compositional and Interpretable Predictions through Logical Reasoning

End-to-end deep neural networks have achieved remarkable success across various domains but are often criticized for their lack of interpretability. While post hoc explanation methods attempt to address this issue, they often fail to…

Machine Learning · Computer Science 2025-01-22 Weixin Chen , Simon Yu , Huajie Shao , Lui Sha , Han Zhao

Robust Explainability: A Tutorial on Gradient-Based Attribution Methods for Deep Neural Networks

With the rise of deep neural networks, the challenge of explaining the predictions of these networks has become increasingly recognized. While many methods for explaining the decisions of deep neural networks exist, there is currently no…

Machine Learning · Computer Science 2022-07-13 Ian E. Nielsen , Dimah Dera , Ghulam Rasool , Nidhal Bouaynaya , Ravi P. Ramachandran

Deformable ProtoPNet: An Interpretable Image Classifier Using Deformable Prototypes

We present a deformable prototypical part network (Deformable ProtoPNet), an interpretable image classifier that integrates the power of deep learning and the interpretability of case-based reasoning. This model classifies input images by…

Computer Vision and Pattern Recognition · Computer Science 2024-05-06 Jon Donnelly , Alina Jade Barnett , Chaofan Chen

Interpretable Image Classification via Non-parametric Part Prototype Learning

Classifying images with an interpretable decision-making process is a long-standing problem in computer vision. In recent years, Prototypical Part Networks has gained traction as an approach for self-explainable neural networks, due to…

Computer Vision and Pattern Recognition · Computer Science 2025-03-14 Zhijie Zhu , Lei Fan , Maurice Pagnucco , Yang Song

Interpreting Robustness Proofs of Deep Neural Networks

In recent years numerous methods have been developed to formally verify the robustness of deep neural networks (DNNs). Though the proposed techniques are effective in providing mathematical guarantees about the DNNs behavior, it is not…

Machine Learning · Computer Science 2023-02-01 Debangshu Banerjee , Avaljot Singh , Gagandeep Singh

Robust Classification with Convolutional Prototype Learning

Convolutional neural networks (CNNs) have been widely used for image classification. Despite its high accuracies, CNN has been shown to be easily fooled by some adversarial examples, indicating that CNN is not robust enough for pattern…

Computer Vision and Pattern Recognition · Computer Science 2018-05-10 Hong-Ming Yang , Xu-Yao Zhang , Fei Yin , Cheng-Lin Liu

This Looks Like That: Deep Learning for Interpretable Image Recognition

When we are faced with challenging image classification tasks, we often explain our reasoning by dissecting the image, and pointing out prototypical aspects of one class or another. The mounting evidence for each of the classes helps us…

Machine Learning · Computer Science 2020-01-01 Chaofan Chen , Oscar Li , Chaofan Tao , Alina Jade Barnett , Jonathan Su , Cynthia Rudin

Uncertainty-Aware Explanations Through Probabilistic Self-Explainable Neural Networks

The lack of transparency of Deep Neural Networks continues to be a limitation that severely undermines their reliability and usage in high-stakes applications. Promising approaches to overcome such limitations are Prototype-Based…

Machine Learning · Computer Science 2025-07-21 Jon Vadillo , Roberto Santana , Jose A. Lozano , Marta Kwiatkowska

On the Lack of Robust Interpretability of Neural Text Classifiers

With the ever-increasing complexity of neural language models, practitioners have turned to methods for understanding the predictions of these models. One of the most well-adopted approaches for model interpretability is feature-based…

Computation and Language · Computer Science 2021-06-10 Muhammad Bilal Zafar , Michele Donini , Dylan Slack , Cédric Archambeau , Sanjiv Das , Krishnaram Kenthapadi

Interpretability Beyond Classification Output: Semantic Bottleneck Networks

Today's deep learning systems deliver high performance based on end-to-end training. While they deliver strong performance, these systems are hard to interpret. To address this issue, we propose Semantic Bottleneck Networks (SBN): deep…

Computer Vision and Pattern Recognition · Computer Science 2019-07-30 Max Losch , Mario Fritz , Bernt Schiele

Unifying Model Explainability and Robustness via Machine-Checkable Concepts

As deep neural networks (DNNs) get adopted in an ever-increasing number of applications, explainability has emerged as a crucial desideratum for these models. In many real-world tasks, one of the principal reasons for requiring…

Artificial Intelligence · Computer Science 2020-07-03 Vedant Nanda , Till Speicher , John P. Dickerson , Krishna P. Gummadi , Muhammad Bilal Zafar

Evaluation and Improvement of Interpretability for Self-Explainable Part-Prototype Networks

Part-prototype networks (e.g., ProtoPNet, ProtoTree, and ProtoPool) have attracted broad research interest for their intrinsic interpretability and comparable accuracy to non-interpretable counterparts. However, recent works find that the…

Computer Vision and Pattern Recognition · Computer Science 2023-10-26 Qihan Huang , Mengqi Xue , Wenqi Huang , Haofei Zhang , Jie Song , Yongcheng Jing , Mingli Song

Probabilistic Concept Bottleneck Models

Interpretable models are designed to make decisions in a human-interpretable manner. Representatively, Concept Bottleneck Models (CBM) follow a two-step process of concept prediction and class prediction based on the predicted concepts. CBM…

Machine Learning · Computer Science 2023-06-05 Eunji Kim , Dahuin Jung , Sangha Park , Siwon Kim , Sungroh Yoon

Why Robust Natural Language Understanding is a Challenge

With the proliferation of Deep Machine Learning into real-life applications, a particular property of this technology has been brought to attention: robustness Neural Networks notoriously present low robustness and can be highly sensitive…

Computation and Language · Computer Science 2022-07-14 Marco Casadio , Ekaterina Komendantskaya , Verena Rieser , Matthew L. Daggitt , Daniel Kienitz , Luca Arnaboldi , Wen Kokke

On the Robustness of Explanations of Deep Neural Network Models: A Survey

Explainability has been widely stated as a cornerstone of the responsible and trustworthy use of machine learning models. With the ubiquitous use of Deep Neural Network (DNN) models expanding to risk-sensitive and safety-critical domains,…

Machine Learning · Computer Science 2022-11-10 Amlan Jyoti , Karthik Balaji Ganesh , Manoj Gayala , Nandita Lakshmi Tunuguntla , Sandesh Kamath , Vineeth N Balasubramanian

Causal Interpretability for Adversarial Robustness: A Hybrid Generative Classification Approach

Deep learning-based discriminative classifiers, despite their remarkable success, remain vulnerable to adversarial examples that can mislead model predictions. While adversarial training can enhance robustness, it fails to address the…

Computer Vision and Pattern Recognition · Computer Science 2025-12-09 Chunheng Zhao , Pierluigi Pisu , Gurcan Comert , Negash Begashaw , Varghese Vaidyan , Nina Christine Hubig

Robust Models Are More Interpretable Because Attributions Look Normal

Recent work has found that adversarially-robust deep networks used for image classification are more interpretable: their feature attributions tend to be sharper, and are more concentrated on the objects associated with the image's…

Machine Learning · Computer Science 2021-10-07 Zifan Wang , Matt Fredrikson , Anupam Datta

Interpretable Deep Learning: Interpretation, Interpretability, Trustworthiness, and Beyond

Deep neural networks have been well-known for their superb handling of various machine learning and artificial intelligence tasks. However, due to their over-parameterized black-box nature, it is often difficult to understand the prediction…

Machine Learning · Computer Science 2022-07-18 Xuhong Li , Haoyi Xiong , Xingjian Li , Xuanyu Wu , Xiao Zhang , Ji Liu , Jiang Bian , Dejing Dou

Understanding and Improving Adversarial Robustness of Neural Probabilistic Circuits

Neural Probabilistic Circuits (NPCs), a new class of concept bottleneck models, comprise an attribute recognition model and a probabilistic circuit for reasoning. By integrating the outputs from these two modules, NPCs produce compositional…

Machine Learning · Computer Science 2025-09-26 Weixin Chen , Han Zhao