Related papers: Explaining Image Classifiers by Counterfactual Gen…

Explaining Image Classifiers Using Contrastive Counterfactuals in Generative Latent Spaces

Despite their high accuracies, modern complex image classifiers cannot be trusted for sensitive tasks due to their unknown decision-making process and potential biases. Counterfactual explanations are very effective in providing…

Computer Vision and Pattern Recognition · Computer Science 2022-06-13 Kamran Alipour, Aditya Lahiri, Ehsan Adeli, Babak Salimi, Michael Pazzani

Counterfactual Generation Under Confounding

A machine learning model, under the influence of observed or unobserved confounders in the training data, can learn spurious correlations and fail to generalize when deployed. For image classifiers, augmenting a training dataset using…

Machine Learning · Computer Science 2022-12-13 Abbavaram Gowtham Reddy, Saloni Dash, Amit Sharma, Vineeth N Balasubramanian

GANterfactual - Counterfactual Explanations for Medical Non-Experts using Generative Adversarial Learning

With the ongoing rise of machine learning, the need for methods for explaining decisions made by artificial intelligence systems is becoming a more and more important topic. Especially for image classification tasks, many state-of-the-art…

Machine Learning · Computer Science 2022-05-10 Silvan Mertes, Tobias Huber, Katharina Weitz, Alexander Heimerl, Elisabeth André

Counterfactual Generation with Knockoffs

Human interpretability of deep neural networks' decisions is crucial, especially in domains where these directly affect human lives. Counterfactual explanations of already trained neural networks can be generated by perturbing input…

Computer Vision and Pattern Recognition · Computer Science 2021-02-02 Oana-Iuliana Popescu, Maha Shadaydeh, Joachim Denzler

From Visual Explanations to Counterfactual Explanations with Latent Diffusion

Visual counterfactual explanations are ideal hypothetical images that change the decision-making of the classifier with high confidence toward the desired class while remaining visually plausible and close to the initial image. In this…

Computer Vision and Pattern Recognition · Computer Science 2025-04-15 Tung Luu, Nam Le, Duc Le, Bac Le

Relevant Irrelevance: Generating Alterfactual Explanations for Image Classifiers

In this paper, we demonstrate the feasibility of alterfactual explanations for black box image classifiers. Traditional explanation mechanisms from the field of Counterfactual Thinking are a widely-used paradigm for Explainable Artificial…

Computer Vision and Pattern Recognition · Computer Science 2024-05-10 Silvan Mertes, Tobias Huber, Christina Karle, Katharina Weitz, Ruben Schlagowski, Cristina Conati, Elisabeth André

Explaining image classifiers by removing input features using generative models

Perturbation-based explanation methods often measure the contribution of an input feature to an image classifier's outputs by heuristically removing it via e.g. blurring, adding noise, or graying out, which often produce unrealistic,…

Machine Learning · Computer Science 2020-10-07 Chirag Agarwal, Anh Nguyen

Evaluating and Mitigating Bias in Image Classifiers: A Causal Perspective Using Counterfactuals

Counterfactual examples for an input -- perturbations that change specific features but not others -- have been shown to be useful for evaluating bias of machine learning models, e.g., against specific demographic groups. However,…

Computer Vision and Pattern Recognition · Computer Science 2022-01-07 Saloni Dash, Vineeth N Balasubramanian, Amit Sharma

Causal Generative Explainers using Counterfactual Inference: A Case Study on the Morpho-MNIST Dataset

In this paper, we propose leveraging causal generative learning as an interpretable tool for explaining image classifiers. Specifically, we present a generative counterfactual inference approach to study the influence of visual features…

Machine Learning · Computer Science 2024-01-23 Will Taylor-Melanson, Zahra Sadeghi, Stan Matwin

Counterfactual Image Editing

Counterfactual image editing is an important task in generative AI, which asks how an image would look if certain features were different. The current literature on the topic focuses primarily on changing individual features while remaining…

Computer Vision and Pattern Recognition · Computer Science 2024-03-18 Yushu Pan, Elias Bareinboim

Generating Counterfactual Explanations with Natural Language

Natural language explanations of deep neural network decisions provide an intuitive way for an AI agent to articulate a reasoning process. Current textual explanations learn to discuss class discriminative features in an image. However, it…

Computer Vision and Pattern Recognition · Computer Science 2018-06-27 Lisa Anne Hendricks, Ronghang Hu, Trevor Darrell, Zeynep Akata

Making Heads or Tails: Towards Semantically Consistent Visual Counterfactuals

A visual counterfactual explanation replaces image regions in a query image with regions from a distractor image such that the system's decision on the transformed image changes to the distractor class. In this work, we present a novel…

Computer Vision and Pattern Recognition · Computer Science 2022-07-19 Simon Vandenhende, Dhruv Mahajan, Filip Radenovic, Deepti Ghadiyaram

Seeing What Shouldn't Be There: Counterfactual GANs for Medical Image Attribution

Ascription of an image gives insights into the objects that influence the classification of the whole image or its pixels towards a specific category. These insights help radiologists to visualize deformities in medical imaging. Most of the…

Computer Vision and Pattern Recognition · Computer Science 2026-05-08 Shakeeb Murtaza

Counterfactual Generative Networks

Neural networks are prone to learning shortcuts -- they often model simple correlations, ignoring more complex ones that potentially generalize better. Prior works on image classification show that instead of learning a connection to object…

Machine Learning · Computer Science 2021-01-18 Axel Sauer, Andreas Geiger

DiG-IN: Diffusion Guidance for Investigating Networks -- Uncovering Classifier Differences Neuron Visualisations and Visual Counterfactual Explanations

While deep learning has led to huge progress in complex image classification tasks like ImageNet, unexpected failure modes, e.g. via spurious features, call into question how reliably these classifiers work in the wild. Furthermore, for…

Computer Vision and Pattern Recognition · Computer Science 2024-07-15 Maximilian Augustin, Yannic Neuhaus, Matthias Hein

Counterfactual Visual Explanations

In this work, we develop a technique to produce counterfactual visual explanations. Given a 'query' image $I$ for which a vision system predicts class $c$, a counterfactual visual explanation identifies how $I$ could change such that the…

Machine Learning · Computer Science 2019-06-12 Yash Goyal, Ziyan Wu, Jan Ernst, Dhruv Batra, Devi Parikh, Stefan Lee

Looking in the mirror: A faithful counterfactual explanation method for interpreting deep image classification models

Counterfactual explanations (CFE) for deep image classifiers aim to reveal how minimal input changes lead to different model decisions, providing critical insights for model interpretation and improvement. However, existing CFE methods…

Computer Vision and Pattern Recognition · Computer Science 2025-09-23 Townim Faisal Chowdhury, Vu Minh Hieu Phan, Kewen Liao, Nanyu Dong, Minh-Son To, Anton Hengel, Johan Verjans, Zhibin Liao

Explaining Low Perception Model Competency with High-Competency Counterfactuals

There exist many methods to explain how an image classification model generates its decision, but very little work has explored methods to explain why a classifier might lack confidence in its prediction. As there are various reasons the…

Computer Vision and Pattern Recognition · Computer Science 2026-01-12 Sara Pohland, Claire Tomlin

Viewing the process of generating counterfactuals as a source of knowledge: a new approach for explaining classifiers

There are now many explainable AI methods for understanding the decisions of a machine learning model. Among these are those based on counterfactual reasoning, which involve simulating feature changes and observing the impact on the…

Machine Learning · Computer Science 2024-04-15 Vincent Lemaire, Nathan Le Boudec, Victor Guyomard, Françoise Fessant

Counterfactual Image Generation for adversarially robust and interpretable Classifiers

Neural Image Classifiers are effective but inherently hard to interpret and susceptible to adversarial attacks. Solutions to both problems exist, among others, in the form of counterfactual examples generation to enhance explainability or…

Computer Vision and Pattern Recognition · Computer Science 2023-10-03 Rafael Bischof, Florian Scheidegger, Michael A. Kraus, A. Cristiano I. Malossi