Related papers: Explaining Classifiers with Causal Concept Effect …

MCCE: Missingness-aware Causal Concept Explainer

Causal concept effect estimation is gaining increasing interest in the field of interpretable machine learning. This general approach explains the behaviors of machine learning models by estimating the causal effect of human-understandable…

Machine Learning · Computer Science 2024-11-15 Jifan Gao, Guanhua Chen

Unsupervised Causal Binary Concepts Discovery with VAE for Black-box Model Explanation

We aim to explain a black-box classifier with the form: "data X is classified as class Y because X has A, B and does not have C" in which A, B, and C are high-level concepts. The challenge is that we have to discover in an…

Machine Learning · Computer Science 2021-09-13 Thien Q. Tran, Kazuto Fukuchi, Youhei Akimoto, Jun Sakuma

Towards Principled Causal Effect Estimation by Deep Identifiable Models

As an important problem in causal inference, we discuss the estimation of treatment effects (TEs). Representing the confounder as a latent variable, we propose Intact-VAE, a new variant of variational autoencoder (VAE), motivated by the…

Machine Learning · Statistics 2022-04-22 Pengzhou Wu, Kenji Fukumizu

A Critical Look at the Consistency of Causal Estimation With Deep Latent Variable Models

Using deep latent variable models in causal inference has attracted considerable interest recently, but an essential open question is their ability to yield consistent causal estimates. While they have demonstrated promising results and…

Machine Learning · Computer Science 2022-01-25 Severi Rissanen, Pekka Marttinen

Causal Learning and Explanation of Deep Neural Networks via Autoencoded Activations

Deep neural networks are complex and opaque. As they enter application in a variety of important and safety critical domains, users seek methods to explain their output predictions. We develop an approach to explaining deep neural networks…

Artificial Intelligence · Computer Science 2018-02-05 Michael Harradon, Jeff Druce, Brian Ruttenberg

VAE-CE: Visual Contrastive Explanation using Disentangled VAEs

The goal of a classification model is to assign the correct labels to data. In most cases, this data is not fully described by the given set of labels. Often a rich set of meaningful concepts exist in the domain that can much more precisely…

Machine Learning · Computer Science 2021-08-23 Yoeri Poels, Vlado Menkovski

CAuSE: Decoding Multimodal Classifiers using Faithful Natural Language Explanation

Multimodal classifiers function as opaque black box models. While several techniques exist to interpret their predictions, very few of them are as intuitive and accessible as natural language explanations (NLEs). To build trust, such…

Computation and Language · Computer Science 2025-12-09 Dibyanayan Bandyopadhyay, Soham Bhattacharjee, Mohammed Hasanuzzaman, Asif Ekbal

Causal Effect Inference with Deep Latent-Variable Models

Learning individual-level causal effects from observational data, such as inferring the most effective medication for a specific patient, is a problem of growing importance for policy makers. The most important aspect of inferring causal…

Machine Learning · Statistics 2017-11-07 Christos Louizos, Uri Shalit, Joris Mooij, David Sontag, Richard Zemel, Max Welling

Concept-SAE: Active Causal Probing of Visual Model Behavior

Standard Sparse Autoencoders (SAEs) excel at discovering a dictionary of a model's learned features, offering a powerful observational lens. However, the ambiguous and ungrounded nature of these features makes them unreliable instruments…

Machine Learning · Computer Science 2025-09-29 Jianrong Ding, Muxi Chen, Chenchen Zhao, Qiang Xu

Causal Decision Making and Causal Effect Estimation Are Not the Same... and Why It Matters

Causal decision making (CDM) based on machine learning has become a routine part of business. Businesses algorithmically target offers, incentives, and recommendations to affect consumer behavior. Recently, we have seen an acceleration of…

Machine Learning · Statistics 2021-10-01 Carlos Fernández-Loría, Foster Provost

Causal Concept Graph Models: Beyond Causal Opacity in Deep Learning

Causal opacity denotes the difficulty in understanding the "hidden" causal structure underlying the decisions of deep neural network (DNN) models. This leads to the inability to rely on and verify state-of-the-art DNN-based systems,…

Machine Learning · Computer Science 2025-04-02 Gabriele Dominici, Pietro Barbiero, Mateo Espinosa Zarlenga, Alberto Termine, Martin Gjoreski, Giuseppe Marra, Marc Langheinrich

Quantifying Error in the Presence of Confounders for Causal Inference

Estimating average causal effect (ACE) is useful whenever we want to know the effect of an intervention on a given outcome. In the absence of a randomized experiment, many methods such as stratification and inverse propensity weighting have…

Machine Learning · Computer Science 2019-07-11 Rathin Desai, Amit Sharma

Enhancing Causal Effect Estimation with Diffusion-Generated Data

Estimating causal effects from observational data is inherently challenging due to the lack of observable counterfactual outcomes and even the presence of unmeasured confounding. Traditional methods often rely on restrictive, untestable…

Methodology · Statistics 2025-04-07 Li Chen, Xiaotong Shen, Wei Pan

Towards Compositionality in Concept Learning

Concept-based interpretability methods offer a lens into the internals of foundation models by decomposing their embeddings into high-level concepts. These concept representations are most useful when they are compositional, meaning that…

Computation and Language · Computer Science 2024-06-27 Adam Stein, Aaditya Naik, Yinjun Wu, Mayur Naik, Eric Wong

Interpreting Low-level Vision Models with Causal Effect Maps

Deep neural networks have significantly improved the performance of low-level vision tasks but also increased the difficulty of interpretability. A deep understanding of deep models is beneficial for both network design and practical…

Computer Vision and Pattern Recognition · Computer Science 2025-04-01 Jinfan Hu, Jinjin Gu, Shiyao Yu, Fanghua Yu, Zheyuan Li, Zhiyuan You, Chaochao Lu, Chao Dong

DiG-IN: Diffusion Guidance for Investigating Networks -- Uncovering Classifier Differences Neuron Visualisations and Visual Counterfactual Explanations

While deep learning has led to huge progress in complex image classification tasks like ImageNet, unexpected failure modes, e.g. via spurious features, call into question how reliably these classifiers work in the wild. Furthermore, for…

Computer Vision and Pattern Recognition · Computer Science 2024-07-15 Maximilian Augustin, Yannic Neuhaus, Matthias Hein

Human-Centered Concept Explanations for Neural Networks

Understanding complex machine learning models such as deep neural networks with explanations is crucial in various applications. Many explanations stem from the model perspective, and may not necessarily effectively communicate why the…

Machine Learning · Computer Science 2022-02-28 Chih-Kuan Yeh, Been Kim, Pradeep Ravikumar

When Causal Intervention Meets Adversarial Examples and Image Masking for Deep Neural Networks

Discovering and exploiting the causality in deep neural networks (DNNs) are crucial challenges for understanding and reasoning causal effects (CE) on an explainable visual model. "Intervention" has been widely used for recognizing a causal…

Computer Vision and Pattern Recognition · Computer Science 2021-10-11 Chao-Han Huck Yang, Yi-Chieh Liu, Pin-Yu Chen, Xiaoli Ma, Yi-Chang James Tsai

Accurate Explanation Model for Image Classifiers using Class Association Embedding

Image classification is a primary task in data analysis where explainable models are crucially demanded in various applications. Although amounts of methods have been proposed to obtain explainable knowledge from the black-box classifiers,…

Computer Vision and Pattern Recognition · Computer Science 2024-12-31 Ruitao Xie, Jingbang Chen, Limai Jiang, Rui Xiao, Yi Pan, Yunpeng Cai

Using Causal Analysis for Conceptual Deep Learning Explanation

Model explainability is essential for the creation of trustworthy Machine Learning models in healthcare. An ideal explanation resembles the decision-making process of a domain expert and is expressed using concepts or terminology that is…

Machine Learning · Computer Science 2021-07-14 Sumedha Singla, Stephen Wallace, Sofia Triantafillou, Kayhan Batmanghelich