Related papers: Debiasing Concept-based Explanations with Causal A…

Debiasing Multimodal Models via Causal Information Minimization

Most existing debiasing methods for multimodal models, including causal intervention and inference methods, utilize approximate heuristics to represent the biases, such as shallow features from early stages of training or unimodal features…

Machine Learning · Computer Science 2023-11-29 Vaidehi Patil , Adyasha Maharana , Mohit Bansal

Concept-Based Abductive and Contrastive Explanations for Behaviors of Vision Models

*Concept-based explanations* offer a promising approach for explaining the predictions of deep neural networks in terms of high-level, human-understandable concepts. However, existing methods either do not establish a causal connection…

Machine Learning · Computer Science 2026-05-08 Ronaldo Canizales , Divya Gopinath , Corina Păsăreanu , Ravi Mangal

Estimation of Concept Explanations Should be Uncertainty Aware

Model explanations can be valuable for interpreting and debugging predictive models. We study a specific kind called Concept Explanations, where the goal is to interpret a model using human-understandable concepts. Although popular for…

Machine Learning · Computer Science 2024-04-08 Vihari Piratla , Juyeon Heo , Katherine M. Collins , Sukriti Singh , Adrian Weller

Overlooked factors in concept-based explanations: Dataset choice, concept learnability, and human capability

Concept-based interpretability methods aim to explain deep neural network model predictions using a predefined set of semantic concepts. These methods evaluate a trained model on a new, "probe" dataset and correlate model predictions with…

Computer Vision and Pattern Recognition · Computer Science 2023-05-15 Vikram V. Ramaswamy , Sunnie S. Y. Kim , Ruth Fong , Olga Russakovsky

Visual Data Diagnosis and Debiasing with Concept Graphs

The widespread success of deep learning models today is owed to the curation of extensive datasets significant in size and complexity. However, such models frequently pick up inherent biases in the data during the training process, leading…

Computer Vision and Pattern Recognition · Computer Science 2024-11-12 Rwiddhi Chakraborty , Yinong Wang , Jialu Gao , Runkai Zheng , Cheng Zhang , Fernando De la Torre

Cross-modal Counterfactual Explanations: Uncovering Decision Factors and Dataset Biases in Subjective Classification

Concept-driven counterfactuals explain decisions of classifiers by altering the model predictions through semantic changes. In this paper, we present a novel approach that leverages cross-modal decompositionality and image-specific concepts…

Computer Vision and Pattern Recognition · Computer Science 2025-12-23 Alina Elena Baia , Andrea Cavallaro

Understanding Inter-Concept Relationships in Concept-Based Models

Concept-based explainability methods provide insight into deep learning systems by constructing explanations using human-understandable concepts. While the literature on human reasoning demonstrates that we exploit relationships between…

Machine Learning · Computer Science 2024-05-29 Naveen Raman , Mateo Espinosa Zarlenga , Mateja Jamnik

Causal Analysis for Robust Interpretability of Neural Networks

Interpreting the inner function of neural networks is crucial for the trustworthy development and deployment of these black-box models. Prior interpretability methods focus on correlation-based measures to attribute model decisions to…

Machine Learning · Computer Science 2023-06-21 Ola Ahmad , Nicolas Bereux , Loïc Baret , Vahid Hashemi , Freddy Lecue

Explaining Visual Models by Causal Attribution

Model explanations based on pure observational data cannot compute the effects of features reliably, due to their inability to estimate how each factor alteration could affect the rest. We argue that explanations should be based on the…

Machine Learning · Statistics 2019-09-20 Álvaro Parafita , Jordi Vitrià

Using Causal Analysis for Conceptual Deep Learning Explanation

Model explainability is essential for the creation of trustworthy Machine Learning models in healthcare. An ideal explanation resembles the decision-making process of a domain expert and is expressed using concepts or terminology that is…

Machine Learning · Computer Science 2021-07-14 Sumedha Singla , Stephen Wallace , Sofia Triantafillou , Kayhan Batmanghelich

Explanatory causal effects for model agnostic explanations

This paper studies the problem of estimating the contributions of features to the prediction of a specific instance by a machine learning model and the overall contribution of a feature to the model. The causal effect of a feature…

Machine Learning · Computer Science 2022-06-24 Jiuyong Li , Ha Xuan Tran , Thuc Duy Le , Lin Liu , Kui Yu , Jixue Liu

An Axiomatic Approach to Model-Agnostic Concept Explanations

Concept explanation is a popular approach for examining how human-interpretable concepts impact the predictions of a model. However, most existing methods for concept explanations are tailored to specific models. To address this issue, this…

Machine Learning · Computer Science 2024-01-17 Zhili Feng , Michal Moshkovitz , Dotan Di Castro , J. Zico Kolter

Do Users Benefit From Interpretable Vision? A User Study, Baseline, And Dataset

A variety of methods exist to explain image classification models. However, whether they provide any benefit to users over simply comparing various inputs and the model's respective predictions remains unclear. We conducted a user study…

Machine Learning · Computer Science 2022-04-26 Leon Sixt , Martin Schuessler , Oana-Iuliana Popescu , Philipp Weiß , Tim Landgraf

Deconfounding Scores: Feature Representations for Causal Effect Estimation with Weak Overlap

A key condition for obtaining reliable estimates of the causal effect of a treatment is overlap (a.k.a. positivity): the distributions of the features used to perform causal adjustment cannot be too different in the treated and control…

Methodology · Statistics 2021-04-14 Alexander D'Amour , Alexander Franks

Causal Inference with Categorical Unobserved Confounder via Mixture Learning

Unobserved confounding is a fundamental challenge for estimating causal effects. To address unobserved confounding, recent literature has turned to two different approaches -- proxy variables and the use of multiple treatments. The first…

Methodology · Statistics 2026-05-20 Aytijhya Saha , Stephen Bates , Devavrat Shah

Balancing Unobserved Confounding with a Few Unbiased Ratings in Debiased Recommendations

Recommender systems are seen as an effective tool to address information overload, but it is widely known that the presence of various biases makes direct training on large-scale observational data result in sub-optimal prediction…

Information Retrieval · Computer Science 2023-04-19 Haoxuan Li , Yanghao Xiao , Chunyuan Zheng , Peng Wu

Detecting and Measuring Confounding Using Causal Mechanism Shifts

Detecting and measuring confounding effects from data is a key challenge in causal inference. Existing methods frequently assume causal sufficiency, disregarding the presence of unobserved confounding variables. Causal sufficiency is both…

Artificial Intelligence · Computer Science 2024-09-27 Abbavaram Gowtham Reddy , Vineeth N Balasubramanian

EDUCE: Explaining model Decisions through Unsupervised Concepts Extraction

Providing explanations along with predictions is crucial in some text processing tasks. Therefore, we propose a new self-interpretable model that performs output prediction and simultaneously provides an explanation in terms of the presence…

Machine Learning · Computer Science 2019-09-30 Diane Bouchacourt , Ludovic Denoyer

Concept Based Explanations and Class Contrasting

Explaining deep neural networks is challenging, due to their large size and non-linearity. In this paper, we introduce a concept-based explanation method, in order to explain the prediction for an individual class, as well as contrasting…

Computer Vision and Pattern Recognition · Computer Science 2025-06-03 Rudolf Herdt , Daniel Otero Baguer

Doubly Debiased Lasso: High-Dimensional Inference under Hidden Confounding

Inferring causal relationships or related associations from observational data can be invalidated by the existence of hidden confounding. We focus on a high-dimensional linear regression setting, where the measured covariates are affected…

Methodology · Statistics 2021-07-22 Zijian Guo , Domagoj Ćevid , Peter Bühlmann