English
Related papers

Related papers: Explaining Classifiers with Causal Concept Effect …

200 papers

Causal concept effect estimation is gaining increasing interest in the field of interpretable machine learning. This general approach explains the behaviors of machine learning models by estimating the causal effect of human-understandable…

Machine Learning · Computer Science 2024-11-15 Jifan Gao , Guanhua Chen

We aim to explain a black-box classifier with the form: `data X is classified as class Y because X \textit{has} A, B and \textit{does not have} C' in which A, B, and C are high-level concepts. The challenge is that we have to discover in an…

Machine Learning · Computer Science 2021-09-13 Thien Q. Tran , Kazuto Fukuchi , Youhei Akimoto , Jun Sakuma

As an important problem in causal inference, we discuss the estimation of treatment effects (TEs). Representing the confounder as a latent variable, we propose Intact-VAE, a new variant of variational autoencoder (VAE), motivated by the…

Machine Learning · Statistics 2022-04-22 Pengzhou Wu , Kenji Fukumizu

Using deep latent variable models in causal inference has attracted considerable interest recently, but an essential open question is their ability to yield consistent causal estimates. While they have demonstrated promising results and…

Machine Learning · Computer Science 2022-01-25 Severi Rissanen , Pekka Marttinen

Deep neural networks are complex and opaque. As they enter application in a variety of important and safety critical domains, users seek methods to explain their output predictions. We develop an approach to explaining deep neural networks…

Artificial Intelligence · Computer Science 2018-02-05 Michael Harradon , Jeff Druce , Brian Ruttenberg

The goal of a classification model is to assign the correct labels to data. In most cases, this data is not fully described by the given set of labels. Often a rich set of meaningful concepts exist in the domain that can much more precisely…

Machine Learning · Computer Science 2021-08-23 Yoeri Poels , Vlado Menkovski

Multimodal classifiers function as opaque black box models. While several techniques exist to interpret their predictions, very few of them are as intuitive and accessible as natural language explanations (NLEs). To build trust, such…

Computation and Language · Computer Science 2025-12-09 Dibyanayan Bandyopadhyay , Soham Bhattacharjee , Mohammed Hasanuzzaman , Asif Ekbal

Learning individual-level causal effects from observational data, such as inferring the most effective medication for a specific patient, is a problem of growing importance for policy makers. The most important aspect of inferring causal…

Machine Learning · Statistics 2017-11-07 Christos Louizos , Uri Shalit , Joris Mooij , David Sontag , Richard Zemel , Max Welling

Standard Sparse Autoencoders (SAEs) excel at discovering a dictionary of a model's learned features, offering a powerful observational lens. However, the ambiguous and ungrounded nature of these features makes them unreliable instruments…

Machine Learning · Computer Science 2025-09-29 Jianrong Ding , Muxi Chen , Chenchen Zhao , Qiang Xu

Causal decision making (CDM) based on machine learning has become a routine part of business. Businesses algorithmically target offers, incentives, and recommendations to affect consumer behavior. Recently, we have seen an acceleration of…

Machine Learning · Statistics 2021-10-01 Carlos Fernández-Loría , Foster Provost

Causal opacity denotes the difficulty in understanding the "hidden" causal structure underlying the decisions of deep neural network (DNN) models. This leads to the inability to rely on and verify state-of-the-art DNN-based systems,…

Estimating average causal effect (ACE) is useful whenever we want to know the effect of an intervention on a given outcome. In the absence of a randomized experiment, many methods such as stratification and inverse propensity weighting have…

Machine Learning · Computer Science 2019-07-11 Rathin Desai , Amit Sharma

Estimating causal effects from observational data is inherently challenging due to the lack of observable counterfactual outcomes and even the presence of unmeasured confounding. Traditional methods often rely on restrictive, untestable…

Methodology · Statistics 2025-04-07 Li Chen , Xiaotong Shen , Wei Pan

Concept-based interpretability methods offer a lens into the internals of foundation models by decomposing their embeddings into high-level concepts. These concept representations are most useful when they are compositional, meaning that…

Computation and Language · Computer Science 2024-06-27 Adam Stein , Aaditya Naik , Yinjun Wu , Mayur Naik , Eric Wong

Deep neural networks have significantly improved the performance of low-level vision tasks but also increased the difficulty of interpretability. A deep understanding of deep models is beneficial for both network design and practical…

Computer Vision and Pattern Recognition · Computer Science 2025-04-01 Jinfan Hu , Jinjin Gu , Shiyao Yu , Fanghua Yu , Zheyuan Li , Zhiyuan You , Chaochao Lu , Chao Dong

While deep learning has led to huge progress in complex image classification tasks like ImageNet, unexpected failure modes, e.g. via spurious features, call into question how reliably these classifiers work in the wild. Furthermore, for…

Computer Vision and Pattern Recognition · Computer Science 2024-07-15 Maximilian Augustin , Yannic Neuhaus , Matthias Hein

Understanding complex machine learning models such as deep neural networks with explanations is crucial in various applications. Many explanations stem from the model perspective, and may not necessarily effectively communicate why the…

Machine Learning · Computer Science 2022-02-28 Chih-Kuan Yeh , Been Kim , Pradeep Ravikumar

Discovering and exploiting the causality in deep neural networks (DNNs) are crucial challenges for understanding and reasoning causal effects (CE) on an explainable visual model. "Intervention" has been widely used for recognizing a causal…

Computer Vision and Pattern Recognition · Computer Science 2021-10-11 Chao-Han Huck Yang , Yi-Chieh Liu , Pin-Yu Chen , Xiaoli Ma , Yi-Chang James Tsai

Image classification is a primary task in data analysis where explainable models are crucially demanded in various applications. Although amounts of methods have been proposed to obtain explainable knowledge from the black-box classifiers,…

Computer Vision and Pattern Recognition · Computer Science 2024-12-31 Ruitao Xie , Jingbang Chen , Limai Jiang , Rui Xiao , Yi Pan , Yunpeng Cai

Model explainability is essential for the creation of trustworthy Machine Learning models in healthcare. An ideal explanation resembles the decision-making process of a domain expert and is expressed using concepts or terminology that is…

Machine Learning · Computer Science 2021-07-14 Sumedha Singla , Stephen Wallace , Sofia Triantafillou , Kayhan Batmanghelich
‹ Prev 1 2 3 10 Next ›