Related papers: Towards Automatic Concept-based Explanations

Enhancing the Comprehensibility of Text Explanations via Unsupervised Concept Discovery

Concept-based explainable approaches have emerged as a promising method in explainable AI because they can interpret models in a way that aligns with human reasoning. However, their adaption in the text domain remains limited. Most existing…

Computation and Language · Computer Science 2025-05-27 Yifan Sun , Danding Wang , Qiang Sheng , Juan Cao , Jintao Li

Human-Centered Concept Explanations for Neural Networks

Understanding complex machine learning models such as deep neural networks with explanations is crucial in various applications. Many explanations stem from the model perspective, and may not necessarily effectively communicate why the…

Machine Learning · Computer Science 2022-02-28 Chih-Kuan Yeh , Been Kim , Pradeep Ravikumar

EDUCE: Explaining model Decisions through Unsupervised Concepts Extraction

Providing explanations along with predictions is crucial in some text processing tasks. Therefore, we propose a new self-interpretable model that performs output prediction and simultaneously provides an explanation in terms of the presence…

Machine Learning · Computer Science 2019-09-30 Diane Bouchacourt , Ludovic Denoyer

Towards Compositionality in Concept Learning

Concept-based interpretability methods offer a lens into the internals of foundation models by decomposing their embeddings into high-level concepts. These concept representations are most useful when they are compositional, meaning that…

Computation and Language · Computer Science 2024-06-27 Adam Stein , Aaditya Naik , Yinjun Wu , Mayur Naik , Eric Wong

A Holistic Approach to Unifying Automatic Concept Extraction and Concept Importance Estimation

In recent years, concept-based approaches have emerged as some of the most promising explainability methods to help us interpret the decisions of Artificial Neural Networks (ANNs). These methods seek to discover intelligible visual…

Machine Learning · Computer Science 2023-10-31 Thomas Fel , Victor Boutin , Mazda Moayeri , Rémi Cadène , Louis Bethune , Léo Andéol , Mathieu Chalvidal , Thomas Serre

Automatic Concept Embedding Model (ACEM): No train-time concepts, No issue!

Interpretability and explainability of neural networks is continuously increasing in importance, especially within safety-critical domains and to provide the social right to explanation. Concept based explanations align well with how humans…

Machine Learning · Computer Science 2023-09-11 Rishabh Jain

AcME -- Accelerated Model-agnostic Explanations: Fast Whitening of the Machine-Learning Black Box

In the context of human-in-the-loop Machine Learning applications, like Decision Support Systems, interpretability approaches should provide actionable insights without making the users wait. In this paper, we propose Accelerated…

Machine Learning · Computer Science 2021-12-24 David Dandolo , Chiara Masiero , Mattia Carletti , Davide Dalle Pezze , Gian Antonio Susto

Meaningful Models: Utilizing Conceptual Structure to Improve Machine Learning Interpretability

The last decade has seen huge progress in the development of advanced machine learning models; however, those models are powerless unless human users can interpret them. Here we show how the mind's construction of concepts and meaning can…

Machine Learning · Statistics 2016-07-04 Nick Condry

Concept Bottleneck Models Without Predefined Concepts

There has been considerable recent interest in interpretable concept-based models such as Concept Bottleneck Models (CBMs), which first predict human-interpretable concepts and then map them to output classes. To reduce reliance on…

Machine Learning · Computer Science 2024-07-08 Simon Schrodi , Julian Schur , Max Argus , Thomas Brox

Altruist: Argumentative Explanations through Local Interpretations of Predictive Models

Explainable AI is an emerging field providing solutions for acquiring insights into automated systems' rationale. It has been put on the AI map by suggesting ways to tackle key ethical and societal issues. Existing explanation techniques…

Machine Learning · Computer Science 2022-05-02 Ioannis Mollas , Nick Bassiliades , Grigorios Tsoumakas

Overlooked factors in concept-based explanations: Dataset choice, concept learnability, and human capability

Concept-based interpretability methods aim to explain deep neural network model predictions using a predefined set of semantic concepts. These methods evaluate a trained model on a new, "probe" dataset and correlate model predictions with…

Computer Vision and Pattern Recognition · Computer Science 2023-05-15 Vikram V. Ramaswamy , Sunnie S. Y. Kim , Ruth Fong , Olga Russakovsky

On Completeness-aware Concept-Based Explanations in Deep Neural Networks

Human explanations of high-level decisions are often expressed in terms of key concepts the decisions are based on. In this paper, we study such concept-based explainability for Deep Neural Networks (DNNs). First, we define the notion of…

Machine Learning · Computer Science 2022-02-09 Chih-Kuan Yeh , Been Kim , Sercan O. Arik , Chun-Liang Li , Tomas Pfister , Pradeep Ravikumar

A Concept and Argumentation based Interpretable Model in High Risk Domains

Interpretability has become an essential topic for artificial intelligence in some high-risk domains such as healthcare, bank and security. For commonly-used tabular data, traditional methods trained end-to-end machine learning models with…

Artificial Intelligence · Computer Science 2022-08-18 Haixiao Chi , Dawei Wang , Gaojie Cui , Feng Mao , Beishui Liao

Learning Interpretable Concept-Based Models with Human Feedback

Machine learning models that first learn a representation of a domain in terms of human-understandable concepts, then use it to make predictions, have been proposed to facilitate interpretation and interaction with models trained on…

Machine Learning · Computer Science 2020-12-08 Isaac Lage , Finale Doshi-Velez

Transparent Anomaly Detection via Concept-based Explanations

Advancements in deep learning techniques have given a boost to the performance of anomaly detection. However, real-world and safety-critical applications demand a level of transparency and reasoning beyond accuracy. The task of anomaly…

Machine Learning · Computer Science 2023-11-03 Laya Rafiee Sevyeri , Ivaxi Sheth , Farhood Farahnak , Samira Ebrahimi Kahou , Shirin Abbasinejad Enger

Comparing Feature Importance and Rule Extraction for Interpretability on Text Data

Complex machine learning algorithms are used more and more often in critical tasks involving text data, leading to the development of interpretability methods. Among local methods, two families have emerged: those computing importance…

Machine Learning · Computer Science 2025-10-22 Gianluigi Lopardo , Damien Garreau

Human-in-the-loop Extraction of Interpretable Concepts in Deep Learning Models

The interpretation of deep neural networks (DNNs) has become a key topic as more and more people apply them to solve various problems and making critical decisions. Concept-based explanations have recently become a popular approach for…

Human-Computer Interaction · Computer Science 2021-08-10 Zhenge Zhao , Panpan Xu , Carlos Scheidegger , Liu Ren

Interpreting Language Models Through Concept Descriptions: A Survey

Understanding the decision-making processes of neural networks is a central goal of mechanistic interpretability. In the context of Large Language Models (LLMs), this involves uncovering the underlying mechanisms and identifying the roles…

Computation and Language · Computer Science 2026-04-21 Nils Feldhus , Laura Kopf

Estimating Color-Concept Associations from Image Statistics

To interpret the meanings of colors in visualizations of categorical information, people must determine how distinct colors correspond to different concepts. This process is easier when assignments between colors and concepts in…

Human-Computer Interaction · Computer Science 2019-10-08 Ragini Rathore , Zachary Leggon , Laurent Lessard , Karen B. Schloss

Improving Explanations: Applying the Feature Understandability Scale for Cost-Sensitive Feature Selection

With the growing pervasiveness of artificial intelligence, the ability to explain the inferences made by machine learning models has become increasingly important. Numerous techniques for model explainability have been proposed, with…

Human-Computer Interaction · Computer Science 2026-04-08 Nicola Rossberg , Bennett Kleinberg , Barry O'Sullivan , Luca Longo , Andrea Visentin