Related papers: Concept Bottleneck Models

Concept Complement Bottleneck Model for Interpretable Medical Image Diagnosis

Models based on human-understandable concepts have received extensive attention to improve model interpretability for trustworthy artificial intelligence in the field of medical image analysis. These methods can provide convincing…

Computer Vision and Pattern Recognition · Computer Science 2024-12-25 Hongmei Wang , Junlin Hou , Hao Chen

Integrating Clinical Knowledge into Concept Bottleneck Models

Concept bottleneck models (CBMs), which predict human-interpretable concepts (e.g., nucleus shapes in cell images) before predicting the final output (e.g., cell type), provide insights into the decision-making processes of the model.…

Computer Vision and Pattern Recognition · Computer Science 2024-07-10 Winnie Pang , Xueyi Ke , Satoshi Tsutsui , Bihan Wen

Diverse Concept Proposals for Concept Bottleneck Models

Concept bottleneck models are interpretable predictive models that are often used in domains where model trust is a key priority, such as healthcare. They identify a small number of human-interpretable concepts in the data, which they then…

Machine Learning · Computer Science 2024-12-25 Katrina Brown , Marton Havasi , Finale Doshi-Velez

Do Concept Bottleneck Models Learn as Intended?

Concept bottleneck models map from raw inputs to concepts, and then from concepts to targets. Such models aim to incorporate pre-specified, high-level concepts into the learning procedure, and have been motivated to meet three desiderata:…

Machine Learning · Computer Science 2021-05-11 Andrei Margeloiu , Matthew Ashman , Umang Bhatt , Yanzhi Chen , Mateja Jamnik , Adrian Weller

A Closer Look at the Intervention Procedure of Concept Bottleneck Models

Concept bottleneck models (CBMs) are a class of interpretable neural network models that predict the target response of a given input based on its high-level concepts. Unlike the standard end-to-end models, CBMs enable domain experts to…

Machine Learning · Computer Science 2023-07-04 Sungbin Shin , Yohan Jo , Sungsoo Ahn , Namhoon Lee

Concept Embedding Models: Beyond the Accuracy-Explainability Trade-Off

Deploying AI-powered systems requires trustworthy models supporting effective human interactions, going beyond raw prediction accuracy. Concept bottleneck models promote trustworthiness by conditioning classification tasks on an…

Machine Learning · Computer Science 2023-04-27 Mateo Espinosa Zarlenga , Pietro Barbiero , Gabriele Ciravegna , Giuseppe Marra , Francesco Giannini , Michelangelo Diligenti , Zohreh Shams , Frederic Precioso , Stefano Melacci , Adrian Weller , Pietro Lio , Mateja Jamnik

Can we Constrain Concept Bottleneck Models to Learn Semantically Meaningful Input Features?

Concept Bottleneck Models (CBMs) are regarded as inherently interpretable because they first predict a set of human-defined concepts which are used to predict a task label. For inherent interpretability to be fully realised, and ensure…

Machine Learning · Computer Science 2024-07-31 Jack Furby , Daniel Cunnington , Dave Braines , Alun Preece

Interactive Concept Bottleneck Models

Concept bottleneck models (CBMs) are interpretable neural networks that first predict labels for human-interpretable concepts relevant to the prediction task, and then predict the final label based on the concept label predictions. We…

Machine Learning · Computer Science 2023-04-28 Kushal Chauhan , Rishabh Tiwari , Jan Freyberg , Pradeep Shenoy , Krishnamurthy Dvijotham

Cross-Modal Conceptualization in Bottleneck Models

Concept Bottleneck Models (CBMs) assume that training examples (e.g., x-ray images) are annotated with high-level concepts (e.g., types of abnormalities), and perform classification by first predicting the concepts, followed by predicting…

Computation and Language · Computer Science 2023-12-19 Danis Alukaev , Semen Kiselev , Ilya Pershin , Bulat Ibragimov , Vladimir Ivanov , Alexey Kornaev , Ivan Titov

Towards Fine-Grained and Verifiable Concept Bottleneck Models

Concept Bottleneck Models (CBMs) offer interpretable alternatives to black-box predictors by introducing human-relatable concepts before the final output. However, existing CBMs struggle to verify whether predicted concepts correspond to…

Machine Learning · Computer Science 2026-05-15 Yingying Fang , Haijie Xu , Shuang Wu , Mariathasan Anish , Guang Yang

Graph Concept Bottleneck Models

Concept Bottleneck Models (CBMs) provide explicit interpretations for deep neural networks through concepts and allow intervention with concepts to adjust final predictions. Existing CBMs assume concepts are conditionally independent given…

Machine Learning · Computer Science 2026-05-04 Haotian Xu , Tsui-Wei Weng , Lam M. Nguyen , Tengfei Ma

Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models

Concept Bottleneck Models (CBMs) ground image classification on human-understandable concepts to allow for interpretable model decisions. Crucially, the CBM design inherently allows for human interventions, in which expert users are given…

Machine Learning · Computer Science 2024-08-07 Nishad Singhi , Jae Myung Kim , Karsten Roth , Zeynep Akata

Concept Bottleneck Models Without Predefined Concepts

There has been considerable recent interest in interpretable concept-based models such as Concept Bottleneck Models (CBMs), which first predict human-interpretable concepts and then map them to output classes. To reduce reliance on…

Machine Learning · Computer Science 2024-07-08 Simon Schrodi , Julian Schur , Max Argus , Thomas Brox

Rethinking Concept Bottleneck Models: From Pitfalls to Solutions

Concept Bottleneck Models (CBMs) ground predictions in human-understandable concepts but face fundamental limitations: the absence of a metric to pre-evaluate concept relevance, the "linearity problem" causing recent CBMs to bypass the…

Computer Vision and Pattern Recognition · Computer Science 2026-03-09 Merve Tapli , Quentin Bouniot , Wolfgang Stammer , Zeynep Akata , Emre Akbas

Post-hoc Concept Bottleneck Models

Concept Bottleneck Models (CBMs) map the inputs onto a set of interpretable concepts (``the bottleneck'') and use the concepts to make predictions. A concept bottleneck enhances interpretability since it can be investigated to understand…

Machine Learning · Computer Science 2023-02-03 Mert Yuksekgonul , Maggie Wang , James Zou

Towards a Deeper Understanding of Concept Bottleneck Models Through End-to-End Explanation

Concept Bottleneck Models (CBMs) first map raw input(s) to a vector of human-defined concepts, before using this vector to predict a final classification. We might therefore expect CBMs capable of predicting concepts based on distinct…

Artificial Intelligence · Computer Science 2023-02-08 Jack Furby , Daniel Cunnington , Dave Braines , Alun Preece

Stochastic Concept Bottleneck Models

Concept Bottleneck Models (CBMs) have emerged as a promising interpretable method whose final prediction is based on intermediate, human-understandable concepts rather than the raw input. Through time-consuming manual interventions, a user…

Machine Learning · Computer Science 2024-10-18 Moritz Vandenhirtz , Sonia Laguna , Ričards Marcinkevičs , Julia E. Vogt

There Was Never a Bottleneck in Concept Bottleneck Models

Deep learning representations are often difficult to interpret, which can hinder their deployment in sensitive applications. Concept Bottleneck Models (CBMs) have emerged as a promising approach to mitigate this issue by learning…

Machine Learning · Computer Science 2026-01-30 Antonio Almudévar , José Miguel Hernández-Lobato , Alfonso Ortega

Explain via Any Concept: Concept Bottleneck Model with Open Vocabulary Concepts

The concept bottleneck model (CBM) is an interpretable-by-design framework that makes decisions by first predicting a set of interpretable concepts, and then predicting the class label based on the given concepts. Existing CBMs are trained…

Computer Vision and Pattern Recognition · Computer Science 2024-08-06 Andong Tan , Fengtao Zhou , Hao Chen

Beyond Concept Bottleneck Models: How to Make Black Boxes Intervenable?

Recently, interpretable machine learning has re-explored concept bottleneck models (CBM). An advantage of this model class is the user's ability to intervene on predicted concept values, affecting the downstream output. In this work, we…

Machine Learning · Computer Science 2024-10-29 Sonia Laguna , Ričards Marcinkevičs , Moritz Vandenhirtz , Julia E. Vogt