Related papers: Interpretable Concept-Based Memory Reasoning

Interpretable Hierarchical Concept Reasoning through Attention-Guided Graph Learning

Concept-Based Models (CBMs) are a class of deep learning models that provide interpretability by explaining predictions through high-level concepts. These models first predict concepts and then use them to perform a downstream task.…

Machine Learning · Computer Science 2025-06-27 David Debot , Pietro Barbiero , Gabriele Dominici , Giuseppe Marra

Interpretable Neural-Symbolic Concept Reasoning

Deep learning methods are highly accurate, yet their opaque decision process prevents them from earning full human trust. Concept-based models aim to address this issue by learning tasks based on a set of human-understandable concepts.…

Artificial Intelligence · Computer Science 2024-05-02 Pietro Barbiero , Gabriele Ciravegna , Francesco Giannini , Mateo Espinosa Zarlenga , Lucie Charlotte Magister , Alberto Tonda , Pietro Lio' , Frederic Precioso , Mateja Jamnik , Giuseppe Marra

Language Guided Concept Bottleneck Models for Interpretable Continual Learning

Continual learning (CL) aims to enable learning systems to acquire new knowledge constantly without forgetting previously learned information. CL faces the challenge of mitigating catastrophic forgetting while maintaining interpretability…

Computer Vision and Pattern Recognition · Computer Science 2025-04-01 Lu Yu , Haoyu Han , Zhe Tao , Hantao Yao , Changsheng Xu

The Impact of Concept Explanations and Interventions on Human-Machine Collaboration

Deep Neural Networks (DNNs) are often considered black boxes due to their opaque decision-making processes. To reduce their opacity Concept Models (CMs), such as Concept Bottleneck Models (CBMs), were introduced to predict human-defined…

Human-Computer Interaction · Computer Science 2025-12-02 Jack Furby , Dan Cunnington , Dave Braines , Alun Preece

Towards Fine-Grained and Verifiable Concept Bottleneck Models

Concept Bottleneck Models (CBMs) offer interpretable alternatives to black-box predictors by introducing human-relatable concepts before the final output. However, existing CBMs struggle to verify whether predicted concepts correspond to…

Machine Learning · Computer Science 2026-05-15 Yingying Fang , Haijie Xu , Shuang Wu , Mariathasan Anish , Guang Yang

Relational Concept Bottleneck Models

The design of interpretable deep learning models working in relational domains poses an open challenge: interpretable deep learning methods, such as Concept Bottleneck Models (CBMs), are not designed to solve relational problems, while…

Machine Learning · Computer Science 2024-10-28 Pietro Barbiero , Francesco Giannini , Gabriele Ciravegna , Michelangelo Diligenti , Giuseppe Marra

Uncertainty-Aware Concept Bottleneck Models with Enhanced Interpretability

In the context of image classification, Concept Bottleneck Models (CBMs) first embed images into a set of human-understandable concepts, followed by an intrinsically interpretable classifier that predicts labels based on these intermediate…

Computer Vision and Pattern Recognition · Computer Science 2025-10-02 Haifei Zhang , Patrick Barry , Eduardo Brandao

Vision-Language Models Encode Clinical Guidelines for Concept-Based Medical Reasoning

Concept Bottleneck Models (CBMs) are a prominent framework for interpretable AI that map learned visual features to a set of meaningful concepts for task-specific downstream predictions. Their sequential structure enhances transparency by…

Computer Vision and Pattern Recognition · Computer Science 2026-03-11 Mohamed Harmanani , Bining Long , Zhuoxin Guo , Paul F. R. Wilson , Amirhossein Sabour , Minh Nguyen Nhat To , Gabor Fichtinger , Purang Abolmaesumi , Parvin Mousavi

Concept Bottleneck Models Without Predefined Concepts

There has been considerable recent interest in interpretable concept-based models such as Concept Bottleneck Models (CBMs), which first predict human-interpretable concepts and then map them to output classes. To reduce reliance on…

Machine Learning · Computer Science 2024-07-08 Simon Schrodi , Julian Schur , Max Argus , Thomas Brox

Probabilistic Concept Bottleneck Models

Interpretable models are designed to make decisions in a human-interpretable manner. Representatively, Concept Bottleneck Models (CBM) follow a two-step process of concept prediction and class prediction based on the predicted concepts. CBM…

Machine Learning · Computer Science 2023-06-05 Eunji Kim , Dahuin Jung , Sangha Park , Siwon Kim , Sungroh Yoon

Causally Reliable Concept Bottleneck Models

Concept-based models are an emerging paradigm in deep learning that constrains the inference process to operate through human-interpretable variables, facilitating explainability and human interaction. However, these architectures, on par…

Machine Learning · Computer Science 2026-02-03 Giovanni De Felice , Arianna Casanova Flores , Francesco De Santis , Silvia Santini , Johannes Schneider , Pietro Barbiero , Alberto Termine

Interpretable Prognostics with Concept Bottleneck Models

Deep learning approaches have recently been extensively explored for the prognostics of industrial assets. However, they still suffer from a lack of interpretability, which hinders their adoption in safety-critical applications. To improve…

Machine Learning · Computer Science 2024-05-29 Florent Forest , Katharina Rombach , Olga Fink

AnyCBMs: How to Turn Any Black Box into a Concept Bottleneck Model

Interpretable deep learning aims at developing neural architectures whose decision-making processes could be understood by their users. Among these techniqes, Concept Bottleneck Models enhance the interpretability of neural networks by…

Machine Learning · Computer Science 2024-05-28 Gabriele Dominici , Pietro Barbiero , Francesco Giannini , Martin Gjoreski , Marc Langhenirich

LogicCBMs: Logic-Enhanced Concept-Based Learning

Concept Bottleneck Models (CBMs) provide a basis for semantic abstractions within a neural network architecture. Such models have primarily been seen through the lens of interpretability so far, wherein they offer transparency by inferring…

Computer Vision and Pattern Recognition · Computer Science 2025-12-09 Deepika SN Vemuri , Gautham Bellamkonda , Aditya Pola , Vineeth N Balasubramanian

Sample-efficient Learning of Concepts with Theoretical Guarantees: from Data to Concepts without Interventions

Machine learning is a vital part of many real-world systems, but several concerns remain about the lack of interpretability, explainability and robustness of black-box AI systems. Concept Bottleneck Models (CBM) address some of these…

Machine Learning · Statistics 2025-10-24 Hidde Fokkema , Tim van Erven , Sara Magliacane

Conceptual Learning via Embedding Approximations for Reinforcing Interpretability and Transparency

Concept bottleneck models (CBMs) have emerged as critical tools in domains where interpretability is paramount. These models rely on predefined textual descriptions, referred to as concepts, to inform their decision-making process and offer…

Computer Vision and Pattern Recognition · Computer Science 2024-06-14 Maor Dikter , Tsachi Blau , Chaim Baskin

Concept-Based Explainable Artificial Intelligence: Metrics and Benchmarks

Concept-based explanation methods, such as concept bottleneck models (CBMs), aim to improve the interpretability of machine learning models by linking their decisions to human-understandable concepts, under the critical assumption that such…

Artificial Intelligence · Computer Science 2025-02-03 Halil Ibrahim Aysel , Xiaohao Cai , Adam Prugel-Bennett

Chat-CBM: Towards Interactive Concept Bottleneck Models with Frozen Large Language Models

Concept Bottleneck Models (CBMs) provide inherent interpretability by first predicting a set of human-understandable concepts and then mapping them to labels through a simple classifier. While users can intervene in the concept space to…

Computer Vision and Pattern Recognition · Computer Science 2025-09-23 Hangzhou He , Lei Zhu , Kaiwen Li , Xinliang Zhang , Jiakui Hu , Ourui Fu , Zhengjian Yao , Yanye Lu

Can we Constrain Concept Bottleneck Models to Learn Semantically Meaningful Input Features?

Concept Bottleneck Models (CBMs) are regarded as inherently interpretable because they first predict a set of human-defined concepts which are used to predict a task label. For inherent interpretability to be fully realised, and ensure…

Machine Learning · Computer Science 2024-07-31 Jack Furby , Daniel Cunnington , Dave Braines , Alun Preece

Concept Bottleneck Large Language Models

We introduce Concept Bottleneck Large Language Models (CB-LLMs), a novel framework for building inherently interpretable Large Language Models (LLMs). In contrast to traditional black-box LLMs that rely on limited post-hoc interpretations,…

Computation and Language · Computer Science 2025-09-09 Chung-En Sun , Tuomas Oikarinen , Berk Ustun , Tsui-Wei Weng