Related papers: Learning Interpretable Concept-Based Models with H…

A Concept and Argumentation based Interpretable Model in High Risk Domains

Interpretability has become an essential topic for artificial intelligence in some high-risk domains such as healthcare, bank and security. For commonly-used tabular data, traditional methods trained end-to-end machine learning models with…

Artificial Intelligence · Computer Science 2022-08-18 Haixiao Chi , Dawei Wang , Gaojie Cui , Feng Mao , Beishui Liao

Learning Interpretable Concepts: Unifying Causal Representation Learning and Foundation Models

To build intelligent machine learning systems, there are two broad approaches. One approach is to build inherently interpretable models, as endeavored by the growing field of causal representation learning. The other approach is to build…

Machine Learning · Computer Science 2024-12-10 Goutham Rajendran , Simon Buchholz , Bryon Aragam , Bernhard Schölkopf , Pradeep Ravikumar

Discovering Concepts in Learned Representations using Statistical Inference and Interactive Visualization

Concept discovery is one of the open problems in the interpretability literature that is important for bridging the gap between non-deep learning experts and model end-users. Among current formulations, concepts defines them by as a…

Machine Learning · Computer Science 2022-02-11 Adrianna Janik , Kris Sankaran

The Definitions of Interpretability and Learning of Interpretable Models

As machine learning algorithms getting adopted in an ever-increasing number of applications, interpretation has emerged as a crucial desideratum. In this paper, we propose a mathematical definition for the human-interpretable model. In…

Machine Learning · Computer Science 2021-06-01 Weishen Pan , Changshui Zhang

Meaningful Models: Utilizing Conceptual Structure to Improve Machine Learning Interpretability

The last decade has seen huge progress in the development of advanced machine learning models; however, those models are powerless unless human users can interpret them. Here we show how the mind's construction of concepts and meaning can…

Machine Learning · Statistics 2016-07-04 Nick Condry

Concept Learning for Interpretable Multi-Agent Reinforcement Learning

Multi-agent robotic systems are increasingly operating in real-world environments in close proximity to humans, yet are largely controlled by policy models with inscrutable deep neural network representations. We introduce a method for…

Machine Learning · Computer Science 2023-02-24 Renos Zabounidis , Joseph Campbell , Simon Stepputtis , Dana Hughes , Katia Sycara

Interpretable Representations in Explainable AI: From Theory to Practice

Interpretable representations are the backbone of many explainers that target black-box predictive systems based on artificial intelligence and machine learning algorithms. They translate the low-level data representation necessary for good…

Machine Learning · Computer Science 2024-04-29 Kacper Sokol , Peter Flach

Human-interpretable model explainability on high-dimensional data

The importance of explainability in machine learning continues to grow, as both neural-network architectures and the data they model become increasingly complex. Unique challenges arise when a model's input features become high dimensional:…

Machine Learning · Computer Science 2021-12-21 Damien de Mijolla , Christopher Frye , Markus Kunesch , John Mansir , Ilya Feige

Exploiting Interpretable Capabilities with Concept-Enhanced Diffusion and Prototype Networks

Concept-based machine learning methods have increasingly gained importance due to the growing interest in making neural networks interpretable. However, concept annotations are generally challenging to obtain, making it crucial to leverage…

Machine Learning · Computer Science 2024-11-06 Alba Carballo-Castro , Sonia Laguna , Moritz Vandenhirtz , Julia E. Vogt

Tree of Concepts: Interpretable Continual Learners in Non-Stationary Clinical Domains

Continual learning aims to update models under distribution shift without forgetting, yet many high-stakes deployments, such as healthcare, also require interpretability. In practice, models that adapt well (e.g., deep networks) are often…

Machine Learning · Computer Science 2026-04-21 Dongkyu Cho , Xiyue Li , Samrachana Adhikari , Rumi Chunara

Comprehensible Convolutional Neural Networks via Guided Concept Learning

Learning concepts that are consistent with human perception is important for Deep Neural Networks to win end-user trust. Post-hoc interpretation methods lack transparency in the feature representations learned by the models. This work…

Computer Vision and Pattern Recognition · Computer Science 2021-05-25 Sandareka Wickramanayake , Wynne Hsu , Mong Li Lee

Uncovering Unique Concept Vectors through Latent Space Decomposition

Interpreting the inner workings of deep learning models is crucial for establishing trust and ensuring model safety. Concept-based explanations have emerged as a superior approach that is more interpretable than feature attribution…

Machine Learning · Computer Science 2023-07-17 Mara Graziani , Laura O' Mahony , An-Phi Nguyen , Henning Müller , Vincent Andrearczyk

Understanding Inter-Concept Relationships in Concept-Based Models

Concept-based explainability methods provide insight into deep learning systems by constructing explanations using human-understandable concepts. While the literature on human reasoning demonstrates that we exploit relationships between…

Machine Learning · Computer Science 2024-05-29 Naveen Raman , Mateo Espinosa Zarlenga , Mateja Jamnik

Interpretable Latent Spaces for Learning from Demonstration

Effective human-robot interaction, such as in robot learning from human demonstration, requires the learning agent to be able to ground abstract concepts (such as those contained within instructions) in a corresponding high-dimensional…

Computer Vision and Pattern Recognition · Computer Science 2018-10-03 Yordan Hristov , Alex Lascarides , Subramanian Ramamoorthy

Interpretable Companions for Black-Box Models

We present an interpretable companion model for any pre-trained black-box classifiers. The idea is that for any input, a user can decide to either receive a prediction from the black-box model, with high accuracy but no explanations, or…

Machine Learning · Statistics 2020-02-12 Danqing Pan , Tong Wang , Satoshi Hara

Learning an Interpretable Model for Driver Behavior Prediction with Inductive Biases

To plan safe maneuvers and act with foresight, autonomous vehicles must be capable of accurately predicting the uncertain future. In the context of autonomous driving, deep neural networks have been successfully applied to learning…

Robotics · Computer Science 2022-08-02 Salar Arbabi , Davide Tavernini , Saber Fallah , Richard Bowden

Interpretable Neural-Symbolic Concept Reasoning

Deep learning methods are highly accurate, yet their opaque decision process prevents them from earning full human trust. Concept-based models aim to address this issue by learning tasks based on a set of human-understandable concepts.…

Artificial Intelligence · Computer Science 2024-05-02 Pietro Barbiero , Gabriele Ciravegna , Francesco Giannini , Mateo Espinosa Zarlenga , Lucie Charlotte Magister , Alberto Tonda , Pietro Lio' , Frederic Precioso , Mateja Jamnik , Giuseppe Marra

Diverse Concept Proposals for Concept Bottleneck Models

Concept bottleneck models are interpretable predictive models that are often used in domains where model trust is a key priority, such as healthcare. They identify a small number of human-interpretable concepts in the data, which they then…

Machine Learning · Computer Science 2024-12-25 Katrina Brown , Marton Havasi , Finale Doshi-Velez

Techniques for Interpretable Machine Learning

Interpretable machine learning tackles the important problem that humans cannot understand the behaviors of complex machine learning models and how these models arrive at a particular decision. Although many approaches have been proposed, a…

Machine Learning · Computer Science 2019-05-21 Mengnan Du , Ninghao Liu , Xia Hu

Interpretable Concept-Based Memory Reasoning

The lack of transparency in the decision-making processes of deep learning systems presents a significant challenge in modern artificial intelligence (AI), as it impairs users' ability to rely on and verify these systems. To address this…

Artificial Intelligence · Computer Science 2024-11-18 David Debot , Pietro Barbiero , Francesco Giannini , Gabriele Ciravegna , Michelangelo Diligenti , Giuseppe Marra