Related papers: TextCAVs: Debugging vision models using text

Exploiting Text-Image Latent Spaces for the Description of Visual Concepts

Concept Activation Vectors (CAVs) offer insights into neural network decision-making by linking human friendly concepts to the model's internal feature extraction process. However, when a new set of CAVs is discovered, they must still be…

Computer Vision and Pattern Recognition · Computer Science 2024-10-24 Laines Schmalwasser , Jakob Gawlikowski , Joachim Denzler , Julia Niebling

Explaining Explainability: Recommendations for Effective Use of Concept Activation Vectors

Concept-based explanations translate the internal representations of deep learning models into a language that humans are familiar with: concepts. One popular method for finding concepts is Concept Activation Vectors (CAVs), which are…

Machine Learning · Computer Science 2025-02-14 Angus Nicolson , Lisa Schut , J. Alison Noble , Yarin Gal

Interpretability Beyond Feature Attribution: Quantitative Testing with Concept Activation Vectors (TCAV)

The interpretation of deep learning models is a challenge due to their size, complexity, and often opaque internal state. In addition, many systems, such as image classifiers, operate on low-level features rather than high-level concepts.…

Machine Learning · Statistics 2019-04-05 Been Kim , Martin Wattenberg , Justin Gilmer , Carrie Cai , James Wexler , Fernanda Viegas , Rory Sayres

FastCAV: Efficient Computation of Concept Activation Vectors for Explaining Deep Neural Networks

Concepts such as objects, patterns, and shapes are how humans understand the world. Building on this intuition, concept-based explainability methods aim to study representations learned by deep neural networks in relation to…

Machine Learning · Computer Science 2025-05-26 Laines Schmalwasser , Niklas Penzel , Joachim Denzler , Julia Niebling

Human-Centered Concept Explanations for Neural Networks

Understanding complex machine learning models such as deep neural networks with explanations is crucial in various applications. Many explanations stem from the model perspective, and may not necessarily effectively communicate why the…

Machine Learning · Computer Science 2022-02-28 Chih-Kuan Yeh , Been Kim , Pradeep Ravikumar

Towards Concept-based Interpretability of Skin Lesion Diagnosis using Vision-Language Models

Concept-based models naturally lend themselves to the development of inherently interpretable skin lesion diagnosis, as medical experts make decisions based on a set of visual patterns of the lesion. Nevertheless, the development of these…

Computer Vision and Pattern Recognition · Computer Science 2024-03-07 Cristiano Patrício , Luís F. Teixeira , João C. Neves

TextCAM: Explaining Class Activation Map with Text

Deep neural networks (DNNs) have achieved remarkable success across domains but remain difficult to interpret, limiting their trustworthiness in high-stakes applications. This paper focuses on deep vision models, for which a dominant line…

Computer Vision and Pattern Recognition · Computer Science 2025-10-02 Qiming Zhao , Xingjian Li , Xiaoyu Cao , Xiaolong Wu , Min Xu

Knowledge graphs for empirical concept retrieval

Concept-based explainable AI is promising as a tool to improve the understanding of complex models at the premises of a given user, viz.\ as a tool for personalized explainability. An important class of concept-based explainability methods…

Machine Learning · Computer Science 2024-07-31 Lenka Tětková , Teresa Karen Scheidt , Maria Mandrup Fogh , Ellen Marie Gaunby Jørgensen , Finn Årup Nielsen , Lars Kai Hansen

ConceptExplainer: Interactive Explanation for Deep Neural Networks from a Concept Perspective

Traditional deep learning interpretability methods which are suitable for model users cannot explain network behaviors at the global level and are inflexible at providing fine-grained explanations. As a solution, concept-based explanations…

Human-Computer Interaction · Computer Science 2022-10-26 Jinbin Huang , Aditi Mishra , Bum Chul Kwon , Chris Bryan

Robust Semantic Interpretability: Revisiting Concept Activation Vectors

Interpretability methods for image classification assess model trustworthiness by attempting to expose whether the model is systematically biased or attending to the same cues as a human would. Saliency methods for feature attribution…

Machine Learning · Statistics 2021-04-08 Jacob Pfau , Albert T. Young , Jerome Wei , Maria L. Wei , Michael J. Keiser

iCapsNets: Towards Interpretable Capsule Networks for Text Classification

Many text classification applications require models with satisfying performance as well as good interpretability. Traditional machine learning methods are easy to interpret but have low accuracies. The development of deep learning models…

Computation and Language · Computer Science 2020-06-02 Zhengyang Wang , Xia Hu , Shuiwang Ji

GCAV: A Global Concept Activation Vector Framework for Cross-Layer Consistency in Interpretability

Concept Activation Vectors (CAVs) provide a powerful approach for interpreting deep neural networks by quantifying their sensitivity to human-defined concepts. However, when computed independently at different layers, CAVs often exhibit…

Computer Vision and Pattern Recognition · Computer Science 2025-09-11 Zhenghao He , Sanchit Sinha , Guangzhi Xiong , Aidong Zhang

Interpreting Vision and Language Generative Models with Semantic Visual Priors

When applied to Image-to-text models, interpretability methods often provide token-by-token explanations namely, they compute a visual explanation for each token of the generated sequence. Those explanations are expensive to compute and…

Computer Vision and Pattern Recognition · Computer Science 2023-09-26 Michele Cafagna , Lina M. Rojas-Barahona , Kees van Deemter , Albert Gatt

Envisioning MedCLIP: A Deep Dive into Explainability for Medical Vision-Language Models

Explaining Deep Learning models is becoming increasingly important in the face of daily emerging multimodal models, particularly in safety-critical domains like medical imaging. However, the lack of detailed investigations into the…

Computer Vision and Pattern Recognition · Computer Science 2024-03-29 Anees Ur Rehman Hashmi , Dwarikanath Mahapatra , Mohammad Yaqub

ViConEx-Med: Visual Concept Explainability via Multi-Concept Token Transformer for Medical Image Analysis

Concept-based models aim to explain model decisions with human-understandable concepts. However, most existing approaches treat concepts as numerical attributes, without providing complementary visual explanations that could localize the…

Computer Vision and Pattern Recognition · Computer Science 2025-10-14 Cristiano Patrício , Luís F. Teixeira , João C. Neves

Interpretability for Multimodal Emotion Recognition using Concept Activation Vectors

Multimodal Emotion Recognition refers to the classification of input video sequences into emotion labels based on multiple input modalities (usually video, audio and text). In recent years, Deep Neural networks have shown remarkable…

Machine Learning · Computer Science 2024-10-28 Ashish Ramayee Asokan , Nidarshan Kumar , Anirudh Venkata Ragam , Shylaja S Sharath

Enhancing Vision Models for Text-Heavy Content Understanding and Interaction

Interacting and understanding with text heavy visual content with multiple images is a major challenge for traditional vision models. This paper is on enhancing vision models' capability to comprehend or understand and learn from images…

Computer Vision and Pattern Recognition · Computer Science 2024-08-31 Adithya TG , Adithya SK , Abhinav R Bharadwaj , Abhiram HA , Surabhi Narayan

Concept Distillation: Leveraging Human-Centered Explanations for Model Improvement

Humans use abstract concepts for understanding instead of hard features. Recent interpretability research has focused on human-centered concept explanations of neural networks. Concept Activation Vectors (CAVs) estimate a model's…

Machine Learning · Computer Science 2023-11-28 Avani Gupta , Saurabh Saini , P J Narayanan

Learnable Visual Words for Interpretable Image Recognition

To interpret deep models' predictions, attention-based visual cues are widely used in addressing \textit{why} deep models make such predictions. Beyond that, the current research community becomes more interested in reasoning \textit{how}…

Computer Vision and Pattern Recognition · Computer Science 2022-05-27 Wenxiao Xiao , Zhengming Ding , Hongfu Liu

Corpus-level and Concept-based Explanations for Interpretable Document Classification

Using attention weights to identify information that is important for models' decision-making is a popular approach to interpret attention-based neural networks. This is commonly realized in practice through the generation of a heat-map for…

Information Retrieval · Computer Science 2021-06-01 Tian Shi , Xuchao Zhang , Ping Wang , Chandan K. Reddy