Related papers: Generating Visual Explanations

Explainable Deep Classification Models for Domain Generalization

Conventionally, AI models are thought to trade off explainability for lower accuracy. We develop a training strategy that not only leads to a more explainable AI system for object classification, but as a consequence, suffers no perceptible…

Computer Vision and Pattern Recognition · Computer Science 2020-03-17 Andrea Zunino , Sarah Adel Bargal , Riccardo Volpi , Mehrnoosh Sameki , Jianming Zhang , Stan Sclaroff , Vittorio Murino , Kate Saenko

TExplain: Explaining Learned Visual Features via Pre-trained (Frozen) Language Models

Interpreting the learned features of vision models has posed a longstanding challenge in the field of machine learning. To address this issue, we propose a novel method that leverages the capabilities of language models to interpret the…

Computer Vision and Pattern Recognition · Computer Science 2024-05-03 Saeid Asgari Taghanaki , Aliasghar Khani , Ali Saheb Pasand , Amir Khasahmadi , Aditya Sanghi , Karl D. D. Willis , Ali Mahdavi-Amiri

Seeing in Words: Learning to Classify through Language Bottlenecks

Neural networks for computer vision extract uninterpretable features despite achieving high accuracy on benchmarks. In contrast, humans can explain their predictions using succinct and intuitive descriptions. To incorporate explainability…

Computer Vision and Pattern Recognition · Computer Science 2023-07-04 Khalid Saifullah , Yuxin Wen , Jonas Geiping , Micah Goldblum , Tom Goldstein

Leveraging Conditional Generative Models in a General Explanation Framework of Classifier Decisions

Providing a human-understandable explanation of classifiers' decisions has become imperative to generate trust in their use for day-to-day tasks. Although many works have addressed this problem by generating visual explanation maps, they…

Machine Learning · Computer Science 2021-06-22 Martin Charachon , Paul-Henry Cournède , Céline Hudelot , Roberto Ardon

Representing visual classification as a linear combination of words

Explainability is a longstanding challenge in deep learning, especially in high-stakes domains like healthcare. Common explainability methods highlight image regions that drive an AI model's decision. Humans, however, heavily rely on…

Artificial Intelligence · Computer Science 2023-11-21 Shobhit Agarwal , Yevgeniy R. Semenov , William Lotter

Grounding Visual Explanations (Extended Abstract)

Existing models which generate textual explanations enforce task relevance through a discriminative term loss function, but such mechanisms only weakly constrain mentioned object parts to actually be present in the image. In this paper, a…

Computer Vision and Pattern Recognition · Computer Science 2017-11-20 Lisa Anne Hendricks , Ronghang Hu , Trevor Darrell , Zeynep Akata

Counterfactual Visual Explanations

In this work, we develop a technique to produce counterfactual visual explanations. Given a 'query' image $I$ for which a vision system predicts class $c$, a counterfactual visual explanation identifies how $I$ could change such that the…

Machine Learning · Computer Science 2019-06-12 Yash Goyal , Ziyan Wu , Jan Ernst , Dhruv Batra , Devi Parikh , Stefan Lee

Evolving Interpretable Visual Classifiers with Large Language Models

Multimodal pre-trained models, such as CLIP, are popular for zero-shot classification due to their open-vocabulary flexibility and high performance. However, vision-language models, which compute similarity scores between images and class…

Computer Vision and Pattern Recognition · Computer Science 2024-04-16 Mia Chiquier , Utkarsh Mall , Carl Vondrick

Visual Explanation by Interpretation: Improving Visual Feedback Capabilities of Deep Neural Networks

Interpretation and explanation of deep models is critical towards wide adoption of systems that rely on them. In this paper, we propose a novel scheme for both interpretation as well as explanation in which, given a pretrained model, we…

Computer Vision and Pattern Recognition · Computer Science 2019-03-11 Jose Oramas , Kaili Wang , Tinne Tuytelaars

Generation of Multimodal Justification Using Visual Word Constraint Model for Explainable Computer-Aided Diagnosis

The ambiguity of the decision-making process has been pointed out as the main obstacle to applying the deep learning-based method in a practical way in spite of its outstanding performance. Interpretability could guarantee the confidence of…

Computer Vision and Pattern Recognition · Computer Science 2019-06-11 Hyebin Lee , Seong Tae Kim , Yong Man Ro

DiG-IN: Diffusion Guidance for Investigating Networks -- Uncovering Classifier Differences Neuron Visualisations and Visual Counterfactual Explanations

While deep learning has led to huge progress in complex image classification tasks like ImageNet, unexpected failure modes, e.g. via spurious features, call into question how reliably these classifiers work in the wild. Furthermore, for…

Computer Vision and Pattern Recognition · Computer Science 2024-07-15 Maximilian Augustin , Yannic Neuhaus , Matthias Hein

Concept Based Explanations and Class Contrasting

Explaining deep neural networks is challenging, due to their large size and non-linearity. In this paper, we introduce a concept-based explanation method, in order to explain the prediction for an individual class, as well as contrasting…

Computer Vision and Pattern Recognition · Computer Science 2025-06-03 Rudolf Herdt , Daniel Otero Baguer

Descriminative-Generative Custom Tokens for Vision-Language Models

This paper explores the possibility of learning custom tokens for representing new concepts in Vision-Language Models (VLMs). Our aim is to learn tokens that can be effective for both discriminative and generative tasks while composing well…

Computer Vision and Pattern Recognition · Computer Science 2025-02-18 Pramuditha Perera , Matthew Trager , Luca Zancato , Alessandro Achille , Stefano Soatto

Generating Part-Based Global Explanations Via Correspondence

Deep learning models are notoriously opaque. Existing explanation methods often focus on localized visual explanations for individual images. Concept-based explanations, while offering global insights, require extensive annotations,…

Computer Vision and Pattern Recognition · Computer Science 2025-09-22 Kunal Rathore , Prasad Tadepalli

Grounding Visual Explanations

Existing visual explanation generating agents learn to fluently justify a class prediction. However, they may mention visual attributes which reflect a strong class prior, although the evidence may not actually be in the image. This is…

Computer Vision and Pattern Recognition · Computer Science 2018-08-03 Lisa Anne Hendricks , Ronghang Hu , Trevor Darrell , Zeynep Akata

Generation of Consistent Sets of Multi-Label Classification Rules with a Multi-Objective Evolutionary Algorithm

Multi-label classification consists in classifying an instance into two or more classes simultaneously. It is a very challenging task present in many real-world applications, such as classification of biology, image, video, audio, and text.…

Machine Learning · Computer Science 2020-04-03 Thiago Zafalon Miranda , Diorge Brognara Sardinha , Márcio Porto Basgalupp , Yaochu Jin , Ricardo Cerri

Write a Classifier: Predicting Visual Classifiers from Unstructured Text

People typically learn through exposure to visual concepts associated with linguistic descriptions. For instance, teaching visual object categories to children is often accompanied by descriptions in text or speech. In a machine learning…

Computer Vision and Pattern Recognition · Computer Science 2016-12-30 Mohamed Elhoseiny , Ahmed Elgammal , Babak Saleh

Generative Interventions for Causal Learning

We introduce a framework for learning robust visual representations that generalize to new viewpoints, backgrounds, and scene contexts. Discriminative models often learn naturally occurring spurious correlations, which cause them to fail on…

Computer Vision and Pattern Recognition · Computer Science 2021-03-30 Chengzhi Mao , Augustine Cha , Amogh Gupta , Hao Wang , Junfeng Yang , Carl Vondrick

Weakly Supervised Recovery of Semantic Attributes

We consider the problem of the extraction of semantic attributes, supervised only with classification labels. For example, when learning to classify images of birds into species, we would like to observe the emergence of features that…

Machine Learning · Computer Science 2021-06-14 Ameen Ali , Tomer Galanti , Evgeniy Zheltonozhskiy , Chaim Baskin , Lior Wolf

Visualizing Deep Neural Network Decisions: Prediction Difference Analysis

This article presents the prediction difference analysis method for visualizing the response of a deep neural network to a specific input. When classifying images, the method highlights areas in a given input image that provide evidence for…

Computer Vision and Pattern Recognition · Computer Science 2017-02-16 Luisa M Zintgraf , Taco S Cohen , Tameem Adel , Max Welling