Related papers: Explaining Image Classification with Visual Debate…

Explaining decisions of black-box classifiers is paramount in sensitive domains such as medical imaging since clinicians confidence is necessary for adoption. Various explanation approaches have been proposed, among which perturbation based…

Computer Vision and Pattern Recognition · Computer Science 2020-12-15 Martin Charachon , Céline Hudelot , Paul-Henry Cournède , Camille Ruppli , Roberto Ardon

Visualizing and Understanding Contrastive Learning

Contrastive learning has revolutionized the field of computer vision, learning rich representations from unlabeled data, which generalize well to diverse vision tasks. Consequently, it has become increasingly important to explain these…

Computer Vision and Pattern Recognition · Computer Science 2023-12-15 Fawaz Sammani , Boris Joukovsky , Nikos Deligiannis

Contrastive Counterfactual Visual Explanations With Overdetermination

A novel explainable AI method called CLEAR Image is introduced in this paper. CLEAR Image is based on the view that a satisfactory explanation should be contrastive, counterfactual and measurable. CLEAR Image explains an image's…

Computer Vision and Pattern Recognition · Computer Science 2022-06-10 Adam White , Kwun Ho Ngan , James Phelan , Saman Sadeghi Afgeh , Kevin Ryan , Constantino Carlos Reyes-Aldasoro , Artur d'Avila Garcez

Generating Diverse and Accurate Visual Captions by Comparative Adversarial Learning

We study how to generate captions that are not only accurate in describing an image but also discriminative across different images. The problem is both fundamental and interesting, as most machine-generated captions, despite phenomenal…

Computer Vision and Pattern Recognition · Computer Science 2019-03-12 Dianqi Li , Qiuyuan Huang , Xiaodong He , Lei Zhang , Ming-Ting Sun

Zero-Shot Textual Explanations via Translating Decision-Critical Features

Textual explanations make image classifier decisions transparent by describing the prediction rationale in natural language. Large vision-language models can generate captions but are designed for general visual understanding, not…

Computer Vision and Pattern Recognition · Computer Science 2026-05-19 Toshinori Yamauchi , Hiroshi Kera , Kazuhiko Kawamoto

DiG-IN: Diffusion Guidance for Investigating Networks -- Uncovering Classifier Differences Neuron Visualisations and Visual Counterfactual Explanations

While deep learning has led to huge progress in complex image classification tasks like ImageNet, unexpected failure modes, e.g. via spurious features, call into question how reliably these classifiers work in the wild. Furthermore, for…

Computer Vision and Pattern Recognition · Computer Science 2024-07-15 Maximilian Augustin , Yannic Neuhaus , Matthias Hein

From Visual Explanations to Counterfactual Explanations with Latent Diffusion

Visual counterfactual explanations are ideal hypothetical images that change the decision-making of the classifier with high confidence toward the desired class while remaining visually plausible and close to the initial image. In this…

Computer Vision and Pattern Recognition · Computer Science 2025-04-15 Tung Luu , Nam Le , Duc Le , Bac Le

Towards Visual Explainable Active Learning for Zero-Shot Classification

Zero-shot classification is a promising paradigm to solve an applicable problem when the training classes and test classes are disjoint. Achieving this usually needs experts to externalize their domain knowledge by manually specifying a…

Human-Computer Interaction · Computer Science 2021-08-17 Shichao Jia , Zeyu Li , Nuo Chen , Jiawan Zhang

Explainable Metric Learning for Deflating Data Bias

Image classification is an essential part of computer vision which assigns a given input image to a specific category based on the similarity evaluation within given criteria. While promising classifiers can be obtained through deep…

Computer Vision and Pattern Recognition · Computer Science 2024-07-09 Emma Andrews , Prabhat Mishra

Visualizing Deep Neural Network Decisions: Prediction Difference Analysis

This article presents the prediction difference analysis method for visualizing the response of a deep neural network to a specific input. When classifying images, the method highlights areas in a given input image that provide evidence for…

Computer Vision and Pattern Recognition · Computer Science 2017-02-16 Luisa M Zintgraf , Taco S Cohen , Tameem Adel , Max Welling

Free Argumentative Exchanges for Explaining Image Classifiers

Deep learning models are powerful image classifiers but their opacity hinders their trustworthiness. Explanation methods for capturing the reasoning process within these classifiers faithfully and in a clear manner are scarce, due to their…

Artificial Intelligence · Computer Science 2025-02-19 Avinash Kori , Antonio Rago , Francesca Toni

Generating Visual Explanations

Clearly explaining a rationale for a classification decision to an end-user can be as important as the decision itself. Existing approaches for deep visual recognition are generally opaque and do not output any justification text;…

Computer Vision and Pattern Recognition · Computer Science 2016-03-29 Lisa Anne Hendricks , Zeynep Akata , Marcus Rohrbach , Jeff Donahue , Bernt Schiele , Trevor Darrell

Leveraging Conditional Generative Models in a General Explanation Framework of Classifier Decisions

Providing a human-understandable explanation of classifiers' decisions has become imperative to generate trust in their use for day-to-day tasks. Although many works have addressed this problem by generating visual explanation maps, they…

Machine Learning · Computer Science 2021-06-22 Martin Charachon , Paul-Henry Cournède , Céline Hudelot , Roberto Ardon

Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding

Visual arguments, often used in advertising or social causes, rely on images to persuade viewers to do or believe something. Understanding these arguments requires selective vision: only specific visual stimuli within an image are relevant…

Computation and Language · Computer Science 2024-10-24 Jiwan Chung , Sungjae Lee , Minseo Kim , Seungju Han , Ashkan Yousefpour , Jack Hessel , Youngjae Yu

Measuring and improving the quality of visual explanations

The ability of to explain neural network decisions goes hand in hand with their safe deployment. Several methods have been proposed to highlight features important for a given network decision. However, there is no consensus on how to…

Computer Vision and Pattern Recognition · Computer Science 2020-03-23 Agnieszka Grabska-Barwińska

Reward Design for Justifiable Sequential Decision-Making

Equipping agents with the capacity to justify made decisions using supporting evidence represents a cornerstone of accountable decision-making. Furthermore, ensuring that justifications are in line with human expectations and societal norms…

Machine Learning · Computer Science 2024-02-27 Aleksa Sukovic , Goran Radanovic

Learning to Disambiguate by Asking Discriminative Questions

The ability to ask questions is a powerful tool to gather information in order to learn about the world and resolve ambiguities. In this paper, we explore a novel problem of generating discriminative questions to help disambiguate visual…

Computer Vision and Pattern Recognition · Computer Science 2017-08-10 Yining Li , Chen Huang , Xiaoou Tang , Chen-Change Loy

Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning

Vision-Language Models have excelled at textual reasoning, but they often struggle with fine-grained spatial understanding and continuous action planning, failing to simulate the dynamics required for complex visual reasoning. In this work,…

Machine Learning · Computer Science 2026-01-30 Chengzu Li , Zanyi Wang , Jiaang Li , Yi Xu , Han Zhou , Huanyu Zhang , Ruichuan An , Dengyang Jiang , Zhaochong An , Ivan Vulić , Serge Belongie , Anna Korhonen

Pragmatic Inference with a CLIP Listener for Contrastive Captioning

We propose a simple yet effective and robust method for contrastive captioning: generating discriminative captions that distinguish target images from very similar alternative distractor images. Our approach is built on a pragmatic…

Computation and Language · Computer Science 2023-06-16 Jiefu Ou , Benno Krojer , Daniel Fried

Two can play this Game: Visual Dialog with Discriminative Question Generation and Answering

Human conversation is a complex mechanism with subtle nuances. It is hence an ambitious goal to develop artificial intelligence agents that can participate fluently in a conversation. While we are still far from achieving this goal, recent…

Computer Vision and Pattern Recognition · Computer Science 2018-03-30 Unnat Jain , Svetlana Lazebnik , Alexander Schwing