Related papers: Inter-model Interpretability: Self-supervised Mode…

Interpretable Measures of Conceptual Similarity by Complexity-Constrained Descriptive Auto-Encoding

Quantifying the degree of similarity between images is a key copyright issue for image-based machine learning. In legal doctrine however, determining the degree of similarity between works requires subjective analysis, and fact-finders…

Computer Vision and Pattern Recognition · Computer Science 2024-02-15 Alessandro Achille , Greg Ver Steeg , Tian Yu Liu , Matthew Trager , Carson Klingenberg , Stefano Soatto

Manipulating and Measuring Model Interpretability

With machine learning models being increasingly used to aid decision making even in high-stakes domains, there has been a growing interest in developing interpretable models. Although many supposedly interpretable models have been proposed,…

Artificial Intelligence · Computer Science 2021-08-17 Forough Poursabzi-Sangdeh , Daniel G. Goldstein , Jake M. Hofman , Jennifer Wortman Vaughan , Hanna Wallach

Concept-Centric Transformers: Enhancing Model Interpretability through Object-Centric Concept Learning within a Shared Global Workspace

Many interpretable AI approaches have been proposed to provide plausible explanations for a model's decision-making. However, configuring an explainable model that effectively communicates among computational modules has received less…

Machine Learning · Computer Science 2023-11-09 Jinyung Hong , Keun Hee Park , Theodore P. Pavlic

Learning Interpretable Concept-Based Models with Human Feedback

Machine learning models that first learn a representation of a domain in terms of human-understandable concepts, then use it to make predictions, have been proposed to facilitate interpretation and interaction with models trained on…

Machine Learning · Computer Science 2020-12-08 Isaac Lage , Finale Doshi-Velez

Assessing the Local Interpretability of Machine Learning Models

The increasing adoption of machine learning tools has led to calls for accountability via model interpretability. But what does it mean for a machine learning model to be interpretable by humans, and how can this be assessed? We focus on…

Machine Learning · Computer Science 2019-08-06 Dylan Slack , Sorelle A. Friedler , Carlos Scheidegger , Chitradeep Dutta Roy

Bi-ICE: An Inner Interpretable Framework for Image Classification via Bi-directional Interactions between Concept and Input Embeddings

Inner interpretability is a promising field aiming to uncover the internal mechanisms of AI systems through scalable, automated methods. While significant research has been conducted on large language models, limited attention has been paid…

Computer Vision and Pattern Recognition · Computer Science 2025-12-09 Jinyung Hong , Yearim Kim , Keun Hee Park , Sangyu Han , Nojun Kwak , Theodore P. Pavlic

Capability $\neq$ Interpretability: Human Interpretability of Vision Foundation Models

How interpretable are the features of leading vision models? The question is increasingly pressing as these models move from research benchmarks into high-stakes deployments, yet existing methods cannot answer it reliably. We close this gap…

Computer Vision and Pattern Recognition · Computer Science 2026-05-21 Julien Colin , Lore Goetschalckx , Nuria Oliver , Thomas Serre

Explainable Metric Learning for Deflating Data Bias

Image classification is an essential part of computer vision which assigns a given input image to a specific category based on the similarity evaluation within given criteria. While promising classifiers can be obtained through deep…

Computer Vision and Pattern Recognition · Computer Science 2024-07-09 Emma Andrews , Prabhat Mishra

Enhancing Interpretability Through Loss-Defined Classification Objective in Structured Latent Spaces

Supervised machine learning often operates on the data-driven paradigm, wherein internal model parameters are autonomously optimized to converge predicted outputs with the ground truth, devoid of explicitly programming rules or a priori…

Machine Learning · Computer Science 2024-12-12 Daniel Geissler , Bo Zhou , Mengxi Liu , Paul Lukowicz

Network Dissection: Quantifying Interpretability of Deep Visual Representations

We propose a general framework called Network Dissection for quantifying the interpretability of latent representations of CNNs by evaluating the alignment between individual hidden units and a set of semantic concepts. Given any CNN model,…

Computer Vision and Pattern Recognition · Computer Science 2017-04-20 David Bau , Bolei Zhou , Aditya Khosla , Aude Oliva , Antonio Torralba

Learning to Select Prototypical Parts for Interpretable Sequential Data Modeling

Prototype-based interpretability methods provide intuitive explanations of model prediction by comparing samples to a reference set of memorized exemplars or typical representatives in terms of similarity. In the field of sequential data…

Machine Learning · Computer Science 2023-03-20 Yifei Zhang , Neng Gao , Cunqing Ma

Fully Self-Supervised Class Awareness in Dense Object Descriptors

We address the problem of inferring self-supervised dense semantic correspondences between objects in multi-object scenes. The method introduces learning of class-aware dense object descriptors by providing either unsupervised discrete…

Robotics · Computer Science 2021-10-06 Denis Hadjivelichkov , Dimitrios Kanoulas

Representational Similarity via Interpretable Visual Concepts

How do two deep neural networks differ in how they arrive at a decision? Measuring the similarity of deep networks has been a long-standing open question. Most existing methods provide a single number to measure the similarity of two…

Computer Vision and Pattern Recognition · Computer Science 2025-04-01 Neehar Kondapaneni , Oisin Mac Aodha , Pietro Perona

Towards Visually Explaining Similarity Models

We consider the problem of visually explaining similarity models, i.e., explaining why a model predicts two images to be similar in addition to producing a scalar score. While much recent work in visual model interpretability has focused on…

Computer Vision and Pattern Recognition · Computer Science 2020-10-15 Meng Zheng , Srikrishna Karanam , Terrence Chen , Richard J. Radke , Ziyan Wu

AutoProtoNet: Interpretability for Prototypical Networks

In meta-learning approaches, it is difficult for a practitioner to make sense of what kind of representations the model employs. Without this ability, it can be difficult to both understand what the model knows as well as to make meaningful…

Machine Learning · Computer Science 2022-04-05 Pedro Sandoval-Segura , Wallace Lawson

Diverse Imagenet Models Transfer Better

A commonly accepted hypothesis is that models with higher accuracy on Imagenet perform better on other downstream tasks, leading to much research dedicated to optimizing Imagenet accuracy. Recently this hypothesis has been challenged by…

Computer Vision and Pattern Recognition · Computer Science 2022-04-21 Niv Nayman , Avram Golbert , Asaf Noy , Tan Ping , Lihi Zelnik-Manor

LMC: Large Model Collaboration with Cross-assessment for Training-Free Open-Set Object Recognition

Open-set object recognition aims to identify if an object is from a class that has been encountered during training or not. To perform open-set object recognition accurately, a key challenge is how to reduce the reliance on…

Computer Vision and Pattern Recognition · Computer Science 2023-12-22 Haoxuan Qu , Xiaofei Hui , Yujun Cai , Jun Liu

Learning Compatible Embeddings

Achieving backward compatibility when rolling out new models can highly reduce costs or even bypass feature re-encoding of existing gallery images for in-production visual retrieval systems. Previous related works usually leverage losses…

Computer Vision and Pattern Recognition · Computer Science 2021-08-05 Qiang Meng , Chixiang Zhang , Xiaoqiang Xu , Feng Zhou

Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision

Discriminative self-supervised learning allows training models on any random group of internet images, and possibly recover salient information that helps differentiate between the images. Applied to ImageNet, this leads to object centric…

Computer Vision and Pattern Recognition · Computer Science 2022-02-23 Priya Goyal , Quentin Duval , Isaac Seessel , Mathilde Caron , Ishan Misra , Levent Sagun , Armand Joulin , Piotr Bojanowski

The Cross-Depiction Problem: Computer Vision Algorithms for Recognising Objects in Artwork and in Photographs

The cross-depiction problem is that of recognising visual objects regardless of whether they are photographed, painted, drawn, etc. It is a potentially significant yet under-researched problem. Emulating the remarkable human ability to…

Computer Vision and Pattern Recognition · Computer Science 2015-05-04 Hongping Cai , Qi Wu , Tadeo Corradi , Peter Hall