Related papers: Intrinsic Concept Extraction Based on Compositiona…

ICE: Intrinsic Concept Extraction from a Single Image via Diffusion Models

The inherent ambiguity in defining visual concepts poses significant challenges for modern generative models, such as the diffusion-based Text-to-Image (T2I) models, in accurately learning concepts from a single image. Existing methods lack…

Computer Vision and Pattern Recognition · Computer Science 2025-04-22 Fernando Julio Cendra, Kai Han

ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction

While personalized text-to-image generation has enabled the learning of a single concept from multiple images, a more practical yet challenging scenario involves learning multiple concepts within a single image. However, existing works…

Computer Vision and Pattern Recognition · Computer Science 2024-07-10 Shaozhe Hao, Kai Han, Zhengyao Lv, Shihao Zhao, Kwan-Yee K. Wong

Towards Compositionality in Concept Learning

Concept-based interpretability methods offer a lens into the internals of foundation models by decomposing their embeddings into high-level concepts. These concept representations are most useful when they are compositional, meaning that…

Computation and Language · Computer Science 2024-06-27 Adam Stein, Aaditya Naik, Yinjun Wu, Mayur Naik, Eric Wong

Bi-ICE: An Inner Interpretable Framework for Image Classification via Bi-directional Interactions between Concept and Input Embeddings

Inner interpretability is a promising field aiming to uncover the internal mechanisms of AI systems through scalable, automated methods. While significant research has been conducted on large language models, limited attention has been paid…

Computer Vision and Pattern Recognition · Computer Science 2025-12-09 Jinyung Hong, Yearim Kim, Keun Hee Park, Sangyu Han, Nojun Kwak, Theodore P. Pavlic

CusConcept: Customized Visual Concept Decomposition with Diffusion Models

Enabling generative models to decompose visual concepts from a single image is a complex and challenging problem. In this paper, we study a new and challenging task, customized concept decomposition, wherein the objective is to leverage…

Computer Vision and Pattern Recognition · Computer Science 2024-10-02 Zhi Xu, Shaozhe Hao, Kai Han

VICE: Variational Interpretable Concept Embeddings

A central goal in the cognitive sciences is the development of numerical models for mental representations of object concepts. This paper introduces Variational Interpretable Concept Embeddings (VICE), an approximate Bayesian method for…

Machine Learning · Computer Science 2022-10-07 Lukas Muttenthaler, Charles Y. Zheng, Patrick McClure, Robert A. Vandermeulen, Martin N. Hebart, Francisco Pereira

Enhancing the Comprehensibility of Text Explanations via Unsupervised Concept Discovery

Concept-based explainable approaches have emerged as a promising method in explainable AI because they can interpret models in a way that aligns with human reasoning. However, their adaption in the text domain remains limited. Most existing…

Computation and Language · Computer Science 2025-05-27 Yifan Sun, Danding Wang, Qiang Sheng, Juan Cao, Jintao Li

EDUCE: Explaining model Decisions through Unsupervised Concepts Extraction

Providing explanations along with predictions is crucial in some text processing tasks. Therefore, we propose a new self-interpretable model that performs output prediction and simultaneously provides an explanation in terms of the presence…

Machine Learning · Computer Science 2019-09-30 Diane Bouchacourt, Ludovic Denoyer

CRCE: Coreference-Retention Concept Erasure in Text-to-Image Diffusion Models

Text-to-Image diffusion models can produce undesirable content that necessitates concept erasure. However, existing methods struggle with under-erasure, leaving residual traces of targeted concepts, or over-erasure, mistakenly eliminating…

Computer Vision and Pattern Recognition · Computer Science 2025-05-21 Yuyang Xue, Edward Moroshko, Feng Chen, Jingyu Sun, Steven McDonagh, Sotirios A. Tsaftaris

ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval

Composed image retrieval (CIR) is the task of retrieving a target image specified by a query image and a relative text that describes a semantic modification to the query image. Existing methods in CIR struggle to accurately represent the…

Computer Vision and Pattern Recognition · Computer Science 2025-05-28 Eric Xing, Pranavi Kolouju, Robert Pless, Abby Stylianou, Nathan Jacobs

Composite Concept Extraction through Backdooring

Learning composite concepts, such as "red car", from individual examples -- like a white car representing the concept of "car" and a red strawberry representing the concept of…

Computer Vision and Pattern Recognition · Computer Science 2024-06-24 Banibrata Ghosh, Haripriya Harikumar, Khoa D Doan, Svetha Venkatesh, Santu Rana

Unsupervised Interpretable Basis Extraction for Concept-Based Visual Explanations

An important line of research attempts to explain CNN image classifier predictions and intermediate layer representations in terms of human-understandable concepts. Previous work supports that deep representations are linearly separable…

Computer Vision and Pattern Recognition · Computer Science 2025-09-23 Alexandros Doumanoglou, Stylianos Asteriadis, Dimitrios Zarpalas

A Unifying Framework for Unsupervised Concept Extraction

Techniques for concept extraction, such as sparse autoencoders and transcoders, aim to extract high-level symbolic concepts from low-level nonsymbolic representations. When these extracted concepts are used for downstream tasks such as…

Machine Learning · Computer Science 2026-04-29 Chandler Squires, Pradeep Ravikumar

Concept-Centric Transformers: Enhancing Model Interpretability through Object-Centric Concept Learning within a Shared Global Workspace

Many interpretable AI approaches have been proposed to provide plausible explanations for a model's decision-making. However, configuring an explainable model that effectively communicates among computational modules has received less…

Machine Learning · Computer Science 2023-11-09 Jinyung Hong, Keun Hee Park, Theodore P. Pavlic

MACE: Model Agnostic Concept Extractor for Explaining Image Classification Networks

Deep convolutional networks have been quite successful at various image classification tasks. The current methods to explain the predictions of a pre-trained model rely on gradient information, often resulting in saliency maps that focus on…

Machine Learning · Computer Science 2020-11-04 Ashish Kumar, Karan Sehgal, Prerna Garg, Vidhya Kamakshi, Narayanan C Krishnan

Towards Automatic Concept-based Explanations

Interpretability has become an important topic of research as more machine learning (ML) models are deployed and widely used to make important decisions. Most of the current explanation methods provide explanations through feature…

Machine Learning · Statistics 2019-10-09 Amirata Ghorbani, James Wexler, James Zou, Been Kim

Hierarchical Concept Embedding & Pursuit for Interpretable Image Classification

Interpretable-by-design models are gaining traction in computer vision because they provide faithful explanations for their predictions. In image classification, these models typically recover human-interpretable concepts from an image and…

Machine Learning · Computer Science 2026-03-31 Nghia Nguyen, Tianjiao Ding, René Vidal

ICC++: Explainable Image Retrieval for Art Historical Corpora using Image Composition Canvas

Image compositions are helpful in the study of image structures and assist in discovering the semantics of the underlying scene portrayed across art forms and styles. With the digitization of artworks in recent years, thousands of images of…

Computer Vision and Pattern Recognition · Computer Science 2022-06-23 Prathmesh Madhu, Tilman Marquart, Ronak Kosti, Dirk Suckow, Peter Bell, Andreas Maier, Vincent Christlein

Uncovering Unique Concept Vectors through Latent Space Decomposition

Interpreting the inner workings of deep learning models is crucial for establishing trust and ensuring model safety. Concept-based explanations have emerged as a superior approach that is more interpretable than feature attribution…

Machine Learning · Computer Science 2023-07-17 Mara Graziani, Laura O'Mahony, An-Phi Nguyen, Henning Müller, Vincent Andrearczyk

DISC: Dense Integrated Semantic Context for Large-Scale Open-Set Semantic Mapping

Open-set semantic mapping enables language-driven robotic perception, but current instance-centric approaches are bottlenecked by context-depriving and computationally expensive crop-based feature extraction. To overcome this fundamental…

Computer Vision and Pattern Recognition · Computer Science 2026-03-05 Felix Igelbrink, Lennart Niecksch, Martin Atzmueller, Joachim Hertzberg