Related papers: CEIR: Concept-based Explainable Image Representati…

A clinically motivated self-supervised approach for content-based image retrieval of CT liver images

Deep learning-based approaches for content-based image retrieval (CBIR) of CT liver images is an active field of research, but suffers from some critical limitations. First, they are heavily reliant on labeled data, which can be challenging…

Computer Vision and Pattern Recognition · Computer Science 2022-07-12 Kristoffer Knutsen Wickstrøm , Eirik Agnalt Østmo , Keyur Radiya , Karl Øyvind Mikalsen , Michael Christian Kampffmeyer , Robert Jenssen

CLIC: Contrastive Learning Framework for Unsupervised Image Complexity Representation

As a fundamental visual attribute, image complexity significantly influences both human perception and the performance of computer vision models. However, accurately assessing and quantifying image complexity remains a challenging task. (1)…

Computer Vision and Pattern Recognition · Computer Science 2025-04-28 Shipeng Liu , Liang Zhao , Dengfeng Chen

G-SimCLR : Self-Supervised Contrastive Learning with Guided Projection via Pseudo Labelling

In the realms of computer vision, it is evident that deep neural networks perform better in a supervised setting with a large amount of labeled data. The representations learned with supervision are not only of high quality but also helps…

Machine Learning · Computer Science 2020-09-28 Souradip Chakraborty , Aritra Roy Gosthipaty , Sayak Paul

Rethinking Composed Image Retrieval Evaluation: A Fine-Grained Benchmark from Image Editing

Composed Image Retrieval (CIR) is a pivotal and complex task in multimodal understanding. Current CIR benchmarks typically feature limited query categories and fail to capture the diverse requirements of real-world scenarios. To bridge this…

Computer Vision and Pattern Recognition · Computer Science 2026-01-23 Tingyu Song , Yanzhao Zhang , Mingxin Li , Zhuoning Guo , Dingkun Long , Pengjun Xie , Siyue Zhang , Yilun Zhao , Shu Wu

Learning Interpretable Concept-Based Models with Human Feedback

Machine learning models that first learn a representation of a domain in terms of human-understandable concepts, then use it to make predictions, have been proposed to facilitate interpretation and interaction with models trained on…

Machine Learning · Computer Science 2020-12-08 Isaac Lage , Finale Doshi-Velez

VISIR: Visual and Semantic Image Label Refinement

The social media explosion has populated the Internet with a wealth of images. There are two existing paradigms for image retrieval: 1) content-based image retrieval (CBIR), which has traditionally used visual features for similarity search…

Multimedia · Computer Science 2019-09-04 Sreyasi Nag Chowdhury , Niket Tandon , Hakan Ferhatosmanoglu , Gerhard Weikum

Concept Bottleneck Model with Additional Unsupervised Concepts

With the increasing demands for accountability, interpretability is becoming an essential capability for real-world AI applications. However, most methods utilize post-hoc approaches rather than training the interpretable model. In this…

Computer Vision and Pattern Recognition · Computer Science 2022-02-04 Yoshihide Sawada , Keigo Nakamura

SAIR: Learning Semantic-aware Implicit Representation

Implicit representation of an image can map arbitrary coordinates in the continuous domain to their corresponding color values, presenting a powerful capability for image reconstruction. Nevertheless, existing implicit representation…

Computer Vision and Pattern Recognition · Computer Science 2023-10-16 Canyu Zhang , Xiaoguang Li , Qing Guo , Song Wang

Lesion Search with Self-supervised Learning

Content-based image retrieval (CBIR) with self-supervised learning (SSL) accelerates clinicians' interpretation of similar images without manual annotations. We develop a CBIR from the contrastive learning SimCLR and incorporate a…

Computer Vision and Pattern Recognition · Computer Science 2023-11-21 Kristin Qi , Jiali Cheng , Daniel Haehn

From Segments to Concepts: Interpretable Image Classification via Concept-Guided Segmentation

Deep neural networks have achieved remarkable success in computer vision; however, their black-box nature in decision-making limits interpretability and trust, particularly in safety-critical applications. Interpretability is crucial in…

Computer Vision and Pattern Recognition · Computer Science 2025-10-07 Ran Eisenberg , Amit Rozner , Ethan Fetaya , Ofir Lindenbaum

ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval

Composed image retrieval (CIR) is the task of retrieving a target image specified by a query image and a relative text that describes a semantic modification to the query image. Existing methods in CIR struggle to accurately represent the…

Computer Vision and Pattern Recognition · Computer Science 2025-05-28 Eric Xing , Pranavi Kolouju , Robert Pless , Abby Stylianou , Nathan Jacobs

Sample-efficient Learning of Concepts with Theoretical Guarantees: from Data to Concepts without Interventions

Machine learning is a vital part of many real-world systems, but several concerns remain about the lack of interpretability, explainability and robustness of black-box AI systems. Concept Bottleneck Models (CBM) address some of these…

Machine Learning · Statistics 2025-10-24 Hidde Fokkema , Tim van Erven , Sara Magliacane

Loc-VAE: Learning Structurally Localized Representation from 3D Brain MR Images for Content-Based Image Retrieval

Content-based image retrieval (CBIR) systems are an emerging technology that supports reading and interpreting medical images. Since 3D brain MR images are high dimensional, dimensionality reduction is necessary for CBIR using machine…

Image and Video Processing · Electrical Eng. & Systems 2022-10-04 Kei Nishimaki , Kumpei Ikuta , Yuto Onga , Hitoshi Iyatomi , Kenichi Oishi

CIR-CoT: Towards Interpretable Composed Image Retrieval via End-to-End Chain-of-Thought Reasoning

Composed Image Retrieval (CIR), which aims to find a target image from a reference image and a modification text, presents the core challenge of performing unified reasoning across visual and semantic modalities. While current approaches…

Computer Vision and Pattern Recognition · Computer Science 2025-10-10 Weihuang Lin , Yiwei Ma , Jiayi Ji , Xiaoshuai Sun , Rongrong Ji

CUBIC: Concept Embeddings for Unsupervised Bias Identification using VLMs

Deep vision models often rely on biases learned from spurious correlations in datasets. To identify these biases, methods that interpret high-level, human-understandable concepts are more effective than those relying primarily on low-level…

Computer Vision and Pattern Recognition · Computer Science 2025-05-19 David Méndez , Gianpaolo Bontempo , Elisa Ficarra , Roberto Confalonieri , Natalia Díaz-Rodríguez

Scaling Prompt Instructed Zero Shot Composed Image Retrieval with Image-Only Data

Composed Image Retrieval (CIR) is the task of retrieving images matching a reference image augmented with a text, where the text describes changes to the reference image in natural language. Traditionally, models designed for CIR have…

Computer Vision and Pattern Recognition · Computer Science 2025-06-16 Yiqun Duan , Sameera Ramasinghe , Stephen Gould , Ajanthan Thalaiyasingam

Instance-Level Composed Image Retrieval

The progress of composed image retrieval (CIR), a popular research direction in image retrieval, where a combined visual and textual query is used, is held back by the absence of high-quality training and evaluation data. We introduce a new…

Computer Vision and Pattern Recognition · Computer Science 2025-12-16 Bill Psomas , George Retsinas , Nikos Efthymiadis , Panagiotis Filntisis , Yannis Avrithis , Petros Maragos , Ondrej Chum , Giorgos Tolias

Cross-Modality Sub-Image Retrieval using Contrastive Multimodal Image Representations

In tissue characterization and cancer diagnostics, multimodal imaging has emerged as a powerful technique. Thanks to computational advances, large datasets can be exploited to discover patterns in pathologies and improve diagnosis. However,…

Computer Vision and Pattern Recognition · Computer Science 2023-03-21 Eva Breznik , Elisabeth Wetzer , Joakim Lindblad , Nataša Sladoje

TEMA: Anchor the Image, Follow the Text for Multi-Modification Composed Image Retrieval

Composed Image Retrieval (CIR) is an important image retrieval paradigm that enables users to retrieve a target image using a multimodal query that consists of a reference image and modification text. Although research on CIR has made…

Computer Vision and Pattern Recognition · Computer Science 2026-04-27 Zixu Li , Yupeng Hu , Zhiheng Fu , Zhiwei Chen , Yongqi Li , Liqiang Nie

Unsupervised High-level Feature Learning by Ensemble Projection for Semi-supervised Image Classification and Image Clustering

This paper investigates the problem of image classification with limited or no annotations, but abundant unlabeled data. The setting exists in many tasks such as semi-supervised image classification, image clustering, and image retrieval.…

Computer Vision and Pattern Recognition · Computer Science 2016-02-05 Dengxin Dai , Luc Van Gool