Related papers: Dictionary-based Framework for Interpretable and C…

Learning to Generate Text-grounded Mask for Open-world Semantic Segmentation from Only Image-Text Pairs

We tackle open-world semantic segmentation, which aims at learning to segment arbitrary visual concepts in images, by using only image-text pairs without dense annotations. Existing open-world segmentation methods have shown impressive…

Computer Vision and Pattern Recognition · Computer Science 2023-03-28 Junbum Cha , Jonghwan Mun , Byungseok Roh

Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning

We introduce Patch Aligned Contrastive Learning (PACL), a modified compatibility function for CLIP's contrastive loss, intending to train an alignment between the patch tokens of the vision encoder and the CLS token of the text encoder.…

Computer Vision and Pattern Recognition · Computer Science 2022-12-12 Jishnu Mukhoti , Tsung-Yu Lin , Omid Poursaeed , Rui Wang , Ashish Shah , Philip H. S. Torr , Ser-Nam Lim

Semantic-Aware Contrastive Learning for Multi-object Medical Image Segmentation

Medical image segmentation, or computing voxelwise semantic masks, is a fundamental yet challenging task to compute a voxel-level semantic mask. To increase the ability of encoder-decoder neural networks to perform this task across large…

Computer Vision and Pattern Recognition · Computer Science 2021-11-10 Ho Hin Lee , Yucheng Tang , Qi Yang , Xin Yu , Shunxing Bao , Leon Y. Cai , Lucas W. Remedios , Bennett A. Landman , Yuankai Huo

ContrastMask: Contrastive Learning to Segment Every Thing

Partially-supervised instance segmentation is a task which requests segmenting objects from novel unseen categories via learning on limited seen categories with annotated masks thus eliminating demands of heavy annotation burden. The key to…

Computer Vision and Pattern Recognition · Computer Science 2022-03-25 Xuehui Wang , Kai Zhao , Ruixin Zhang , Shouhong Ding , Yan Wang , Wei Shen

Discovering Object Masks with Transformers for Unsupervised Semantic Segmentation

The task of unsupervised semantic segmentation aims to cluster pixels into semantically meaningful groups. Specifically, pixels assigned to the same cluster should share high-level semantic properties like their object or part category.…

Computer Vision and Pattern Recognition · Computer Science 2022-06-14 Wouter Van Gansbeke , Simon Vandenhende , Luc Van Gool

SPCL: A New Framework for Domain Adaptive Semantic Segmentation via Semantic Prototype-based Contrastive Learning

Although there is significant progress in supervised semantic segmentation, it remains challenging to deploy the segmentation models to unseen domains due to domain biases. Domain adaptation can help in this regard by transferring knowledge…

Computer Vision and Pattern Recognition · Computer Science 2022-05-03 Binhui Xie , Mingjia Li , Shuang Li

MaskCon: Masked Contrastive Learning for Coarse-Labelled Dataset

Deep learning has achieved great success in recent years with the aid of advanced neural network structures and large-scale human-annotated datasets. However, it is often costly and difficult to accurately and efficiently annotate…

Computer Vision and Pattern Recognition · Computer Science 2023-03-23 Chen Feng , Ioannis Patras

Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation

This paper addresses text-supervised semantic segmentation, aiming to learn a model capable of segmenting arbitrary visual concepts within images by using only image-text pairs without dense annotations. Existing methods have demonstrated…

Computer Vision and Pattern Recognition · Computer Science 2024-04-08 Ji-Jia Wu , Andy Chia-Hao Chang , Chieh-Yu Chuang , Chun-Pei Chen , Yu-Lun Liu , Min-Hung Chen , Hou-Ning Hu , Yung-Yu Chuang , Yen-Yu Lin

Show, Match and Segment: Joint Weakly Supervised Learning of Semantic Matching and Object Co-segmentation

We present an approach for jointly matching and segmenting object instances of the same category within a collection of images. In contrast to existing algorithms that tackle the tasks of semantic matching and object co-segmentation in…

Computer Vision and Pattern Recognition · Computer Science 2020-03-31 Yun-Chun Chen , Yen-Yu Lin , Ming-Hsuan Yang , Jia-Bin Huang

CoupAlign: Coupling Word-Pixel with Sentence-Mask Alignments for Referring Image Segmentation

Referring image segmentation aims at localizing all pixels of the visual objects described by a natural language sentence. Previous works learn to straightforwardly align the sentence embedding and pixel-level embedding for highlighting the…

Computer Vision and Pattern Recognition · Computer Science 2022-12-06 Zicheng Zhang , Yi Zhu , Jianzhuang Liu , Xiaodan Liang , Wei Ke

Multi-Rationale Explainable Object Recognition via Contrastive Conditional Inference

Explainable object recognition using vision-language models such as CLIP involves predicting accurate category labels supported by rationales that justify the decision-making process. Existing methods typically rely on prompt-based…

Computer Vision and Pattern Recognition · Computer Science 2025-08-21 Ali Rasekh , Sepehr Kazemi Ranjbar , Simon Gottschalk

One Leaf Reveals the Season: Occlusion-Based Contrastive Learning with Semantic-Aware Views for Efficient Visual Representation

This paper proposes a scalable and straightforward pre-training paradigm for efficient visual conceptual representation called occluded image contrastive learning (OCL). Our OCL approach is simple: we randomly mask patches to generate…

Computer Vision and Pattern Recognition · Computer Science 2025-02-17 Xiaoyu Yang , Lijian Xu , Hongsheng Li , Shaoting Zhang

Unsupervised Semantic Segmentation by Contrasting Object Mask Proposals

Being able to learn dense semantic representations of images without supervision is an important problem in computer vision. However, despite its significance, this problem remains rather unexplored, with a few exceptions that considered…

Computer Vision and Pattern Recognition · Computer Science 2021-08-04 Wouter Van Gansbeke , Simon Vandenhende , Stamatios Georgoulis , Luc Van Gool

CYBORGS: Contrastively Bootstrapping Object Representations by Grounding in Segmentation

Many recent approaches in contrastive learning have worked to close the gap between pretraining on iconic images like ImageNet and pretraining on complex scenes like COCO. This gap exists largely because commonly used random crop…

Computer Vision and Pattern Recognition · Computer Science 2022-08-17 Renhao Wang , Hang Zhao , Yang Gao

Improving Object Detection via Local-global Contrastive Learning

Visual domain gaps often impact object detection performance. Image-to-image translation can mitigate this effect, where contrastive approaches enable learning of the image-to-image mapping under unsupervised regimes. However, existing…

Computer Vision and Pattern Recognition · Computer Science 2024-10-28 Danai Triantafyllidou , Sarah Parisot , Ales Leonardis , Steven McDonagh

Aligning Pretraining for Detection via Object-Level Contrastive Learning

Image-level contrastive representation learning has proven to be highly effective as a generic model for transfer learning. Such generality for transfer learning, however, sacrifices specificity if we are interested in a certain downstream…

Computer Vision and Pattern Recognition · Computer Science 2021-10-26 Fangyun Wei , Yue Gao , Zhirong Wu , Han Hu , Stephen Lin

CoKe: Localized Contrastive Learning for Robust Keypoint Detection

In this paper, we introduce a contrastive learning framework for keypoint detection (CoKe). Keypoint detection differs from other visual tasks where contrastive learning has been applied because the input is a set of images in which…

Computer Vision and Pattern Recognition · Computer Science 2022-12-06 Yutong Bai , Angtian Wang , Adam Kortylewski , Alan Yuille

Identifying Interpretable Subspaces in Image Representations

We propose Automatic Feature Explanation using Contrasting Concepts (FALCON), an interpretability framework to explain features of image representations. For a target feature, FALCON captions its highly activating cropped images using a…

Computer Vision and Pattern Recognition · Computer Science 2023-09-11 Neha Kalibhat , Shweta Bhardwaj , Bayan Bruss , Hamed Firooz , Maziar Sanjabi , Soheil Feizi

CoCa: Contrastive Captioners are Image-Text Foundation Models

Exploring large-scale pretrained foundation models is of significant interest in computer vision because these models can be quickly transferred to many downstream tasks. This paper presents Contrastive Captioner (CoCa), a minimalist design…

Computer Vision and Pattern Recognition · Computer Science 2022-06-15 Jiahui Yu , Zirui Wang , Vijay Vasudevan , Legg Yeung , Mojtaba Seyedhosseini , Yonghui Wu

Masked Collaborative Contrast for Weakly Supervised Semantic Segmentation

This study introduces an efficacious approach, Masked Collaborative Contrast (MCC), to highlight semantic regions in weakly supervised semantic segmentation. MCC adroitly draws inspiration from masked image modeling and contrastive learning…

Computer Vision and Pattern Recognition · Computer Science 2023-11-10 Fangwen Wu , Jingxuan He , Yufei Yin , Yanbin Hao , Gang Huang , Lechao Cheng