Related papers: Decoupling Zero-Shot Semantic Segmentation

Zero-Shot Semantic Segmentation

Semantic segmentation models are limited in their ability to scale to large numbers of object classes. In this paper, we introduce the new task of zero-shot semantic segmentation: learning pixel-wise classifiers for never-seen object…

Computer Vision and Pattern Recognition · Computer Science 2019-11-19 Maxime Bucher , Tuan-Hung Vu , Matthieu Cord , Patrick Pérez

Bridge the Gap Between Visual and Linguistic Comprehension for Generalized Zero-shot Semantic Segmentation

Generalized zero-shot semantic segmentation (GZS3) aims to achieve the human-level capability of segmenting not only seen classes but also novel class regions unseen in the training data through introducing the bridge of semantic…

Computer Vision and Pattern Recognition · Computer Science 2025-04-01 Xiaoqing Guo , Wuyang Li , Yixuan Yuan

[CLS] Token is All You Need for Zero-Shot Semantic Segmentation

In this paper, we propose an embarrassingly simple yet highly effective zero-shot semantic segmentation (ZS3) method, based on the pre-trained vision-language model CLIP. First, our study provides a couple of key discoveries: (i) the global…

Computer Vision and Pattern Recognition · Computer Science 2023-04-14 Letian Wu , Wenyao Zhang , Tengping Jiang , Wankou Yang , Xin Jin , Wenjun Zeng

Exploring Open-Vocabulary Semantic Segmentation without Human Labels

Semantic segmentation is a crucial task in computer vision that involves segmenting images into semantically meaningful regions at the pixel level. However, existing approaches often rely on expensive human annotations as supervision for…

Computer Vision and Pattern Recognition · Computer Science 2023-06-02 Jun Chen , Deyao Zhu , Guocheng Qian , Bernard Ghanem , Zhicheng Yan , Chenchen Zhu , Fanyi Xiao , Mohamed Elhoseiny , Sean Chang Culatana

Split Matching for Inductive Zero-shot Semantic Segmentation

Zero-shot Semantic Segmentation (ZSS) aims to segment categories that are not annotated during training. While fine-tuning vision-language models has achieved promising results, these models often overfit to seen categories due to the lack…

Computer Vision and Pattern Recognition · Computer Science 2025-09-24 Jialei Chen , Xu Zheng , Dongyue Li , Chong Yi , Seigo Ito , Danda Pani Paudel , Luc Van Gool , Hiroshi Murase , Daisuke Deguchi

Delving into Shape-aware Zero-shot Semantic Segmentation

Thanks to the impressive progress of large-scale vision-language pretraining, recent recognition models can classify arbitrary objects in a zero-shot and open-set manner, with a surprisingly high accuracy. However, translating this success…

Computer Vision and Pattern Recognition · Computer Science 2023-04-18 Xinyu Liu , Beiwen Tian , Zhen Wang , Rui Wang , Kehua Sheng , Bo Zhang , Hao Zhao , Guyue Zhou

CLIP Is Also a Good Teacher: A New Learning Framework for Inductive Zero-shot Semantic Segmentation

Generalized Zero-shot Semantic Segmentation aims to segment both seen and unseen categories only under the supervision of the seen ones. To tackle this, existing methods adopt the large-scale Vision Language Models (VLMs) which obtain…

Computer Vision and Pattern Recognition · Computer Science 2024-02-22 Jialei Chen , Daisuke Deguchi , Chenkai Zhang , Xu Zheng , Hiroshi Murase

Zero-shot Image Recognition Using Relational Matching, Adaptation and Calibration

Zero-shot learning (ZSL) for image classification focuses on recognizing novel categories that have no labeled data available for training. The learning is generally carried out with the help of mid-level semantic descriptors associated…

Computer Vision and Pattern Recognition · Computer Science 2019-03-29 Debasmit Das , C. S. George Lee

Recursive Training for Zero-Shot Semantic Segmentation

General purpose semantic segmentation relies on a backbone CNN network to extract discriminative features that help classify each image pixel into a 'seen' object class (ie., the object classes available during training) or a background…

Computer Vision and Pattern Recognition · Computer Science 2021-10-06 Ce Wang , Moshiur Farazi , Nick Barnes

Zero-Shot Semantic Segmentation via Spatial and Multi-Scale Aware Visual Class Embedding

Fully supervised semantic segmentation technologies bring a paradigm shift in scene understanding. However, the burden of expensive labeling cost remains as a challenge. To solve the cost problem, recent studies proposed language model…

Computer Vision and Pattern Recognition · Computer Science 2021-12-21 Sungguk Cha , Yooseung Wang

Rethinking Zero-Shot Learning: A Conditional Visual Classification Perspective

Zero-shot learning (ZSL) aims to recognize instances of unseen classes solely based on the semantic descriptions of the classes. Existing algorithms usually formulate it as a semantic-visual correspondence problem, by learning mappings from…

Computer Vision and Pattern Recognition · Computer Science 2019-11-28 Kai Li , Martin Renqiang Min , Yun Fu

Visual Semantic Segmentation Based on Few/Zero-Shot Learning: An Overview

Visual semantic segmentation aims at separating a visual sample into diverse blocks with specific semantic attributes and identifying the category for each block, and it plays a crucial role in environmental perception. Conventional…

Computer Vision and Pattern Recognition · Computer Science 2022-11-16 Wenqi Ren , Yang Tang , Qiyu Sun , Chaoqiang Zhao , Qing-Long Han

ZegCLIP: Towards Adapting CLIP for Zero-shot Semantic Segmentation

Recently, CLIP has been applied to pixel-level zero-shot learning tasks via a two-stage scheme. The general idea is to first generate class-agnostic region proposals and then feed the cropped proposal regions to CLIP to utilize its…

Computer Vision and Pattern Recognition · Computer Science 2023-06-21 Ziqin Zhou , Bowen Zhang , Yinjie Lei , Lingqiao Liu , Yifan Liu

Zero-Shot Image Classification Using Coupled Dictionary Embedding

Zero-shot learning (ZSL) is a framework to classify images belonging to unseen classes based on solely semantic information about these unseen classes. In this paper, we propose a new ZSL algorithm using coupled dictionary learning. The…

Computer Vision and Pattern Recognition · Computer Science 2021-10-26 Mohammad Rostami , Soheil Kolouri , Zak Murez , Yuri Owekcho , Eric Eaton , Kuyngnam Kim

Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation

Zero-shot instance segmentation aims to detect and precisely segment objects of unseen categories without any training samples. Since the model is trained on seen categories, there is a strong bias that the model tends to classify all the…

Computer Vision and Pattern Recognition · Computer Science 2023-05-23 Shuting He , Henghui Ding , Wei Jiang

A Simple Framework for Open-Vocabulary Zero-Shot Segmentation

Zero-shot classification capabilities naturally arise in models trained within a vision-language contrastive framework. Despite their classification prowess, these models struggle in dense tasks like zero-shot open-vocabulary segmentation.…

Computer Vision and Pattern Recognition · Computer Science 2025-10-16 Thomas Stegmüller , Tim Lebailly , Nikola Dukic , Behzad Bozorgtabar , Tinne Tuytelaars , Jean-Philippe Thiran

Open-world Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding

To bridge the gap between supervised semantic segmentation and real-world applications that acquires one model to recognize arbitrary new concepts, recent zero-shot segmentation attracts a lot of attention by exploring the relationships…

Computer Vision and Pattern Recognition · Computer Science 2022-11-01 Quande Liu , Youpeng Wen , Jianhua Han , Chunjing Xu , Hang Xu , Xiaodan Liang

Semantics Disentangling for Generalized Zero-Shot Learning

Generalized zero-shot learning (GZSL) aims to classify samples under the assumption that some classes are not observable during training. To bridge the gap between the seen and unseen classes, most GZSL methods attempt to associate the…

Computer Vision and Pattern Recognition · Computer Science 2021-08-30 Zhi Chen , Yadan Luo , Ruihong Qiu , Sen Wang , Zi Huang , Jingjing Li , Zheng Zhang

Annotation Free Semantic Segmentation with Vision Foundation Models

Semantic Segmentation is one of the most challenging vision tasks, usually requiring large amounts of training data with expensive pixel level annotations. With the success of foundation models and especially vision-language models, recent…

Computer Vision and Pattern Recognition · Computer Science 2024-09-17 Soroush Seifi , Daniel Olmeda Reino , Fabien Despinoy , Rahaf Aljundi

Generalizable Semantic Vision Query Generation for Zero-shot Panoptic and Semantic Segmentation

Zero-shot Panoptic Segmentation (ZPS) aims to recognize foreground instances and background stuff without images containing unseen categories in training. Due to the visual data sparsity and the difficulty of generalizing from seen to…

Computer Vision and Pattern Recognition · Computer Science 2024-02-22 Jialei Chen , Daisuke Deguchi , Chenkai Zhang , Hiroshi Murase