Related papers: Subobject-level Image Tokenization

Pointly-Supervised Panoptic Segmentation

In this paper, we propose a new approach to applying point-level annotations for weakly-supervised panoptic segmentation. Instead of the dense pixel-level labels used by fully supervised methods, point-level labels only provide a single…

Computer Vision and Pattern Recognition · Computer Science 2022-10-26 Junsong Fan, Zhaoxiang Zhang, Tieniu Tan

Maximize the Exploration of Congeneric Semantics for Weakly Supervised Semantic Segmentation

With the increase in the number of image data and the lack of corresponding labels, weakly supervised learning has drawn a lot of attention recently in computer vision tasks, especially in the fine-grained semantic segmentation problem. To…

Computer Vision and Pattern Recognition · Computer Science 2021-10-11 Ke Zhang, Sihong Chen, Qi Ju, Yong Jiang, Yucong Li, Xin He

Scene-Centric Unsupervised Panoptic Segmentation

Unsupervised panoptic segmentation aims to partition an image into semantically meaningful regions and distinct object instances without training on manually annotated data. In contrast to prior work on unsupervised panoptic scene…

Computer Vision and Pattern Recognition · Computer Science 2025-11-12 Oliver Hahn, Christoph Reich, Nikita Araslanov, Daniel Cremers, Christian Rupprecht, Stefan Roth

Weakly-Supervised Semantic Segmentation by Iteratively Mining Common Object Features

Weakly-supervised semantic segmentation under image tags supervision is a challenging task as it directly associates high-level semantic to low-level appearance. To bridge this gap, in this paper, we propose an iterative bottom-up and…

Computer Vision and Pattern Recognition · Computer Science 2018-06-13 Xiang Wang, Shaodi You, Xi Li, Huimin Ma

Exploring Token-Level Augmentation in Vision Transformer for Semi-Supervised Semantic Segmentation

Semi-supervised semantic segmentation has witnessed remarkable advancements in recent years. However, existing algorithms are based on convolutional neural networks and directly applying them to Vision Transformers poses certain limitations…

Computer Vision and Pattern Recognition · Computer Science 2025-03-11 Dengke Zhang, Quan Tang, Fagui Liu, Haiqing Mei, C. L. Philip Chen

CAT: Content-Adaptive Image Tokenization

Most existing image tokenizers encode images into a fixed number of tokens or patches, overlooking the inherent variability in image complexity. To address this, we introduce Content-Adaptive Tokenizer (CAT), which dynamically adjusts…

Computer Vision and Pattern Recognition · Computer Science 2025-01-07 Junhong Shen, Kushal Tirumala, Michihiro Yasunaga, Ishan Misra, Luke Zettlemoyer, Lili Yu, Chunting Zhou

Multi-Evidence Filtering and Fusion for Multi-Label Classification, Object Detection and Semantic Segmentation Based on Weakly Supervised Learning

Supervised object detection and semantic segmentation require object or even pixel level annotations. When there exist image level labels only, it is challenging for weakly supervised algorithms to achieve accurate predictions. The accuracy…

Computer Vision and Pattern Recognition · Computer Science 2018-03-06 Weifeng Ge, Sibei Yang, Yizhou Yu

Domain Adaptation with Morphologic Segmentation

We present a novel domain adaptation framework that uses morphologic segmentation to translate images from arbitrary input domains (real and synthetic) into a uniform output domain. Our framework is based on an established image-to-image…

Computer Vision and Pattern Recognition · Computer Science 2020-06-17 Jonathan Klein, Sören Pirk, Dominik L. Michels

Pixel-Level Clustering Network for Unsupervised Image Segmentation

While image segmentation is crucial in various computer vision applications, such as autonomous driving, grasping, and robot navigation, annotating all objects at the pixel-level for training is nearly impossible. Therefore, the study of…

Computer Vision and Pattern Recognition · Computer Science 2023-10-26 Cuong Manh Hoang, Byeongkeun Kang

Deep Patch Learning for Weakly Supervised Object Classification and Discovery

Patch-level image representation is very important for object classification and detection, since it is robust to spatial transformation, scale variation, and cluttered background. Many existing methods usually require fine-grained…

Computer Vision and Pattern Recognition · Computer Science 2017-05-09 Peng Tang, Xinggang Wang, Zilong Huang, Xiang Bai, Wenyu Liu

Rethinking Generalization in Few-Shot Classification

Single image-level annotations only correctly describe an often small subset of an image's content, particularly when complex real-world scenes are depicted. While this might be acceptable in many classification scenarios, it poses a…

Computer Vision and Pattern Recognition · Computer Science 2022-10-18 Markus Hiller, Rongkai Ma, Mehrtash Harandi, Tom Drummond

Unsupervised High-level Feature Learning by Ensemble Projection for Semi-supervised Image Classification and Image Clustering

This paper investigates the problem of image classification with limited or no annotations, but abundant unlabeled data. The setting exists in many tasks such as semi-supervised image classification, image clustering, and image retrieval.…

Computer Vision and Pattern Recognition · Computer Science 2016-02-05 Dengxin Dai, Luc Van Gool

Self-supervised Semantic Segmentation Grounded in Visual Concepts

Unsupervised semantic segmentation requires assigning a label to every pixel without any human annotations. Despite recent advances in self-supervised representation learning for individual images, unsupervised semantic segmentation with…

Computer Vision and Pattern Recognition · Computer Science 2022-07-27 Wenbin He, William Surmeier, Arvind Kumar Shekar, Liang Gou, Liu Ren

Sensor Adaptation for Improved Semantic Segmentation of Overhead Imagery

Semantic segmentation is a powerful method to facilitate visual scene understanding. Each pixel is assigned a label according to a pre-defined list of object classes and semantic entities. This becomes very useful as a means to summarize…

Computer Vision and Pattern Recognition · Computer Science 2018-11-21 Marc Bosch, Gordon A. Christie, Christopher M. Gifford

A Lightweight Clustering Framework for Unsupervised Semantic Segmentation

Unsupervised semantic segmentation aims to categorize each pixel in an image into a corresponding class without the use of annotated data. It is a widely researched area as obtaining labeled datasets is expensive. While previous works in…

Computer Vision and Pattern Recognition · Computer Science 2024-01-01 Yau Shing Jonathan Cheung, Xi Chen, Lihe Yang, Hengshuang Zhao

Panoptic-aware Image-to-Image Translation

Despite remarkable progress in image translation, the complex scene with multiple discrepant objects remains a challenging problem. The translated images have low fidelity and tiny objects in fewer details causing unsatisfactory performance…

Computer Vision and Pattern Recognition · Computer Science 2022-12-26 Liyun Zhang, Photchara Ratsamee, Bowen Wang, Zhaojie Luo, Yuki Uranishi, Manabu Higashida, Haruo Takemura

Railroad is not a Train: Saliency as Pseudo-pixel Supervision for Weakly Supervised Semantic Segmentation

Existing studies in weakly-supervised semantic segmentation (WSSS) using image-level weak supervision have several limitations: sparse object coverage, inaccurate object boundaries, and co-occurring pixels from non-target objects. To…

Computer Vision and Pattern Recognition · Computer Science 2021-05-20 Seungho Lee, Minhyun Lee, Jongwuk Lee, Hyunjung Shim

Weakly- and Semi-Supervised Panoptic Segmentation

We present a weakly supervised model that jointly performs both semantic- and instance-segmentation -- a particularly relevant problem given the substantial cost of obtaining pixel-perfect annotation for these tasks. In contrast to many…

Computer Vision and Pattern Recognition · Computer Science 2019-01-15 Qizhu Li, Anurag Arnab, Philip H. S. Torr

Bottom-Up Top-Down Cues for Weakly-Supervised Semantic Segmentation

We consider the task of learning a classifier for semantic segmentation using weak supervision in the form of image labels which specify the object classes present in the image. Our method uses deep convolutional neural networks (CNNs) and…

Computer Vision and Pattern Recognition · Computer Science 2017-11-07 Qibin Hou, Puneet Kumar Dokania, Daniela Massiceti, Yunchao Wei, Ming-Ming Cheng, Philip Torr

Mutual Enhancement Between Global Tokens and Patch Tokens: From Theory to Practice

Accurate and effective discrete image tokenization is crucial for long image sequence processing. However, current methods rigidly compress all content at a fixed rate, ignoring the variable information density of images and leading to…

Computer Vision and Pattern Recognition · Computer Science 2026-05-19 Xiusheng Huang, Xin Jiang, Jun Zhao, Kang Liu, Yequan Wang