Related papers: Self-Supervised Transformers for Unsupervised Obje…

TokenCut: Segmenting Objects in Images and Videos with Self-supervised Transformer and Normalized Cut

In this paper, we describe a graph-based algorithm that uses the features obtained by a self-supervised transformer to detect and segment salient objects in images and videos. With this approach, the image patches that compose an image or…

Computer Vision and Pattern Recognition · Computer Science 2023-12-06 Yangtao Wang , Xi Shen , Yuan Yuan , Yuming Du , Maomao Li , Shell Xu Hu , James L Crowley , Dominique Vaufreydaz

Finding Distributed Object-Centric Properties in Self-Supervised Transformers

Self-supervised Vision Transformers (ViTs) like DINO show an emergent ability to discover objects, typically observed in [CLS] token attention maps of the final layer. However, these maps often contain spurious activations resulting in poor…

Computer Vision and Pattern Recognition · Computer Science 2026-03-30 Samyak Rawlekar , Amitabh Swain , Yujun Cai , Yiwei Wang , Ming-Hsuan Yang , Narendra Ahuja

Weakly Supervised Salient Object Detection Using Image Labels

Deep learning based salient object detection has recently achieved great success with its performance greatly outperforms any other unsupervised methods. However, annotating per-pixel saliency masks is a tedious and inefficient procedure.…

Computer Vision and Pattern Recognition · Computer Science 2018-03-20 Guanbin Li , Yuan Xie , Liang Lin

Unsupervised Object Localization: Observing the Background to Discover Objects

Recent advances in self-supervised visual representation learning have paved the way for unsupervised methods tackling tasks such as object discovery and instance segmentation. However, discovering objects in an image with no supervision is…

Computer Vision and Pattern Recognition · Computer Science 2023-03-30 Oriane Siméoni , Chloé Sekkat , Gilles Puy , Antonin Vobecky , Éloi Zablocki , Patrick Pérez

Unifying Global-Local Representations in Salient Object Detection with Transformer

The fully convolutional network (FCN) has dominated salient object detection for a long period. However, the locality of CNN requires the model deep enough to have a global receptive field and such a deep model always leads to the loss of…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Sucheng Ren , Qiang Wen , Nanxuan Zhao , Guoqiang Han , Shengfeng He

DADO: A Depth-Attention framework for Object Discovery

Unsupervised object discovery, the task of identifying and localizing objects in images without human-annotated labels, remains a significant challenge and a growing focus in computer vision. In this work, we introduce a novel model, DADO…

Computer Vision and Pattern Recognition · Computer Science 2025-10-09 Federico Gonzalez , Estefania Talavera , Petia Radeva

Semi-supervised 3D Object Detection via Temporal Graph Neural Networks

3D object detection plays an important role in autonomous driving and other robotics applications. However, these detectors usually require training on large amounts of annotated data that is expensive and time-consuming to collect.…

Computer Vision and Pattern Recognition · Computer Science 2023-03-08 Jianren Wang , Haiming Gang , Siddharth Ancha , Yi-Ting Chen , David Held

LOCATE: Self-supervised Object Discovery via Flow-guided Graph-cut and Bootstrapped Self-training

Learning object segmentation in image and video datasets without human supervision is a challenging problem. Humans easily identify moving salient objects in videos using the gestalt principle of common fate, which suggests that what moves…

Computer Vision and Pattern Recognition · Computer Science 2023-12-05 Silky Singh , Shripad Deshmukh , Mausoom Sarkar , Balaji Krishnamurthy

Self-supervised Segmentation via Background Inpainting

While supervised object detection and segmentation methods achieve impressive accuracy, they generalize poorly to images whose appearance significantly differs from the data they have been trained on. To address this when annotating data is…

Computer Vision and Pattern Recognition · Computer Science 2020-11-12 Isinsu Katircioglu , Helge Rhodin , Victor Constantin , Jörg Spörri , Mathieu Salzmann , Pascal Fua

Towards the Success Rate of One: Real-time Unconstrained Salient Object Detection

In this work, we propose an efficient and effective approach for unconstrained salient object detection in images using deep convolutional neural networks. Instead of generating thousands of candidate bounding boxes and refining them, our…

Computer Vision and Pattern Recognition · Computer Science 2017-08-03 Mahyar Najibi , Fan Yang , Qiaosong Wang , Robinson Piramuthu

Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation

In this paper, we propose a simple yet effective approach for self-supervised video object segmentation (VOS). Our key insight is that the inherent structural dependencies present in DINO-pretrained Transformers can be leveraged to…

Computer Vision and Pattern Recognition · Computer Science 2024-07-09 Shuangrui Ding , Rui Qian , Haohang Xu , Dahua Lin , Hongkai Xiong

Unsupervised Semantic Segmentation with Self-supervised Object-centric Representations

In this paper, we show that recent advances in self-supervised feature learning enable unsupervised object discovery and semantic segmentation with a performance that matches the state of the field on supervised semantic segmentation 10…

Computer Vision and Pattern Recognition · Computer Science 2023-05-02 Andrii Zadaianchuk , Matthaeus Kleindessner , Yi Zhu , Francesco Locatello , Thomas Brox

Towards End-to-End Unsupervised Saliency Detection with Self-Supervised Top-Down Context

Unsupervised salient object detection aims to detect salient objects without using supervision signals eliminating the tedious task of manually labeling salient objects. To improve training efficiency, end-to-end methods for USOD have been…

Computer Vision and Pattern Recognition · Computer Science 2023-10-17 Yicheng Song , Shuyong Gao , Haozhe Xing , Yiting Cheng , Yan Wang , Wenqiang Zhang

Self-supervised Training of Proposal-based Segmentation via Background Prediction

While supervised object detection methods achieve impressive accuracy, they generalize poorly to images whose appearance significantly differs from the data they have been trained on. To address this in scenarios where annotating data is…

Computer Vision and Pattern Recognition · Computer Science 2019-07-19 Isinsu Katircioglu , Helge Rhodin , Victor Constantin , Jörg Spörri , Mathieu Salzmann , Pascal Fua

Weakly-supervised Contrastive Learning for Unsupervised Object Discovery

Unsupervised object discovery (UOD) refers to the task of discriminating the whole region of objects from the background within a scene without relying on labeled datasets, which benefits the task of bounding-box-level localization and…

Computer Vision and Pattern Recognition · Computer Science 2023-07-10 Yunqiu Lv , Jing Zhang , Nick Barnes , Yuchao Dai

Oh-A-DINO: Understanding and Enhancing Attribute-Level Information in Self-Supervised Object-Centric Representations

Object-centric understanding is fundamental to human vision and required for complex reasoning. Traditional methods define slot-based bottlenecks to learn object properties explicitly, while recent self-supervised vision models like DINO…

Computer Vision and Pattern Recognition · Computer Science 2025-10-03 Stefan Sylvius Wagner , Stefan Harmeling

Unsupervised Domain Adaptive Salient Object Detection Through Uncertainty-Aware Pseudo-Label Learning

Recent advances in deep learning significantly boost the performance of salient object detection (SOD) at the expense of labeling larger-scale per-pixel annotations. To relieve the burden of labor-intensive labeling, deep unsupervised SOD…

Computer Vision and Pattern Recognition · Computer Science 2022-03-01 Pengxiang Yan , Ziyi Wu , Mengmeng Liu , Kun Zeng , Liang Lin , Guanbin Li

Adaptive Graph Convolution Module for Salient Object Detection

Salient object detection (SOD) is a task that involves identifying and segmenting the most visually prominent object in an image. Existing solutions can accomplish this use a multi-scale feature fusion mechanism to detect the global context…

Computer Vision and Pattern Recognition · Computer Science 2023-03-20 Yongwoo Lee , Minhyeok Lee , Suhwan Cho , Sangyoun Lee

Image Segmentation-based Unsupervised Multiple Objects Discovery

Unsupervised object discovery aims to localize objects in images, while removing the dependence on annotations required by most deep learning-based methods. To address this problem, we propose a fully unsupervised, bottom-up approach, for…

Computer Vision and Pattern Recognition · Computer Science 2022-12-21 Sandra Kara , Hejer Ammar , Florian Chabot , Quoc-Cuong Pham

Self-explanatory Deep Salient Object Detection

Salient object detection has seen remarkable progress driven by deep learning techniques. However, most of deep learning based salient object detection methods are black-box in nature and lacking in interpretability. This paper proposes the…

Computer Vision and Pattern Recognition · Computer Science 2017-08-21 Huaxin Xiao , Jiashi Feng , Yunchao Wei , Maojun Zhang