Related papers: Self-supervised Video Object Segmentation

Video Object Segmentation using Tracked Object Proposals

We present an approach to semi-supervised video object segmentation, in the context of the DAVIS 2017 challenge. Our approach combines category-based object detection, category-independent object appearance segmentation and temporal object…

Computer Vision and Pattern Recognition · Computer Science 2017-07-21 Gilad Sharir , Eddie Smolyansky , Itamar Friedman

Dense Unsupervised Learning for Video Segmentation

We present a novel approach to unsupervised learning for video object segmentation (VOS). Unlike previous work, our formulation allows to learn dense feature representations directly in a fully convolutional regime. We rely on uniform grid…

Computer Vision and Pattern Recognition · Computer Science 2021-11-12 Nikita Araslanov , Simone Schaub-Meyer , Stefan Roth

Self-Supervised Learning of Object Parts for Semantic Segmentation

Progress in self-supervised learning has brought strong general image representation learning methods. Yet so far, it has mostly focused on image-level learning. In turn, tasks such as unsupervised image segmentation have not benefited from…

Computer Vision and Pattern Recognition · Computer Science 2022-06-22 Adrian Ziegler , Yuki M. Asano

Self-Supervision by Prediction for Object Discovery in Videos

Despite their irresistible success, deep learning algorithms still heavily rely on annotated data. On the other hand, unsupervised settings pose many challenges, especially about determining the right inductive bias in diverse scenarios.…

Computer Vision and Pattern Recognition · Computer Science 2021-03-11 Beril Besbinar , Pascal Frossard

Self-supervised Video Object Segmentation with Distillation Learning of Deformable Attention

Video object segmentation is a fundamental research problem in computer vision. Recent techniques have often applied attention mechanism to object representation learning from video sequences. However, due to temporal changes in the video…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Quang-Trung Truong , Duc Thanh Nguyen , Binh-Son Hua , Sai-Kit Yeung

A Semi-Self-Supervised Approach for Dense-Pattern Video Object Segmentation

Video object segmentation (VOS) -- predicting pixel-level regions for objects within each frame of a video -- is particularly challenging in agricultural scenarios, where videos of crops include hundreds of small, dense, and occluded…

Computer Vision and Pattern Recognition · Computer Science 2025-04-17 Keyhan Najafian , Farhad Maleki , Lingling Jin , Ian Stavness

Time Does Tell: Self-Supervised Time-Tuning of Dense Image Representations

Spatially dense self-supervised learning is a rapidly growing problem domain with promising applications for unsupervised segmentation and pretraining for dense downstream tasks. Despite the abundance of temporal data in the form of videos,…

Computer Vision and Pattern Recognition · Computer Science 2023-08-24 Mohammadreza Salehi , Efstratios Gavves , Cees G. M. Snoek , Yuki M. Asano

Semi-TCL: Semi-Supervised Track Contrastive Representation Learning

Online tracking of multiple objects in videos requires strong capacity of modeling and matching object appearances. Previous methods for learning appearance embedding mostly rely on instance-level matching without considering the temporal…

Computer Vision and Pattern Recognition · Computer Science 2021-07-07 Wei Li , Yuanjun Xiong , Shuo Yang , Mingze Xu , Yongxin Wang , Wei Xia

Unified Mask Embedding and Correspondence Learning for Self-Supervised Video Segmentation

The objective of this paper is self-supervised learning of video object segmentation. We develop a unified framework which simultaneously models cross-frame dense correspondence for locally discriminative feature learning and embeds…

Computer Vision and Pattern Recognition · Computer Science 2023-03-20 Liulei Li , Wenguan Wang , Tianfei Zhou , Jianwu Li , Yi Yang

Learning Spatial-Semantic Features for Robust Video Object Segmentation

Tracking and segmenting multiple similar objects with distinct or complex parts in long-term videos is particularly challenging due to the ambiguity in identifying target components and the confusion caused by occlusion, background clutter,…

Computer Vision and Pattern Recognition · Computer Science 2025-04-08 Xin Li , Deshui Miao , Zhenyu He , Yaowei Wang , Huchuan Lu , Ming-Hsuan Yang

Self-Supervised Learning of Audio-Visual Objects from Video

Our objective is to transform a video into a set of discrete audio-visual objects using self-supervised learning. To this end, we introduce a model that uses attention to localize and group sound sources, and optical flow to aggregate…

Computer Vision and Pattern Recognition · Computer Science 2020-08-11 Triantafyllos Afouras , Andrew Owens , Joon Son Chung , Andrew Zisserman

A Perceptual Prediction Framework for Self Supervised Event Segmentation

Temporal segmentation of long videos is an important problem, that has largely been tackled through supervised learning, often requiring large amounts of annotated training data. In this paper, we tackle the problem of self-supervised…

Computer Vision and Pattern Recognition · Computer Science 2019-04-09 Sathyanarayanan N. Aakur , Sudeep Sarkar

Self-supervised Learning for Video Correspondence Flow

The objective of this paper is self-supervised learning of feature embeddings that are suitable for matching correspondences along the videos, which we term correspondence flow. By leveraging the natural spatial-temporal coherence in…

Computer Vision and Pattern Recognition · Computer Science 2019-07-30 Zihang Lai , Weidi Xie

Video Object Segmentation using Space-Time Memory Networks

We propose a novel solution for semi-supervised video object segmentation. By the nature of the problem, available cues (e.g. video frame(s) with object masks) become richer with the intermediate predictions. However, the existing methods…

Computer Vision and Pattern Recognition · Computer Science 2019-08-13 Seoung Wug Oh , Joon-Young Lee , Ning Xu , Seon Joo Kim

Watch and Learn: Semi-Supervised Learning of Object Detectors from Videos

We present a semi-supervised approach that localizes multiple unknown object instances in long videos. We start with a handful of labeled boxes and iteratively learn and label hundreds of thousands of object instances. We propose criteria…

Computer Vision and Pattern Recognition · Computer Science 2015-05-22 Ishan Misra , Abhinav Shrivastava , Martial Hebert

Self-supervised Object-Centric Learning for Videos

Unsupervised multi-object segmentation has shown impressive results on images by utilizing powerful semantics learned from self-supervised pretraining. An additional modality such as depth or motion is often used to facilitate the…

Computer Vision and Pattern Recognition · Computer Science 2023-10-12 Görkay Aydemir , Weidi Xie , Fatma Güney

Self-Supervised Representation Learning for Visual Anomaly Detection

Self-supervised learning allows for better utilization of unlabelled data. The feature representation obtained by self-supervision can be used in downstream tasks such as classification, object detection, segmentation, and anomaly…

Computer Vision and Pattern Recognition · Computer Science 2020-06-18 Rabia Ali , Muhammad Umar Karim Khan , Chong Min Kyung

Fast Online Object Tracking and Segmentation: A Unifying Approach

In this paper we illustrate how to perform both visual object tracking and semi-supervised video object segmentation, in real-time, with a single simple approach. Our method, dubbed SiamMask, improves the offline training procedure of…

Computer Vision and Pattern Recognition · Computer Science 2019-05-07 Qiang Wang , Li Zhang , Luca Bertinetto , Weiming Hu , Philip H. S. Torr

Joint Inductive and Transductive Learning for Video Object Segmentation

Semi-supervised video object segmentation is a task of segmenting the target object in a video sequence given only a mask annotation in the first frame. The limited information available makes it an extremely challenging task. Most previous…

Computer Vision and Pattern Recognition · Computer Science 2021-08-10 Yunyao Mao , Ning Wang , Wengang Zhou , Houqiang Li

FlowVOS: Weakly-Supervised Visual Warping for Detail-Preserving and Temporally Consistent Single-Shot Video Object Segmentation

We consider the task of semi-supervised video object segmentation (VOS). Our approach mitigates shortcomings in previous VOS work by addressing detail preservation and temporal consistency using visual warping. In contrast to prior work…

Computer Vision and Pattern Recognition · Computer Science 2021-11-23 Julia Gong , F. Christopher Holsinger , Serena Yeung