Related papers: Multi-Granularity Video Object Segmentation

Point-VOS: Pointing Up Video Object Segmentation

Current state-of-the-art Video Object Segmentation (VOS) methods rely on dense per-object mask annotations both during training and testing. This requires time-consuming and costly video annotation mechanisms. We propose a novel Point-VOS…

Computer Vision and Pattern Recognition · Computer Science 2024-06-11 Idil Esen Zulfikar , Sabarinath Mahadevan , Paul Voigtlaender , Bastian Leibe

Generating Masks from Boxes by Mining Spatio-Temporal Consistencies in Videos

Segmenting objects in videos is a fundamental computer vision task. The current deep learning based paradigm offers a powerful, but data-hungry solution. However, current datasets are limited by the cost and human effort of annotating…

Computer Vision and Pattern Recognition · Computer Science 2021-01-07 Bin Zhao , Goutam Bhat , Martin Danelljan , Luc Van Gool , Radu Timofte

Learning Video Object Segmentation from Unlabeled Videos

We propose a new method for video object segmentation (VOS) that addresses object pattern learning from unlabeled videos, unlike most existing methods which rely heavily on extensive annotated data. We introduce a unified…

Computer Vision and Pattern Recognition · Computer Science 2020-03-12 Xiankai Lu , Wenguan Wang , Jianbing Shen , Yu-Wing Tai , David Crandall , Steven C. H. Hoi

YouTube-VOS: A Large-Scale Video Object Segmentation Benchmark

Learning long-term spatial-temporal features are critical for many video analysis tasks. However, existing video segmentation methods predominantly rely on static image segmentation techniques, and methods capturing temporal dependency for…

Computer Vision and Pattern Recognition · Computer Science 2018-09-11 Ning Xu , Linjie Yang , Yuchen Fan , Dingcheng Yue , Yuchen Liang , Jianchao Yang , Thomas Huang

Learning Video Object Segmentation from Static Images

Inspired by recent advances of deep learning in instance segmentation and object tracking, we introduce video object segmentation problem as a concept of guided instance segmentation. Our model proceeds on a per-frame basis, guided by the…

Computer Vision and Pattern Recognition · Computer Science 2019-02-05 Anna Khoreva , Federico Perazzi , Rodrigo Benenson , Bernt Schiele , Alexander Sorkine-Hornung

CapsuleVOS: Semi-Supervised Video Object Segmentation Using Capsule Routing

In this work we propose a capsule-based approach for semi-supervised video object segmentation. Current video object segmentation methods are frame-based and often require optical flow to capture temporal consistency across frames which can…

Computer Vision and Pattern Recognition · Computer Science 2019-10-02 Kevin Duarte , Yogesh S Rawat , Mubarak Shah

MOSE: A New Dataset for Video Object Segmentation in Complex Scenes

Video object segmentation (VOS) aims at segmenting a particular object throughout the entire video clip sequence. The state-of-the-art VOS methods have achieved excellent performance (e.g., 90+% J&F) on existing datasets. However, since the…

Computer Vision and Pattern Recognition · Computer Science 2023-10-24 Henghui Ding , Chang Liu , Shuting He , Xudong Jiang , Philip H. S. Torr , Song Bai

YouTube-VOS: Sequence-to-Sequence Video Object Segmentation

Learning long-term spatial-temporal features are critical for many video analysis tasks. However, existing video segmentation methods predominantly rely on static image segmentation techniques, and methods capturing temporal dependency for…

Computer Vision and Pattern Recognition · Computer Science 2018-09-05 Ning Xu , Linjie Yang , Yuchen Fan , Jianchao Yang , Dingcheng Yue , Yuchen Liang , Brian Price , Scott Cohen , Thomas Huang

Learning the What and How of Annotation in Video Object Segmentation

Video Object Segmentation (VOS) is crucial for several applications, from video editing to video data generation. Training a VOS model requires an abundance of manually labeled training videos. The de-facto traditional way of annotating…

Computer Vision and Pattern Recognition · Computer Science 2023-11-14 Thanos Delatolas , Vicky Kalogeiton , Dim P. Papadopoulos

MOSEv2: A More Challenging Dataset for Video Object Segmentation in Complex Scenes

Video object segmentation (VOS) aims to segment specified target objects throughout a video. Although state-of-the-art methods have achieved impressive performance (e.g., 90+% J&F) on benchmarks such as DAVIS and YouTube-VOS, these datasets…

Computer Vision and Pattern Recognition · Computer Science 2025-09-23 Henghui Ding , Kaining Ying , Chang Liu , Shuting He , Xudong Jiang , Yu-Gang Jiang , Philip H. S. Torr , Song Bai

Sketch-based Video Object Segmentation: Benchmark and Analysis

Reference-based video object segmentation is an emerging topic which aims to segment the corresponding target object in each video frame referred by a given reference, such as a language expression or a photo mask. However, language…

Computer Vision and Pattern Recognition · Computer Science 2023-11-14 Ruolin Yang , Da Li , Conghui Hu , Timothy Hospedales , Honggang Zhang , Yi-Zhe Song

Learning Fast and Robust Target Models for Video Object Segmentation

Video object segmentation (VOS) is a highly challenging problem since the initial mask, defining the target object, is only given at test-time. The main difficulty is to effectively handle appearance changes and similar background objects,…

Computer Vision and Pattern Recognition · Computer Science 2020-04-01 Andreas Robinson , Felix Järemo Lawin , Martin Danelljan , Fahad Shahbaz Khan , Michael Felsberg

LIP: Learning Instance Propagation for Video Object Segmentation

In recent years, the task of segmenting foreground objects from background in a video, i.e. video object segmentation (VOS), has received considerable attention. In this paper, we propose a single end-to-end trainable deep neural network,…

Computer Vision and Pattern Recognition · Computer Science 2019-10-02 Ye Lyu , George Vosselman , Gui-Song Xia , Michael Ying Yang

1st Place Solution for MOSE Track in CVPR 2024 PVUW Workshop: Complex Video Object Segmentation

Tracking and segmenting multiple objects in complex scenes has always been a challenge in the field of video object segmentation, especially in scenarios where objects are occluded and split into parts. In such cases, the definition of…

Computer Vision and Pattern Recognition · Computer Science 2024-06-10 Deshui Miao , Xin Li , Zhenyu He , Yaowei Wang , Ming-Hsuan Yang

BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in Video

Multiple existing benchmarks involve tracking and segmenting objects in video e.g., Video Object Segmentation (VOS) and Multi-Object Tracking and Segmentation (MOTS), but there is little interaction between them due to the use of disparate…

Computer Vision and Pattern Recognition · Computer Science 2022-11-23 Ali Athar , Jonathon Luiten , Paul Voigtlaender , Tarasha Khurana , Achal Dave , Bastian Leibe , Deva Ramanan

Proposal, Tracking and Segmentation (PTS): A Cascaded Network for Video Object Segmentation

Video object segmentation (VOS) aims at pixel-level object tracking given only the annotations in the first frame. Due to the large visual variations of objects in video and the lack of training samples, it remains a difficult task despite…

Computer Vision and Pattern Recognition · Computer Science 2019-07-05 Qiang Zhou , Zilong Huang , Lichao Huang , Yongchao Gong , Han Shen , Chang Huang , Wenyu Liu , Xinggang Wang

Spectrum-guided Multi-granularity Referring Video Object Segmentation

Current referring video object segmentation (R-VOS) techniques extract conditional kernels from encoded (low-resolution) vision-language features to segment the decoded high-resolution features. We discovered that this causes significant…

Computer Vision and Pattern Recognition · Computer Science 2023-07-26 Bo Miao , Mohammed Bennamoun , Yongsheng Gao , Ajmal Mian

Towards Open-Vocabulary Video Instance Segmentation

Video Instance Segmentation (VIS) aims at segmenting and categorizing objects in videos from a closed set of training categories, lacking the generalization ability to handle novel categories in real-world videos. To address this…

Computer Vision and Pattern Recognition · Computer Science 2023-08-08 Haochen Wang , Cilin Yan , Shuai Wang , Xiaolong Jiang , XU Tang , Yao Hu , Weidi Xie , Efstratios Gavves

Learning What to Learn for Video Object Segmentation

Video object segmentation (VOS) is a highly challenging problem, since the target object is only defined during inference with a given first-frame reference mask. The problem of how to capture and utilize this limited target information…

Computer Vision and Pattern Recognition · Computer Science 2020-05-04 Goutam Bhat , Felix Järemo Lawin , Martin Danelljan , Andreas Robinson , Michael Felsberg , Luc Van Gool , Radu Timofte

Tag-Based Attention Guided Bottom-Up Approach for Video Instance Segmentation

Video Instance Segmentation is a fundamental computer vision task that deals with segmenting and tracking object instances across a video sequence. Most existing methods typically accomplish this task by employing a multi-stage top-down…

Computer Vision and Pattern Recognition · Computer Science 2022-04-25 Jyoti Kini , Mubarak Shah