Related papers: Memory Aggregation Networks for Efficient Interact…

Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion

We present Modular interactive VOS (MiVOS) framework which decouples interaction-to-mask and mask propagation, allowing for higher generalizability and better performance. Trained separately, the interaction module converts user…

Computer Vision and Pattern Recognition · Computer Science 2021-03-23 Ho Kei Cheng , Yu-Wing Tai , Chi-Keung Tang

IDPro: Flexible Interactive Video Object Segmentation by ID-queried Concurrent Propagation

Interactive Video Object Segmentation (iVOS) is a challenging task that requires real-time human-computer interaction. To improve the user experience, it is important to consider the user's input habits, segmentation quality, running time…

Computer Vision and Pattern Recognition · Computer Science 2025-02-10 Kexin Li , Tao Jiang , Zongxin Yang , Yi Yang , Yueting Zhuang , Jun Xiao

Region Aware Video Object Segmentation with Deep Motion Modeling

Current semi-supervised video object segmentation (VOS) methods usually leverage the entire features of one frame to predict object masks and update memory. This introduces significant redundant computations. To reduce redundancy, we…

Computer Vision and Pattern Recognition · Computer Science 2022-07-22 Bo Miao , Mohammed Bennamoun , Yongsheng Gao , Ajmal Mian

OVSNet : Towards One-Pass Real-Time Video Object Segmentation

Video object segmentation aims at accurately segmenting the target object regions across consecutive frames. It is technically challenging for coping with complicated factors (e.g., shape deformations, occlusion and out of the lens). Recent…

Computer Vision and Pattern Recognition · Computer Science 2019-07-03 Peng Sun , Peiwen Lin , Guangliang Cheng , Jianping Shi , Jiawan Zhang , Xi Li

Dual Temporal Memory Network for Efficient Video Object Segmentation

Video Object Segmentation (VOS) is typically formulated in a semi-supervised setting. Given the ground-truth segmentation mask on the first frame, the task of VOS is to track and segment the single or multiple objects of interests in the…

Computer Vision and Pattern Recognition · Computer Science 2020-03-16 Kaihua Zhang , Long Wang , Dong Liu , Bo Liu , Qingshan Liu , Zhu Li

Efficient Video Object Segmentation via Modulated Cross-Attention Memory

Recently, transformer-based approaches have shown promising results for semi-supervised video object segmentation. However, these approaches typically struggle on long videos due to increased GPU memory demands, as they frequently expand…

Computer Vision and Pattern Recognition · Computer Science 2024-09-27 Abdelrahman Shaker , Syed Talal Wasim , Martin Danelljan , Salman Khan , Ming-Hsuan Yang , Fahad Shahbaz Khan

Fast Video Object Segmentation With Temporal Aggregation Network and Dynamic Template Matching

Significant progress has been made in Video Object Segmentation (VOS), the video object tracking task in its finest level. While the VOS task can be naturally decoupled into image semantic segmentation and video object tracking,…

Computer Vision and Pattern Recognition · Computer Science 2020-07-14 Xuhua Huang , Jiarui Xu , Yu-Wing Tai , Chi-Keung Tang

Video Object Segmentation with Dynamic Query Modulation

Storing intermediate frame segmentations as memory for long-range context modeling, spatial-temporal memory-based methods have recently showcased impressive results in semi-supervised video object segmentation (SVOS). However, these methods…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Hantao Zhou , Runze Hu , Xiu Li

Fast User-Guided Video Object Segmentation by Interaction-and-Propagation Networks

We present a deep learning method for the interactive video object segmentation. Our method is built upon two core operations, interaction and propagation, and each operation is conducted by Convolutional Neural Networks. The two networks…

Computer Vision and Pattern Recognition · Computer Science 2019-05-03 Seoung Wug Oh , Joon-Young Lee , Ning Xu , Seon Joo Kim

Revisiting Click-based Interactive Video Object Segmentation

While current methods for interactive Video Object Segmentation (iVOS) rely on scribble-based interactions to generate precise object masks, we propose a Click-based interactive Video Object Segmentation (CiVOS) framework to simplify the…

Computer Vision and Pattern Recognition · Computer Science 2022-12-20 Stephane Vujasinovic , Sebastian Bullinger , Stefan Becker , Norbert Scherer-Negenborn , Michael Arens , Rainer Stiefelhagen

OneVOS: Unifying Video Object Segmentation with All-in-One Transformer Framework

Contemporary Video Object Segmentation (VOS) approaches typically consist stages of feature extraction, matching, memory management, and multiple objects aggregation. Recent advanced models either employ a discrete modeling for these…

Computer Vision and Pattern Recognition · Computer Science 2024-03-14 Wanyun Li , Pinxue Guo , Xinyu Zhou , Lingyi Hong , Yangji He , Xiangyu Zheng , Wei Zhang , Wenqiang Zhang

Video Object Segmentation using Space-Time Memory Networks

We propose a novel solution for semi-supervised video object segmentation. By the nature of the problem, available cues (e.g. video frame(s) with object masks) become richer with the intermediate predictions. However, the existing methods…

Computer Vision and Pattern Recognition · Computer Science 2019-08-13 Seoung Wug Oh , Joon-Young Lee , Ning Xu , Seon Joo Kim

DeVOS: Flow-Guided Deformable Transformer for Video Object Segmentation

The recent works on Video Object Segmentation achieved remarkable results by matching dense semantic and instance-level features between the current and previous frames for long-time propagation. Nevertheless, global feature matching…

Computer Vision and Pattern Recognition · Computer Science 2024-05-15 Volodymyr Fedynyak , Yaroslav Romanus , Bohdan Hlovatskyi , Bohdan Sydor , Oles Dobosevych , Igor Babin , Roman Riazantsev

RVOS: End-to-End Recurrent Network for Video Object Segmentation

Multiple object video object segmentation is a challenging task, specially for the zero-shot case, when no object mask is given at the initial frame and the model has to find the objects to be segmented along the sequence. In our work, we…

Computer Vision and Pattern Recognition · Computer Science 2019-05-22 Carles Ventura , Miriam Bellver , Andreu Girbau , Amaia Salvador , Ferran Marques , Xavier Giro-i-Nieto

Make One-Shot Video Object Segmentation Efficient Again

Video object segmentation (VOS) describes the task of segmenting a set of objects in each frame of a video. In the semi-supervised setting, the first mask of each object is provided at test time. Following the one-shot principle,…

Computer Vision and Pattern Recognition · Computer Science 2020-12-04 Tim Meinhardt , Laura Leal-Taixe

Look Before You Match: Instance Understanding Matters in Video Object Segmentation

Exploring dense matching between the current frame and past frames for long-range context modeling, memory-based methods have demonstrated impressive results in video object segmentation (VOS) recently. Nevertheless, due to the lack of…

Computer Vision and Pattern Recognition · Computer Science 2022-12-14 Junke Wang , Dongdong Chen , Zuxuan Wu , Chong Luo , Chuanxin Tang , Xiyang Dai , Yucheng Zhao , Yujia Xie , Lu Yuan , Yu-Gang Jiang

Adaptive Memory Management for Video Object Segmentation

Matching-based networks have achieved state-of-the-art performance for video object segmentation (VOS) tasks by storing every-k frames in an external memory bank for future inference. Storing the intermediate frames' predictions provides…

Computer Vision and Pattern Recognition · Computer Science 2022-04-15 Ali Pourganjalikhan , Charalambos Poullis

Robust and Efficient Memory Network for Video Object Segmentation

This paper proposes a Robust and Efficient Memory Network, referred to as REMN, for studying semi-supervised video object segmentation (VOS). Memory-based methods have recently achieved outstanding VOS performance by performing non-local…

Computer Vision and Pattern Recognition · Computer Science 2023-04-25 Yadang Chen , Dingwei Zhang , Zhi-xin Yang , Enhua Wu

Space-time Reinforcement Network for Video Object Segmentation

Recently, video object segmentation (VOS) networks typically use memory-based methods: for each query frame, the mask is predicted by space-time matching to memory frames. Despite these methods having superior performance, they suffer from…

Computer Vision and Pattern Recognition · Computer Science 2024-05-08 Yadang Chen , Wentao Zhu , Zhi-Xin Yang , Enhua Wu

FVOS for MOSE Track of 4th PVUW Challenge: 3rd Place Solution

Video Object Segmentation (VOS) is one of the most fundamental and challenging tasks in computer vision and has a wide range of applications. Most existing methods rely on spatiotemporal memory networks to extract frame-level features and…

Computer Vision and Pattern Recognition · Computer Science 2025-04-15 Mengjiao Wang , Junpei Zhang , Xu Liu , Yuting Yang , Mengru Ma