Related papers: Kernelized Memory Network for Video Object Segment…

Robust and Efficient Memory Network for Video Object Segmentation

This paper proposes a Robust and Efficient Memory Network, referred to as REMN, for studying semi-supervised video object segmentation (VOS). Memory-based methods have recently achieved outstanding VOS performance by performing non-local…

Computer Vision and Pattern Recognition · Computer Science 2023-04-25 Yadang Chen , Dingwei Zhang , Zhi-xin Yang , Enhua Wu

Dual Temporal Memory Network for Efficient Video Object Segmentation

Video Object Segmentation (VOS) is typically formulated in a semi-supervised setting. Given the ground-truth segmentation mask on the first frame, the task of VOS is to track and segment the single or multiple objects of interests in the…

Computer Vision and Pattern Recognition · Computer Science 2020-03-16 Kaihua Zhang , Long Wang , Dong Liu , Bo Liu , Qingshan Liu , Zhu Li

Video Object Segmentation using Space-Time Memory Networks

We propose a novel solution for semi-supervised video object segmentation. By the nature of the problem, available cues (e.g. video frame(s) with object masks) become richer with the intermediate predictions. However, the existing methods…

Computer Vision and Pattern Recognition · Computer Science 2019-08-13 Seoung Wug Oh , Joon-Young Lee , Ning Xu , Seon Joo Kim

TrickVOS: A Bag of Tricks for Video Object Segmentation

Space-time memory (STM) network methods have been dominant in semi-supervised video object segmentation (SVOS) due to their remarkable performance. In this work, we identify three key aspects where we can improve such methods; i)…

Computer Vision and Pattern Recognition · Computer Science 2023-06-29 Evangelos Skartados , Konstantinos Georgiadis , Mehmet Kerim Yucel , Koskinas Ioannis , Armando Domi , Anastasios Drosou , Bruno Manganelli , Albert Saa-Garriga

Space-time Reinforcement Network for Video Object Segmentation

Recently, video object segmentation (VOS) networks typically use memory-based methods: for each query frame, the mask is predicted by space-time matching to memory frames. Despite these methods having superior performance, they suffer from…

Computer Vision and Pattern Recognition · Computer Science 2024-05-08 Yadang Chen , Wentao Zhu , Zhi-Xin Yang , Enhua Wu

Hierarchical Memory Matching Network for Video Object Segmentation

We present Hierarchical Memory Matching Network (HMMN) for semi-supervised video object segmentation. Based on a recent memory-based method [33], we propose two advanced memory read modules that enable us to perform memory reading in…

Computer Vision and Pattern Recognition · Computer Science 2021-09-24 Hongje Seong , Seoung Wug Oh , Joon-Young Lee , Seongwon Lee , Suhyeon Lee , Euntai Kim

Learning Position and Target Consistency for Memory-based Video Object Segmentation

This paper studies the problem of semi-supervised video object segmentation(VOS). Multiple works have shown that memory-based approaches can be effective for video object segmentation. They are mostly based on pixel-level matching, both…

Computer Vision and Pattern Recognition · Computer Science 2021-04-12 Li Hu , Peng Zhang , Bang Zhang , Pan Pan , Yinghui Xu , Rong Jin

MUNet: Motion Uncertainty-aware Semi-supervised Video Object Segmentation

The task of semi-supervised video object segmentation (VOS) has been greatly advanced and state-of-the-art performance has been made by dense matching-based methods. The recent methods leverage space-time memory (STM) networks and learn to…

Computer Vision and Pattern Recognition · Computer Science 2021-11-30 Jiadai Sun , Yuxin Mao , Yuchao Dai , Yiran Zhong , Jianyuan Wang

Revisiting Sequence-to-Sequence Video Object Segmentation with Multi-Task Loss and Skip-Memory

Video Object Segmentation (VOS) is an active research area of the visual domain. One of its fundamental sub-tasks is semi-supervised / one-shot learning: given only the segmentation mask for the first frame, the task is to provide…

Computer Vision and Pattern Recognition · Computer Science 2020-04-28 Fatemeh Azimi , Benjamin Bischke , Sebastian Palacio , Federico Raue , Joern Hees , Andreas Dengel

Efficient Regional Memory Network for Video Object Segmentation

Recently, several Space-Time Memory based networks have shown that the object cues (e.g. video frames as well as the segmented object masks) from the past frames are useful for segmenting objects in the current frame. However, these methods…

Computer Vision and Pattern Recognition · Computer Science 2021-04-29 Haozhe Xie , Hongxun Yao , Shangchen Zhou , Shengping Zhang , Wenxiu Sun

Fast Video Object Segmentation via Mask Transfer Network

Accuracy and processing speed are two important factors that affect the use of video object segmentation (VOS) in real applications. With the advanced techniques of deep neural networks, the accuracy has been significantly improved,…

Computer Vision and Pattern Recognition · Computer Science 2019-08-29 Tao Zhuo , Zhiyong Cheng , Mohan Kankanhalli

FAMINet: Learning Real-time Semi-supervised Video Object Segmentation with Steepest Optimized Optical Flow

Semi-supervised video object segmentation (VOS) aims to segment a few moving objects in a video sequence, where these objects are specified by annotation of first frame. The optical flow has been considered in many existing semi-supervised…

Computer Vision and Pattern Recognition · Computer Science 2021-11-23 Ziyang Liu , Jingmeng Liu , Weihai Chen , Xingming Wu , Zhengguo Li

Learning Fast and Robust Target Models for Video Object Segmentation

Video object segmentation (VOS) is a highly challenging problem since the initial mask, defining the target object, is only given at test-time. The main difficulty is to effectively handle appearance changes and similar background objects,…

Computer Vision and Pattern Recognition · Computer Science 2020-04-01 Andreas Robinson , Felix Järemo Lawin , Martin Danelljan , Fahad Shahbaz Khan , Michael Felsberg

LiVOS: Light Video Object Segmentation with Gated Linear Matching

Semi-supervised video object segmentation (VOS) has been largely driven by space-time memory (STM) networks, which store past frame features in a spatiotemporal memory to segment the current frame via softmax attention. However, STM…

Computer Vision and Pattern Recognition · Computer Science 2024-11-06 Qin Liu , Jianfeng Wang , Zhengyuan Yang , Linjie Li , Kevin Lin , Marc Niethammer , Lijuan Wang

Make One-Shot Video Object Segmentation Efficient Again

Video object segmentation (VOS) describes the task of segmenting a set of objects in each frame of a video. In the semi-supervised setting, the first mask of each object is provided at test time. Following the one-shot principle,…

Computer Vision and Pattern Recognition · Computer Science 2020-12-04 Tim Meinhardt , Laura Leal-Taixe

FlowVOS: Weakly-Supervised Visual Warping for Detail-Preserving and Temporally Consistent Single-Shot Video Object Segmentation

We consider the task of semi-supervised video object segmentation (VOS). Our approach mitigates shortcomings in previous VOS work by addressing detail preservation and temporal consistency using visual warping. In contrast to prior work…

Computer Vision and Pattern Recognition · Computer Science 2021-11-23 Julia Gong , F. Christopher Holsinger , Serena Yeung

Learning What to Learn for Video Object Segmentation

Video object segmentation (VOS) is a highly challenging problem, since the target object is only defined during inference with a given first-frame reference mask. The problem of how to capture and utilize this limited target information…

Computer Vision and Pattern Recognition · Computer Science 2020-05-04 Goutam Bhat , Felix Järemo Lawin , Martin Danelljan , Andreas Robinson , Michael Felsberg , Luc Van Gool , Radu Timofte

HQ-SMem: Video Segmentation and Tracking Using Memory Efficient Object Embedding With Selective Update and Self-Supervised Distillation Feedback

Video Object Segmentation (VOS) is foundational to numerous computer vision applications, including surveillance, autonomous driving, robotics and generative video editing. However, existing VOS models often struggle with precise mask…

Computer Vision and Pattern Recognition · Computer Science 2025-07-28 Elham Soltani Kazemi , Imad Eddine Toubal , Gani Rahmon , Jaired Collins , K. Palaniappan

PMVOS: Pixel-Level Matching-Based Video Object Segmentation

Semi-supervised video object segmentation (VOS) aims to segment arbitrary target objects in video when the ground truth segmentation mask of the initial frame is provided. Due to this limitation of using prior knowledge about the target…

Computer Vision and Pattern Recognition · Computer Science 2020-09-21 Suhwan Cho , Heansung Lee , Sungmin Woo , Sungjun Jang , Sangyoun Lee

Adaptive Memory Management for Video Object Segmentation

Matching-based networks have achieved state-of-the-art performance for video object segmentation (VOS) tasks by storing every-k frames in an external memory bank for future inference. Storing the intermediate frames' predictions provides…

Computer Vision and Pattern Recognition · Computer Science 2022-04-15 Ali Pourganjalikhan , Charalambos Poullis