Related papers: Object Segmentation-Assisted Inter Prediction for …

Uniformly Accelerated Motion Model for Inter Prediction

Inter prediction is a key technology to reduce the temporal redundancy in video coding. In natural videos, there are usually multiple moving objects with variable velocity, resulting in complex motion fields that are difficult to represent…

Image and Video Processing · Electrical Eng. & Systems 2024-07-23 Zhuoyuan Li , Yao Li , Chuanbo Tang , Li Li , Dong Liu , Feng Wu

Weakly Supervised Instance Segmentation for Videos with Temporal Mask Consistency

Weakly supervised instance segmentation reduces the cost of annotations required to train models. However, existing approaches which rely only on image-level class labels predominantly suffer from errors due to (a) partial segmentation of…

Computer Vision and Pattern Recognition · Computer Science 2021-03-25 Qing Liu , Vignesh Ramanathan , Dhruv Mahajan , Alan Yuille , Zhenheng Yang

Video Object Segmentation using Space-Time Memory Networks

We propose a novel solution for semi-supervised video object segmentation. By the nature of the problem, available cues (e.g. video frame(s) with object masks) become richer with the intermediate predictions. However, the existing methods…

Computer Vision and Pattern Recognition · Computer Science 2019-08-13 Seoung Wug Oh , Joon-Young Lee , Ning Xu , Seon Joo Kim

Self-Supervised Video Object Segmentation by Motion-Aware Mask Propagation

We propose a self-supervised spatio-temporal matching method, coined Motion-Aware Mask Propagation (MAMP), for video object segmentation. MAMP leverages the frame reconstruction task for training without the need for annotations. During…

Computer Vision and Pattern Recognition · Computer Science 2021-10-29 Bo Miao , Mohammed Bennamoun , Yongsheng Gao , Ajmal Mian

Video Object Segmentation using Tracked Object Proposals

We present an approach to semi-supervised video object segmentation, in the context of the DAVIS 2017 challenge. Our approach combines category-based object detection, category-independent object appearance segmentation and temporal object…

Computer Vision and Pattern Recognition · Computer Science 2017-07-21 Gilad Sharir , Eddie Smolyansky , Itamar Friedman

Towards Robust Video Object Segmentation with Adaptive Object Calibration

In the booming video era, video segmentation attracts increasing research attention in the multimedia community. Semi-supervised video object segmentation (VOS) aims at segmenting objects in all target frames of a video, given annotated…

Computer Vision and Pattern Recognition · Computer Science 2022-07-05 Xiaohao Xu , Jinglu Wang , Xiang Ming , Yan Lu

Per-Clip Video Object Segmentation

Recently, memory-based approaches show promising results on semi-supervised video object segmentation. These methods predict object masks frame-by-frame with the help of frequently updated memory of the previous mask. Different from this…

Computer Vision and Pattern Recognition · Computer Science 2022-08-04 Kwanyong Park , Sanghyun Woo , Seoung Wug Oh , In So Kweon , Joon-Young Lee

Accelerating Video Object Segmentation with Compressed Video

We propose an efficient plug-and-play acceleration framework for semi-supervised video object segmentation by exploiting the temporal redundancies in videos presented by the compressed bitstream. Specifically, we propose a motion…

Computer Vision and Pattern Recognition · Computer Science 2022-04-07 Kai Xu , Angela Yao

Efficient Motion Modelling with Variable-sized blocks from Hierarchical Cuboidal Partitioning

Motion modelling with block-based architecture has been widely used in video coding where a frame is divided into fixed-sized blocks that are motion compensated independently. This often leads to coding inefficiency as fixed-sized blocks…

Computer Vision and Pattern Recognition · Computer Science 2022-08-30 Priyabrata Karmakar , Manzur Murshed , Manoranjan Paul , David Taubman

Partition Map-Based Fast Block Partitioning for VVC Inter Coding

Among the new techniques of Versatile Video Coding (VVC), the quadtree with nested multi-type tree (QT+MTT) block structure yields significant coding gains by providing more flexible block partitioning patterns. However, the recursive…

Image and Video Processing · Electrical Eng. & Systems 2025-07-16 Xinmin Feng , Zhuoyuan Li , Li Li , Dong Liu , Feng Wu

Video Object Segmentation with Re-identification

Conventional video segmentation methods often rely on temporal continuity to propagate masks. Such an assumption suffers from issues like drifting and inability to handle large displacement. To overcome these issues, we formulate an…

Computer Vision and Pattern Recognition · Computer Science 2017-08-02 Xiaoxiao Li , Yuankai Qi , Zhe Wang , Kai Chen , Ziwei Liu , Jianping Shi , Ping Luo , Xiaoou Tang , Chen Change Loy

Object Propagation via Inter-Frame Attentions for Temporally Stable Video Instance Segmentation

Video instance segmentation aims to detect, segment, and track objects in a video. Current approaches extend image-level segmentation algorithms to the temporal domain. However, this results in temporally inconsistent masks. In this work,…

Computer Vision and Pattern Recognition · Computer Science 2021-12-15 Anirudh S Chakravarthy , Won-Dong Jang , Zudi Lin , Donglai Wei , Song Bai , Hanspeter Pfister

Self-supervised Amodal Video Object Segmentation

Amodal perception requires inferring the full shape of an object that is partially occluded. This task is particularly challenging on two levels: (1) it requires more information than what is contained in the instant retina or imaging…

Computer Vision and Pattern Recognition · Computer Science 2022-10-25 Jian Yao , Yuxin Hong , Chiyu Wang , Tianjun Xiao , Tong He , Francesco Locatello , David Wipf , Yanwei Fu , Zheng Zhang

Self-Supervised Learning of Perceptually Optimized Block Motion Estimates for Video Compression

Block based motion estimation is integral to inter prediction processes performed in hybrid video codecs. Prevalent block matching based methods that are used to compute block motion vectors (MVs) rely on computationally intensive search…

Image and Video Processing · Electrical Eng. & Systems 2022-12-06 Somdyuti Paul , Andrey Norkin , Alan C. Bovik

Semantically-Guided Video Object Segmentation

This paper tackles the problem of semi-supervised video object segmentation, that is, segmenting an object in a sequence given its mask in the first frame. One of the main challenges in this scenario is the change of appearance of the…

Computer Vision and Pattern Recognition · Computer Science 2018-07-19 Sergi Caelles , Yuhua Chen , Jordi Pont-Tuset , Luc Van Gool

VoCap: Video Object Captioning and Segmentation from Any Prompt

Understanding objects in videos in terms of fine-grained localization masks and detailed semantic properties is a fundamental task in video understanding. In this paper, we propose VoCap, a flexible video model that consumes a video and a…

Computer Vision and Pattern Recognition · Computer Science 2025-09-01 Jasper Uijlings , Xingyi Zhou , Xiuye Gu , Arsha Nagrani , Anurag Arnab , Alireza Fathi , David Ross , Cordelia Schmid

OVSNet : Towards One-Pass Real-Time Video Object Segmentation

Video object segmentation aims at accurately segmenting the target object regions across consecutive frames. It is technically challenging for coping with complicated factors (e.g., shape deformations, occlusion and out of the lens). Recent…

Computer Vision and Pattern Recognition · Computer Science 2019-07-03 Peng Sun , Peiwen Lin , Guangliang Cheng , Jianping Shi , Jiawan Zhang , Xi Li

Space-time Reinforcement Network for Video Object Segmentation

Recently, video object segmentation (VOS) networks typically use memory-based methods: for each query frame, the mask is predicted by space-time matching to memory frames. Despite these methods having superior performance, they suffer from…

Computer Vision and Pattern Recognition · Computer Science 2024-05-08 Yadang Chen , Wentao Zhu , Zhi-Xin Yang , Enhua Wu

SSVIF: Self-Supervised Segmentation-Oriented Visible and Infrared Image Fusion

Visible and infrared image fusion (VIF) has gained significant attention in recent years due to its wide application in tasks such as scene segmentation and object detection. VIF methods can be broadly classified into traditional VIF…

Computer Vision and Pattern Recognition · Computer Science 2025-09-29 Zixian Zhao , Xingchen Zhang

Boosting Video Object Segmentation based on Scale Inconsistency

We present a refinement framework to boost the performance of pre-trained semi-supervised video object segmentation (VOS) models. Our work is based on scale inconsistency, which is motivated by the observation that existing VOS models…

Computer Vision and Pattern Recognition · Computer Science 2022-05-04 Hengyi Wang , Changjae Oh