Related papers: Self-supervised Amodal Video Object Segmentation

A2VIS: Amodal-Aware Approach to Video Instance Segmentation

Handling occlusion remains a significant challenge for video instance-level tasks like Multiple Object Tracking (MOT) and Video Instance Segmentation (VIS). In this paper, we propose a novel framework, Amodal-Aware Video Instance…

Computer Vision and Pattern Recognition · Computer Science 2025-04-11 Minh Tran , Thang Pham , Winston Bounsavy , Tri Nguyen , Ngan Le

One-Shot Video Object Segmentation

This paper tackles the task of semi-supervised video object segmentation, i.e., the separation of an object from the background in a video, given the mask of the first frame. We present One-Shot Video Object Segmentation (OSVOS), based on a…

Computer Vision and Pattern Recognition · Computer Science 2017-04-14 Sergi Caelles , Kevis-Kokitsi Maninis , Jordi Pont-Tuset , Laura Leal-Taixé , Daniel Cremers , Luc Van Gool

Rethinking Amodal Video Segmentation from Learning Supervised Signals with Object-centric Representation

Video amodal segmentation is a particularly challenging task in computer vision, which requires to deduce the full shape of an object from the visible parts of it. Recently, some studies have achieved promising performance by using motion…

Computer Vision and Pattern Recognition · Computer Science 2023-09-26 Ke Fan , Jingshi Lei , Xuelin Qian , Miaopeng Yu , Tianjun Xiao , Tong He , Zheng Zhang , Yanwei Fu

Segment Anything, Even Occluded

Amodal instance segmentation, which aims to detect and segment both visible and invisible parts of objects in images, plays a crucial role in various applications including autonomous driving, robotic manipulation, and scene understanding.…

Computer Vision and Pattern Recognition · Computer Science 2025-03-11 Wei-En Tai , Yu-Lin Shih , Cheng Sun , Yu-Chiang Frank Wang , Hwann-Tzong Chen

Learning Spatial-Semantic Features for Robust Video Object Segmentation

Tracking and segmenting multiple similar objects with distinct or complex parts in long-term videos is particularly challenging due to the ambiguity in identifying target components and the confusion caused by occlusion, background clutter,…

Computer Vision and Pattern Recognition · Computer Science 2025-04-08 Xin Li , Deshui Miao , Zhenyu He , Yaowei Wang , Huchuan Lu , Ming-Hsuan Yang

Amodal SAM: A Unified Amodal Segmentation Framework with Generalization

Amodal segmentation is a challenging task that aims to predict the complete geometric shape of objects, including their occluded regions. Although existing methods primarily focus on amodal segmentation within the training domain, these…

Computer Vision and Pattern Recognition · Computer Science 2026-04-23 Bo Zhang , Zhuotao Tian , Xin Tao , Songlin Tang , Jun Yu , Wenjie Pei

Amodal Segmentation Based on Visible Region Segmentation and Shape Prior

Almost all existing amodal segmentation methods make the inferences of occluded regions by using features corresponding to the whole image. This is against the human's amodal perception, where human uses the visible part and the shape prior…

Computer Vision and Pattern Recognition · Computer Science 2020-12-22 Yuting Xiao , Yanyu Xu , Ziming Zhong , Weixin Luo , Jiawei Li , Shenghua Gao

Using Diffusion Priors for Video Amodal Segmentation

Object permanence in humans is a fundamental cue that helps in understanding persistence of objects, even when they are fully occluded in the scene. Present day methods in object segmentation do not account for this amodal nature of the…

Computer Vision and Pattern Recognition · Computer Science 2024-12-09 Kaihua Chen , Deva Ramanan , Tarasha Khurana

Online Adaptation of Convolutional Neural Networks for Video Object Segmentation

We tackle the task of semi-supervised video object segmentation, i.e. segmenting the pixels belonging to an object in the video using the ground truth pixel mask for the first frame. We build on the recently introduced one-shot video object…

Computer Vision and Pattern Recognition · Computer Science 2017-08-02 Paul Voigtlaender , Bastian Leibe

FlowVOS: Weakly-Supervised Visual Warping for Detail-Preserving and Temporally Consistent Single-Shot Video Object Segmentation

We consider the task of semi-supervised video object segmentation (VOS). Our approach mitigates shortcomings in previous VOS work by addressing detail preservation and temporal consistency using visual warping. In contrast to prior work…

Computer Vision and Pattern Recognition · Computer Science 2021-11-23 Julia Gong , F. Christopher Holsinger , Serena Yeung

Video Object Segmentation Without Temporal Information

Video Object Segmentation, and video processing in general, has been historically dominated by methods that rely on the temporal consistency and redundancy in consecutive video frames. When the temporal smoothness is suddenly broken, such…

Computer Vision and Pattern Recognition · Computer Science 2018-05-17 Kevis-Kokitsi Maninis , Sergi Caelles , Yuhua Chen , Jordi Pont-Tuset , Laura Leal-Taixé , Daniel Cremers , Luc Van Gool

Make One-Shot Video Object Segmentation Efficient Again

Video object segmentation (VOS) describes the task of segmenting a set of objects in each frame of a video. In the semi-supervised setting, the first mask of each object is provided at test time. Following the one-shot principle,…

Computer Vision and Pattern Recognition · Computer Science 2020-12-04 Tim Meinhardt , Laura Leal-Taixe

Sequential Amodal Segmentation via Cumulative Occlusion Learning

To fully understand the 3D context of a single image, a visual system must be able to segment both the visible and occluded regions of objects, while discerning their occlusion order. Ideally, the system should be able to handle any object…

Computer Vision and Pattern Recognition · Computer Science 2024-05-10 Jiayang Ao , Qiuhong Ke , Krista A. Ehinger

Towards Robust Video Object Segmentation with Adaptive Object Calibration

In the booming video era, video segmentation attracts increasing research attention in the multimedia community. Semi-supervised video object segmentation (VOS) aims at segmenting objects in all target frames of a video, given annotated…

Computer Vision and Pattern Recognition · Computer Science 2022-07-05 Xiaohao Xu , Jinglu Wang , Xiang Ming , Yan Lu

Self-supervised Video Object Segmentation

The objective of this paper is self-supervised representation learning, with the goal of solving semi-supervised video object segmentation (a.k.a. dense tracking). We make the following contributions: (i) we propose to improve the existing…

Computer Vision and Pattern Recognition · Computer Science 2020-06-23 Fangrui Zhu , Li Zhang , Yanwei Fu , Guodong Guo , Weidi Xie

Video Object Segmentation using Space-Time Memory Networks

We propose a novel solution for semi-supervised video object segmentation. By the nature of the problem, available cues (e.g. video frame(s) with object masks) become richer with the intermediate predictions. However, the existing methods…

Computer Vision and Pattern Recognition · Computer Science 2019-08-13 Seoung Wug Oh , Joon-Young Lee , Ning Xu , Seon Joo Kim

Video Object Segmentation using Tracked Object Proposals

We present an approach to semi-supervised video object segmentation, in the context of the DAVIS 2017 challenge. Our approach combines category-based object detection, category-independent object appearance segmentation and temporal object…

Computer Vision and Pattern Recognition · Computer Science 2017-07-21 Gilad Sharir , Eddie Smolyansky , Itamar Friedman

Region Aware Video Object Segmentation with Deep Motion Modeling

Current semi-supervised video object segmentation (VOS) methods usually leverage the entire features of one frame to predict object masks and update memory. This introduces significant redundant computations. To reduce redundancy, we…

Computer Vision and Pattern Recognition · Computer Science 2022-07-22 Bo Miao , Mohammed Bennamoun , Yongsheng Gao , Ajmal Mian

A Semi-Self-Supervised Approach for Dense-Pattern Video Object Segmentation

Video object segmentation (VOS) -- predicting pixel-level regions for objects within each frame of a video -- is particularly challenging in agricultural scenarios, where videos of crops include hundreds of small, dense, and occluded…

Computer Vision and Pattern Recognition · Computer Science 2025-04-17 Keyhan Najafian , Farhad Maleki , Lingling Jin , Ian Stavness

Surface-SOS: Self-Supervised Object Segmentation via Neural Surface Representation

Self-supervised Object Segmentation (SOS) aims to segment objects without any annotations. Under conditions of multi-camera inputs, the structural, textural and geometrical consistency among each view can be leveraged to achieve…

Computer Vision and Pattern Recognition · Computer Science 2025-01-20 Xiaoyun Zheng , Liwei Liao , Jianbo Jiao , Feng Gao , Ronggang Wang