Related papers: CAVIS: Context-Aware Video Instance Segmentation

STC: Spatio-Temporal Contrastive Learning for Video Instance Segmentation

Video Instance Segmentation (VIS) is a task that simultaneously requires classification, segmentation, and instance association in a video. Recent VIS approaches rely on sophisticated pipelines to achieve this goal, including RoI-related…

Computer Vision and Pattern Recognition · Computer Science · 2022-08-23 · Zhengkai Jiang, Zhangxuan Gu, Jinlong Peng, Hang Zhou, Liang Liu, Yabiao Wang, Ying Tai, Chengjie Wang, Liqing Zhang

Online Video Instance Segmentation via Robust Context Fusion

Video instance segmentation (VIS) aims at classifying, segmenting and tracking object instances in video sequences. Recent transformer-based neural networks have demonstrated their powerful capability of modeling spatio-temporal…

Computer Vision and Pattern Recognition · Computer Science · 2022-07-13 · Xiang Li, Jinglu Wang, Xiaohao Xu, Bhiksha Raj, Yan Lu

Crossover Learning for Fast Online Video Instance Segmentation

Modeling temporal visual context across frames is critical for video instance segmentation (VIS) and other video understanding tasks. In this paper, we propose a fast online VIS model named CrossVIS. For temporal information modeling in…

Computer Vision and Pattern Recognition · Computer Science · 2021-04-14 · Shusheng Yang, Yuxin Fang, Xinggang Wang, Yu Li, Chen Fang, Ying Shan, Bin Feng, Wenyu Liu

SeaVIS: Sound-Enhanced Association for Online Audio-Visual Instance Segmentation

Recently, an audio-visual instance segmentation (AVIS) task has been introduced, aiming to identify, segment and track individual sounding instances in videos. However, prevailing methods primarily adopt the offline paradigm, that cannot…

Computer Vision and Pattern Recognition · Computer Science · 2026-03-03 · Yingjian Zhu, Ying Wang, Yuyang Hong, Ruohao Guo, Kun Ding, Xin Gu, Bin Fan, Shiming Xiang

Consistent Video Instance Segmentation with Inter-Frame Recurrent Attention

Video instance segmentation aims at predicting object segmentation masks for each frame, as well as associating the instances across multiple frames. Recent end-to-end video instance segmentation methods are capable of performing object…

Computer Vision and Pattern Recognition · Computer Science · 2022-06-15 · Quanzeng You, Jiang Wang, Peng Chu, Andre Abrantes, Zicheng Liu

Efficient Video Instance Segmentation via Tracklet Query and Proposal

Video Instance Segmentation (VIS) aims to simultaneously classify, segment, and track multiple object instances in videos. Recent clip-level VIS takes a short video clip as input each time showing stronger performance than frame-level VIS…

Computer Vision and Pattern Recognition · Computer Science · 2022-03-04 · Jialian Wu, Sudhir Yarram, Hui Liang, Tian Lan, Junsong Yuan, Jayan Eledath, Gerard Medioni

VISAGE: Video Instance Segmentation with Appearance-Guided Enhancement

In recent years, online Video Instance Segmentation (VIS) methods have shown remarkable advancement with their powerful query-based detectors. Utilizing the output queries of the detector at the frame-level, these methods achieve high…

Computer Vision and Pattern Recognition · Computer Science · 2024-03-11 · Hanjung Kim, Jaehyun Kang, Miran Heo, Sukjun Hwang, Seoung Wug Oh, Seon Joo Kim

MinVIS: A Minimal Video Instance Segmentation Framework without Video-based Training

We propose MinVIS, a minimal video instance segmentation (VIS) framework that achieves state-of-the-art VIS performance with neither video-based architectures nor training procedures. By only training a query-based image instance…

Computer Vision and Pattern Recognition · Computer Science · 2022-08-04 · De-An Huang, Zhiding Yu, Anima Anandkumar

Contextual Guided Segmentation Framework for Semi-supervised Video Instance Segmentation

In this paper, we propose Contextual Guided Segmentation (CGS) framework for video instance segmentation in three passes. In the first pass, i.e., preview segmentation, we propose Instance Re-Identification Flow to estimate main properties…

Computer Vision and Pattern Recognition · Computer Science · 2022-04-12 · Trung-Nghia Le, Tam V. Nguyen, Minh-Triet Tran

Look Before You Match: Instance Understanding Matters in Video Object Segmentation

Exploring dense matching between the current frame and past frames for long-range context modeling, memory-based methods have demonstrated impressive results in video object segmentation (VOS) recently. Nevertheless, due to the lack of…

Computer Vision and Pattern Recognition · Computer Science · 2022-12-14 · Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Chuanxin Tang, Xiyang Dai, Yucheng Zhao, Yujia Xie, Lu Yuan, Yu-Gang Jiang

CTVIS: Consistent Training for Online Video Instance Segmentation

The discrimination of instance embeddings plays a vital role in associating instances across time for online video instance segmentation (VIS). Instance embedding learning is directly supervised by the contrastive loss computed upon the…

Computer Vision and Pattern Recognition · Computer Science · 2023-07-25 · Kaining Ying, Qing Zhong, Weian Mao, Zhenhua Wang, Hao Chen, Lin Yuanbo Wu, Yifan Liu, Chengxiang Fan, Yunzhi Zhuge, Chunhua Shen

Occluded Video Instance Segmentation: A Benchmark

Can our video understanding systems perceive objects when a heavy occlusion exists in a scene? To answer this question, we collect a large-scale dataset called OVIS for occluded video instance segmentation, that is, to simultaneously…

Computer Vision and Pattern Recognition · Computer Science · 2022-05-18 · Jiyang Qi, Yan Gao, Yao Hu, Xinggang Wang, Xiaoyu Liu, Xiang Bai, Serge Belongie, Alan Yuille, Philip H. S. Torr, Song Bai

CompFeat: Comprehensive Feature Aggregation for Video Instance Segmentation

Video instance segmentation is a complex task in which we need to detect, segment, and track each object for any given video. Previous approaches only utilize single-frame features for the detection, segmentation, and tracking of objects…

Computer Vision and Pattern Recognition · Computer Science · 2020-12-08 · Yang Fu, Linjie Yang, Ding Liu, Thomas S. Huang, Humphrey Shi

DeVIS: Making Deformable Transformers Work for Video Instance Segmentation

Video Instance Segmentation (VIS) jointly tackles multi-object detection, tracking, and segmentation in video sequences. In the past, VIS methods mirrored the fragmentation of these subtasks in their architectural design, hence missing out…

Computer Vision and Pattern Recognition · Computer Science · 2022-07-25 · Adrià Caelles, Tim Meinhardt, Guillem Brasó, Laura Leal-Taixé

Solve the Puzzle of Instance Segmentation in Videos: A Weakly Supervised Framework with Spatio-Temporal Collaboration

Instance segmentation in videos, which aims to segment and track multiple objects in video frames, has garnered a flurry of research attention in recent years. In this paper, we present a novel weakly supervised framework with…

Computer Vision and Pattern Recognition · Computer Science · 2022-12-16 · Liqi Yan, Qifan Wang, Siqi Ma, Jingang Wang, Changbin Yu

CAiD: Context-Aware Instance Discrimination for Self-supervised Learning in Medical Imaging

Recently, self-supervised instance discrimination methods have achieved significant success in learning visual representations from unlabeled photographic images. However, given the marked differences between photographic and medical…

Image and Video Processing · Electrical Eng. & Systems · 2022-04-18 · Mohammad Reza Hosseinzadeh Taher, Fatemeh Haghighi, Michael B. Gotway, Jianming Liang

UVIS: Unsupervised Video Instance Segmentation

Video instance segmentation requires classifying, segmenting, and tracking every object across video frames. Unlike existing approaches that rely on masks, boxes, or category labels, we propose UVIS, a novel Unsupervised Video Instance…

Computer Vision and Pattern Recognition · Computer Science · 2024-06-12 · Shuaiyi Huang, Saksham Suri, Kamal Gupta, Sai Saketh Rambhatla, Ser-nam Lim, Abhinav Shrivastava

Target-Aware Object Discovery and Association for Unsupervised Video Multi-Object Segmentation

This paper addresses the task of unsupervised video multi-object segmentation. Current approaches follow a two-stage paradigm: 1) detect object proposals using pre-trained Mask R-CNN, and 2) conduct generic feature matching for temporal…

Computer Vision and Pattern Recognition · Computer Science · 2021-04-13 · Tianfei Zhou, Jianwu Li, Xueyi Li, Ling Shao

Human Instance Segmentation and Tracking via Data Association and Single-stage Detector

Human video instance segmentation plays an important role in computer understanding of human activities and is widely used in video processing, video surveillance, and human modeling in virtual reality. Most current VIS methods are based on…

Computer Vision and Pattern Recognition · Computer Science · 2022-04-01 · Lu Cheng, Mingbo Zhao

Context-aware Visual Tracking with Joint Meta-updating

Visual object tracking acts as a pivotal component in various emerging video applications. Despite the numerous developments in visual tracking, existing deep trackers are still likely to fail when tracking against objects with dramatic…

Computer Vision and Pattern Recognition · Computer Science · 2022-04-05 · Qiuhong Shen, Xin Li, Fanyang Meng, Yongsheng Liang