Related papers: CompFeat: Comprehensive Feature Aggregation for Vi…

Practical Video Object Detection via Feature Selection and Aggregation

Compared with still image object detection, video object detection (VOD) needs to particularly concern the high across-frame variation in object appearance, and the diverse deterioration in some frames. In principle, the detection in a…

Computer Vision and Pattern Recognition · Computer Science 2024-07-30 Yuheng Shi , Tong Zhang , Xiaojie Guo

Flow-Guided Feature Aggregation for Video Object Detection

Extending state-of-the-art object detectors from image to video is challenging. The accuracy of detection suffers from degenerated object appearances in videos, e.g., motion blur, video defocus, rare poses, etc. Existing work attempts to…

Computer Vision and Pattern Recognition · Computer Science 2017-08-21 Xizhou Zhu , Yujie Wang , Jifeng Dai , Lu Yuan , Yichen Wei

Flow-free Video Object Segmentation

Segmenting foreground object from a video is a challenging task because of the large deformations of the objects, occlusions, and background clutter. In this paper, we propose a frame-by-frame but computationally efficient approach for…

Computer Vision and Pattern Recognition · Computer Science 2017-06-30 Aditya Vora , Shanmuganathan Raman

Object Segmentation with Audio Context

Visual objects often have acoustic signatures that are naturally synchronized with them in audio-bearing video recordings. For this project, we explore the multimodal feature aggregation for video instance segmentation task, in which we…

Computer Vision and Pattern Recognition · Computer Science 2023-01-26 Kaihui Zheng , Yuqing Ren , Zixin Shen , Tianxu Qin

Superframes, A Temporal Video Segmentation

The goal of video segmentation is to turn video data into a set of concrete motion clusters that can be easily interpreted as building blocks of the video. There are some works on similar topics like detecting scene cuts in a video, but…

Computer Vision and Pattern Recognition · Computer Science 2019-03-07 Hajar Sadeghi Sokeh , Vasileios Argyriou , Dorothy Monekosso , Paolo Remagnino

Object Detection in Video with Spatial-temporal Context Aggregation

Recent cutting-edge feature aggregation paradigms for video object detection rely on inferring feature correspondence. The feature correspondence estimation problem is fundamentally difficult due to poor image quality, motion blur, etc, and…

Computer Vision and Pattern Recognition · Computer Science 2019-07-12 Hao Luo , Lichao Huang , Han Shen , Yuan Li , Chang Huang , Xinggang Wang

CAVIS: Context-Aware Video Instance Segmentation

In this paper, we introduce the Context-Aware Video Instance Segmentation (CAVIS), a novel framework designed to enhance instance association by integrating contextual information adjacent to each object. To efficiently extract and leverage…

Computer Vision and Pattern Recognition · Computer Science 2025-07-10 Seunghun Lee , Jiwan Seo , Kiljoon Han , Minwoo Choi , Sunghoon Im

STEm-Seg: Spatio-temporal Embeddings for Instance Segmentation in Videos

Existing methods for instance segmentation in videos typically involve multi-stage pipelines that follow the tracking-by-detection paradigm and model a video clip as a sequence of images. Multiple networks are used to detect objects in…

Computer Vision and Pattern Recognition · Computer Science 2023-09-04 Ali Athar , Sabarinath Mahadevan , Aljoša Ošep , Laura Leal-Taixé , Bastian Leibe

Learning Video Object Segmentation from Static Images

Inspired by recent advances of deep learning in instance segmentation and object tracking, we introduce video object segmentation problem as a concept of guided instance segmentation. Our model proceeds on a per-frame basis, guided by the…

Computer Vision and Pattern Recognition · Computer Science 2019-02-05 Anna Khoreva , Federico Perazzi , Rodrigo Benenson , Bernt Schiele , Alexander Sorkine-Hornung

Appearance-Based Refinement for Object-Centric Motion Segmentation

The goal of this paper is to discover, segment, and track independently moving objects in complex visual scenes. Previous approaches have explored the use of optical flow for motion segmentation, leading to imperfect predictions due to…

Computer Vision and Pattern Recognition · Computer Science 2024-08-20 Junyu Xie , Weidi Xie , Andrew Zisserman

Real-Time and Accurate Object Detection in Compressed Video by Long Short-term Feature Aggregation

Video object detection is a fundamental problem in computer vision and has a wide spectrum of applications. Based on deep networks, video object detection is actively studied for pushing the limits of detection speed and accuracy. To reduce…

Computer Vision and Pattern Recognition · Computer Science 2021-03-29 Xinggang Wang , Zhaojin Huang , Bencheng Liao , Lichao Huang , Yongchao Gong , Chang Huang

Video Instance Segmentation with a Propose-Reduce Paradigm

Video instance segmentation (VIS) aims to segment and associate all instances of predefined classes for each frame in videos. Prior methods usually obtain segmentation for a frame or clip first, and merge the incomplete results by tracking…

Computer Vision and Pattern Recognition · Computer Science 2021-10-01 Huaijia Lin , Ruizheng Wu , Shu Liu , Jiangbo Lu , Jiaya Jia

Consistent Video Instance Segmentation with Inter-Frame Recurrent Attention

Video instance segmentation aims at predicting object segmentation masks for each frame, as well as associating the instances across multiple frames. Recent end-to-end video instance segmentation methods are capable of performing object…

Computer Vision and Pattern Recognition · Computer Science 2022-06-15 Quanzeng You , Jiang Wang , Peng Chu , Andre Abrantes , Zicheng Liu

Feature Aggregation Network for Video Face Recognition

This paper aims to learn a compact representation of a video for video face recognition task. We make the following contributions: first, we propose a meta attention-based aggregation scheme which adaptively and fine-grained weighs the…

Computer Vision and Pattern Recognition · Computer Science 2019-09-13 Zhaoxiang Liu , Huan Hu , Jinqiang Bai , Shaohua Li , Shiguo Lian

Deep Learning Techniques for Video Instance Segmentation: A Survey

Video instance segmentation, also known as multi-object tracking and segmentation, is an emerging computer vision research area introduced in 2019, aiming at detecting, segmenting, and tracking instances in videos simultaneously. By…

Computer Vision and Pattern Recognition · Computer Science 2023-10-20 Chenhao Xu , Chang-Tsun Li , Yongjian Hu , Chee Peng Lim , Douglas Creighton

DFA: Dynamic Feature Aggregation for Efficient Video Object Detection

Video object detection is a fundamental yet challenging task in computer vision. One practical solution is to take advantage of temporal information from the video and apply feature aggregation to enhance the object features in each frame.…

Computer Vision and Pattern Recognition · Computer Science 2022-10-04 Yiming Cui

Sequence Level Semantics Aggregation for Video Object Detection

Video objection detection (VID) has been a rising research direction in recent years. A central issue of VID is the appearance degradation of video frames caused by fast motion. This problem is essentially ill-posed for a single frame.…

Computer Vision and Pattern Recognition · Computer Science 2019-08-21 Haiping Wu , Yuntao Chen , Naiyan Wang , Zhaoxiang Zhang

SSSC-AM: A Unified Framework for Video Co-Segmentation by Structured Sparse Subspace Clustering with Appearance and Motion Features

Video co-segmentation refers to the task of jointly segmenting common objects appearing in a given group of videos. In practice, high-dimensional data such as videos can be conceptually thought as being drawn from a union of subspaces…

Computer Vision and Pattern Recognition · Computer Science 2021-04-29 Junlin Yao , Frank Nielsen

CML-MOTS: Collaborative Multi-task Learning for Multi-Object Tracking and Segmentation

The advancement of computer vision has pushed visual analysis tasks from still images to the video domain. In recent years, video instance segmentation, which aims to track and segment multiple objects in video frames, has drawn much…

Computer Vision and Pattern Recognition · Computer Science 2023-11-03 Yiming Cui , Cheng Han , Dongfang Liu

Tag-Based Attention Guided Bottom-Up Approach for Video Instance Segmentation

Video Instance Segmentation is a fundamental computer vision task that deals with segmenting and tracking object instances across a video sequence. Most existing methods typically accomplish this task by employing a multi-stage top-down…

Computer Vision and Pattern Recognition · Computer Science 2022-04-25 Jyoti Kini , Mubarak Shah