English
Related papers

Related papers: Switchable Temporal Propagation Network

200 papers

We propose a technique that propagates information forward through video data. The method is conceptually simple and can be applied to tasks that require the propagation of structured information, such as semantic labels, based on video…

Computer Vision and Pattern Recognition · Computer Science 2017-04-12 Varun Jampani , Raghudeep Gadde , Peter V. Gehler

Traditional approaches for color propagation in videos rely on some form of matching between consecutive video frames. Using appearance descriptors, colors are then propagated both spatially and temporally. These methods, however, are…

Computer Vision and Pattern Recognition · Computer Science 2018-08-10 Simone Meyer , Victor Cornillère , Abdelaziz Djelouah , Christopher Schroers , Markus Gross

Frame quality deterioration is one of the main challenges in the field of video understanding. To compensate for the information loss caused by deteriorated frames, recent approaches exploit transformer-based integration modules to obtain…

Computer Vision and Pattern Recognition · Computer Science 2024-02-07 Guanxiong Sun , Chi Wang , Zhaoyu Zhang , Jiankang Deng , Stefanos Zafeiriou , Yang Hua

In many scenarios of Person Re-identification (Re-ID), the gallery set consists of lots of surveillance videos and the query is just an image, thus Re-ID has to be conducted between image and videos. Compared with videos, still person…

Computer Vision and Pattern Recognition · Computer Science 2019-10-11 Xinqian Gu , Bingpeng Ma , Hong Chang , Shiguang Shan , Xilin Chen

We present a learning-based framework, recurrent transformer network (RTN), to restore heavily degraded old films. Instead of performing frame-wise restoration, our method is based on the hidden knowledge learned from adjacent frames that…

Computer Vision and Pattern Recognition · Computer Science 2022-04-01 Ziyu Wan , Bo Zhang , Dongdong Chen , Jing Liao

High-quality video inpainting that completes missing regions in video frames is a promising yet challenging task. State-of-the-art approaches adopt attention models to complete a frame by searching missing contents from reference frames,…

Computer Vision and Pattern Recognition · Computer Science 2020-07-21 Yanhong Zeng , Jianlong Fu , Hongyang Chao

Most frame-based learned video codecs can be interpreted as recurrent neural networks (RNNs) propagating reference information along the temporal dimension. This work revisits the limitations of the current approaches from an RNN…

Image and Video Processing · Electrical Eng. & Systems 2025-08-26 Yi-Hsin Chen , Yi-Chen Yao , Kuan-Wei Ho , Chun-Hung Wu , Huu-Tai Phung , Martin Benjak , Jörn Ostermann , Wen-Hsiao Peng

Existing video recognition algorithms always conduct different training pipelines for inputs with different frame numbers, which requires repetitive training operations and multiplying storage costs. If we evaluate the model using other…

Computer Vision and Pattern Recognition · Computer Science 2023-03-28 Yitian Zhang , Yue Bai , Chang Liu , Huan Wang , Sheng Li , Yun Fu

Flow-based propagation and spatiotemporal Transformer are two mainstream mechanisms in video inpainting (VI). Despite the effectiveness of these components, they still suffer from some limitations that affect their performance. Previous…

Computer Vision and Pattern Recognition · Computer Science 2023-09-08 Shangchen Zhou , Chongyi Li , Kelvin C. K. Chan , Chen Change Loy

Visual tempo characterizes the dynamics and the temporal scale of an action. Modeling such visual tempos of different actions facilitates their recognition. Previous works often capture the visual tempo through sampling raw videos at…

Computer Vision and Pattern Recognition · Computer Science 2020-06-16 Ceyuan Yang , Yinghao Xu , Jianping Shi , Bo Dai , Bolei Zhou

Applying an image processing algorithm independently to each video frame often leads to temporal inconsistency in the resulting video. To address this issue, we present a novel and general approach for blind video temporal consistency. Our…

Computer Vision and Pattern Recognition · Computer Science 2022-01-28 Chenyang Lei , Yazhou Xing , Hao Ouyang , Qifeng Chen

Transfer learning makes it possible to use large vision networks on a variety of domains, by specializing their models' general filters to new tasks. However, these networks assume the input images to have 3 input channels, making them…

Computer Vision and Pattern Recognition · Computer Science 2026-05-28 Mariette Schönfeld , Laurens Devos , Wannes Meert , Hendrik Blockeel

Applying image processing algorithms independently to each frame of a video often leads to undesired inconsistent results over time. Developing temporally consistent video-based extensions, however, requires domain knowledge for individual…

Computer Vision and Pattern Recognition · Computer Science 2018-08-02 Wei-Sheng Lai , Jia-Bin Huang , Oliver Wang , Eli Shechtman , Ersin Yumer , Ming-Hsuan Yang

This paper introduces Progressively Diffused Networks (PDNs) for unifying multi-scale context modeling with deep feature learning, by taking semantic image segmentation as an exemplar application. Prior neural networks, such as ResNet, tend…

Computer Vision and Pattern Recognition · Computer Science 2017-02-21 Ruimao Zhang , Wei Yang , Zhanglin Peng , Xiaogang Wang , Liang Lin

Temporal graph is an abstraction for modeling dynamic systems that consist of evolving interaction elements. In this paper, we aim to solve an important yet neglected problem -- how to learn information from high-order neighbors in temporal…

Machine Learning · Computer Science 2023-04-17 Zehong Wang , Qi Li , Donghua Yu

The dominant paradigm for video-based action segmentation is composed of two steps: first, for each frame, compute low-level features using Dense Trajectories or a Convolutional Neural Network that encode spatiotemporal information locally,…

Computer Vision and Pattern Recognition · Computer Science 2016-08-31 Colin Lea , Rene Vidal , Austin Reiter , Gregory D. Hager

Online video super-resolution (online-VSR) highly relies on an effective alignment module to aggregate temporal information, while the strict latency requirement makes accurate and efficient alignment very challenging. Though much progress…

Computer Vision and Pattern Recognition · Computer Science 2024-10-24 Zhengqiang Zhang , Ruihuang Li , Shi Guo , Yang Cao , Lei Zhang

Motivated by the previous success of Two-Dimensional Convolutional Neural Network (2D CNN) on image recognition, researchers endeavor to leverage it to characterize videos. However, one limitation of applying 2D CNN to analyze videos is…

Computer Vision and Pattern Recognition · Computer Science 2020-07-16 Junwu Weng , Donghao Luo , Yabiao Wang , Ying Tai , Chengjie Wang , Jilin Li , Feiyue Huang , Xudong Jiang , Junsong Yuan

Video colorization is a challenging task that involves inferring plausible and temporally consistent colors for grayscale frames. In this paper, we present ColorDiffuser, an adaptation of a pre-trained text-to-image latent diffusion model…

Computer Vision and Pattern Recognition · Computer Science 2023-06-05 Hanyuan Liu , Minshan Xie , Jinbo Xing , Chengze Li , Tien-Tsin Wong

Feature propagation in Deep Neural Networks (DNNs) can be associated to nonlinear discrete dynamical systems. The novelty, in this paper, lies in letting the discretization parameter (time step-size) vary from layer to layer, which needs to…

Optimization and Control · Mathematics 2022-04-20 Harbir Antil , Hugo Díaz , Evelyn Herberg
‹ Prev 1 2 3 10 Next ›