Related papers: Variable Rate Learned Wavelet Video Coding using T…

Learned Wavelet Video Coding using Motion Compensated Temporal Filtering

We present an end-to-end trainable wavelet video coder based on motion-compensated temporal filtering (MCTF). Thereby, we introduce a different coding scheme for learned video compression, which is currently dominated by residual and…

Image and Video Processing · Electrical Eng. & Systems · 2023-10-13 · Anna Meyer, Fabian Brand, André Kaup

Motion-Compensated Temporal Filtering for Critically-Sampled Wavelet-Encoded Images

We propose a novel motion estimation/compensation (ME/MC) method for wavelet-based (in-band) motion compensated temporal filtering (MCTF), with application to low-bitrate video coding. Unlike the conventional in-band MCTF algorithms, which…

Computer Vision and Pattern Recognition · Computer Science · 2017-05-17 · Vildan Atalay Aydin, Hassan Foroosh

Efficient Learned Wavelet Image and Video Coding

Learned wavelet image and video coding approaches provide an explainable framework with a latent space corresponding to a wavelet decomposition. The wavelet image coder iWave++ achieves state-of-the-art performance and has been employed for…

Image and Video Processing · Electrical Eng. & Systems · 2024-11-07 · Anna Meyer, Srivatsa Prativadibhayankaram, André Kaup

Content Adaptive Wavelet Lifting for Scalable Lossless Video Coding

Scalable lossless video coding is an important aspect for many professional applications. Wavelet-based video coding decomposes an input sequence into a lowpass and a highpass subband by filtering along the temporal axis. The lowpass…

Image and Video Processing · Electrical Eng. & Systems · 2023-02-03 · Daniela Lanz, Christian Herbert, André Kaup

Dynamic Temporal Filtering in Video Models

Video temporal dynamics is conventionally modeled with 3D spatial-temporal kernel or its factorized version comprised of 2D spatial kernel and 1D temporal kernel. The modeling power, nevertheless, is limited by the fixed window size and…

Computer Vision and Pattern Recognition · Computer Science · 2022-11-16 · Fuchen Long, Zhaofan Qiu, Yingwei Pan, Ting Yao, Chong-Wah Ngo, Tao Mei

MTC-VAE: Multi-Level Temporal Compression with Content Awareness

Latent Video Diffusion Models (LVDMs) rely on Variational Autoencoders (VAEs) to compress videos into compact latent representations. For continuous Variational Autoencoders (VAEs), achieving higher compression rates is desirable; yet, the…

Computer Vision and Pattern Recognition · Computer Science · 2026-02-03 · Yubo Dong, Linchao Zhu

Neural Video Compression with Temporal Layer-Adaptive Hierarchical B-frame Coding

Neural video compression (NVC) is a rapidly evolving video coding research area, with some models achieving superior coding efficiency compared to the latest video coding standard Versatile Video Coding (VVC). In conventional video coding…

Computer Vision and Pattern Recognition · Computer Science · 2023-09-06 · Yeongwoong Kim, Suyong Bahk, Seungeon Kim, Won Hee Lee, Dokwan Oh, Hui Yong Kim

Bidirectional Multirate Reconstruction for Temporal Modeling in Videos

Despite the recent success of neural networks in image feature learning, a major problem in the video domain is the lack of sufficient labeled data for learning to model temporal information. In this paper, we propose an unsupervised…

Computer Vision and Pattern Recognition · Computer Science · 2016-11-29 · Linchao Zhu, Zhongwen Xu, Yi Yang

VMAF-based Bitrate Ladder Estimation for Adaptive Streaming

In HTTP Adaptive Streaming, video content is conventionally encoded by adapting its spatial resolution and quantization level to best match the prevailing network state and display characteristics. It is well known that the traditional…

Image and Video Processing · Electrical Eng. & Systems · 2021-09-17 · Angeliki V. Katsenou, Fan Zhang, Kyle Swanson, Mariana Afonso, Joel Sole, David R. Bull

Self-supervised Temporal Discriminative Learning for Video Representation Learning

Temporal cues in videos provide important information for recognizing actions accurately. However, temporal-discriminative features can hardly be extracted without using an annotated large-scale video action dataset for training. This paper…

Computer Vision and Pattern Recognition · Computer Science · 2020-08-06 · Jinpeng Wang, Yiqi Lin, Andy J. Ma, Pong C. Yuen

EV-NVC: Efficient Variable bitrate Neural Video Compression

Training neural video codec (NVC) with variable rate is a highly challenging task due to its complex training strategies and model structure. In this paper, we train an efficient variable bitrate neural video codec (EV-NVC) with the…

Multimedia · Computer Science · 2025-11-04 · Yongcun Hu, Yingzhen Zhai, Jixiang Luo, Wenrui Dai, Dell Zhang, Hongkai Xiong, Xuelong Li

Optimizing Rate-Distortion Performance of Motion Compensated Wavelet Lifting with Denoised Prediction and Update

Efficient lossless coding of medical volume data with temporal axis can be achieved by motion compensated wavelet lifting. As side benefit, a scalable bit stream is generated, which allows for displaying the data at different resolution…

Image and Video Processing · Electrical Eng. & Systems · 2023-02-03 · Daniela Lanz, André Kaup

ELF-VC: Efficient Learned Flexible-Rate Video Coding

While learned video codecs have demonstrated great promise, they have yet to achieve sufficient efficiency for practical deployment. In this work, we propose several novel ideas for learned video compression which allow for improved…

Image and Video Processing · Electrical Eng. & Systems · 2021-10-06 · Oren Rippel, Alexander G. Anderson, Kedar Tatwawadi, Sanjay Nair, Craig Lytle, Lubomir Bourdev

Deep Variable-Length Feedback Codes

Deep learning has enabled significant advances in feedback-based channel coding, yet existing learned schemes remain fundamentally limited: they employ fixed block lengths, suffer degraded performance at high rates, and cannot fully exploit…

Information Theory · Computer Science · 2026-02-10 · Yu Ding, Yulin Shao

Learning Latent Sub-events in Activity Videos Using Temporal Attention Filters

In this paper, we newly introduce the concept of temporal attention filters, and describe how they can be used for human activity recognition from videos. Many high-level activities are often composed of multiple temporal parts (e.g.,…

Computer Vision and Pattern Recognition · Computer Science · 2016-12-28 · AJ Piergiovanni, Chenyou Fan, Michael S. Ryoo

Multiscale Motion-Aware and Spatial-Temporal-Channel Contextual Coding Network for Learned Video Compression

Recently, learned video compression has achieved exciting performance. Following the traditional hybrid prediction coding framework, most learned methods generally adopt the motion estimation motion compensation (MEMC) method to remove…

Image and Video Processing · Electrical Eng. & Systems · 2023-10-20 · Yiming Wang, Qian Huang, Bin Tang, Huashan Sun, Xing Li

Exploring Spatial-Temporal Multi-Frequency Analysis for High-Fidelity and Temporal-Consistency Video Prediction

Video prediction is a pixel-wise dense prediction task to infer future frames based on past frames. Missing appearance details and motion blur are still two major problems for current predictive models, which lead to image distortion and…

Computer Vision and Pattern Recognition · Computer Science · 2020-05-25 · Beibei Jin, Yu Hu, Qiankun Tang, Jingyu Niu, Zhiping Shi, Yinhe Han, Xiaowei Li

Exploring Long- and Short-Range Temporal Information for Learned Video Compression

Learned video compression methods have gained a variety of interest in the video coding community since they have matched or even exceeded the rate-distortion (RD) performance of traditional video codecs. However, many current…

Image and Video Processing · Electrical Eng. & Systems · 2024-02-02 · Huairui Wang, Zhenzhong Chen

Decoding Complexity-Rate-Quality Pareto-Front for Adaptive VVC Streaming

Pareto-front optimization is crucial for addressing the multi-objective challenges in video streaming, enabling the identification of optimal trade-offs between conflicting goals such as bitrate, video quality, and decoding complexity. This…

Image and Video Processing · Electrical Eng. & Systems · 2024-09-30 · Angeliki Katsenou, Vignesh V Menon, Adam Wieckowski, Benjamin Bross, Detlev Marpe

Content-Adaptive Motion Rate Adaption for Learned Video Compression

This paper introduces an online motion rate adaptation scheme for learned video compression, with the aim of achieving content-adaptive coding on individual test sequences to mitigate the domain gap between training and test data. It…

Image and Video Processing · Electrical Eng. & Systems · 2023-02-14 · Chih-Hsuan Lin, Yi-Hsin Chen, Wen-Hsiao Peng