Related papers: Deep Contextual Video Compression

Conditional Entropy Coding for Efficient Video Compression

We propose a very simple and efficient video compression framework that only focuses on modeling the conditional entropy between frames. Unlike prior learning-based approaches, we reduce complexity by not performing any form of explicit…

Image and Video Processing · Electrical Eng. & Systems 2020-08-24 Jerry Liu , Shenlong Wang , Wei-Chiu Ma , Meet Shah , Rui Hu , Pranaab Dhawan , Raquel Urtasun

DVC: An End-to-end Deep Video Compression Framework

Conventional video compression approaches use the predictive coding architecture and encode the corresponding motion information and residual information. In this paper, taking advantage of both classical architecture in the conventional…

Image and Video Processing · Electrical Eng. & Systems 2019-04-09 Guo Lu , Wanli Ouyang , Dong Xu , Xiaoyun Zhang , Chunlei Cai , Zhiyong Gao

Extreme Video Compression with Pre-trained Diffusion Models

Diffusion models have achieved remarkable success in generating high quality image and video data. More recently, they have also been used for image compression with high perceptual quality. In this paper, we present a novel approach to…

Image and Video Processing · Electrical Eng. & Systems 2024-02-15 Bohan Li , Yiming Liu , Xueyan Niu , Bo Bai , Lei Deng , Deniz Gündüz

Conditional Coding for Flexible Learned Video Compression

This paper introduces a novel framework for end-to-end learned video coding. Image compression is generalized through conditional coding to exploit information from reference frames, allowing to process intra and inter frames with the same…

Image and Video Processing · Electrical Eng. & Systems 2021-04-29 Théo Ladune , Pierrick Philippe , Wassim Hamidouche , Lu Zhang , Olivier Déforges

Conditional Video Generation for High-Efficiency Video Compression

Perceptual studies demonstrate that conditional diffusion models excel at reconstructing video content aligned with human visual perception. Building on this insight, we propose a video compression framework that leverages conditional…

Computer Vision and Pattern Recognition · Computer Science 2025-09-26 Fangqiu Yi , Jingyu Xu , Jiawei Shao , Chi Zhang , Xuelong Li

Diffusion-aided Extreme Video Compression with Lightweight Semantics Guidance

Modern video codecs and learning-based approaches struggle for semantic reconstruction at extremely low bit-rates due to reliance on low-level spatiotemporal redundancies. Generative models, especially diffusion models, offer a new paradigm…

Image and Video Processing · Electrical Eng. & Systems 2026-02-06 Maojun Zhang , Haotian Wu , Richeng Jin , Deniz Gunduz , Krystian Mikolajczyk

A Preprocessing Framework for Video Machine Vision under Compression

There has been a growing trend in compressing and transmitting videos from terminals for machine vision tasks. Nevertheless, most video coding optimization method focus on minimizing distortion according to human perceptual metrics,…

Multimedia · Computer Science 2025-12-18 Fei Zhao , Mengxi Guo , Shijie Zhao , Junlin Li , Li Zhang , Xiaodong Xie

Motion and Context-Aware Audio-Visual Conditioned Video Prediction

The existing state-of-the-art method for audio-visual conditioned video prediction uses the latent codes of the audio-visual frames from a multimodal stochastic network and a frame encoder to predict the next visual frame. However, a direct…

Computer Vision and Pattern Recognition · Computer Science 2023-09-21 Yating Xu , Conghui Hu , Gim Hee Lee

Conditional Residual Coding: A Remedy for Bottleneck Problems in Conditional Inter Frame Coding

Conditional coding is a new video coding paradigm enabled by neural-network-based compression. It can be shown that conditional coding is in theory better than the traditional residual coding, which is widely used in video compression…

Image and Video Processing · Electrical Eng. & Systems 2024-01-29 Fabian Brand , Jürgen Seiler , André Kaup

Predictive Coding For Animation-Based Video Compression

We address the problem of efficiently compressing video for conferencing-type applications. We build on recent approaches based on image animation, which can achieve good reconstruction quality at very low bitrate by representing face…

Computer Vision and Pattern Recognition · Computer Science 2023-07-11 Goluck Konuko , Stéphane Lathuilière , Giuseppe Valenzise

Scene Matters: Model-based Deep Video Compression

Video compression has always been a popular research area, where many traditional and deep video compression methods have been proposed. These methods typically rely on signal prediction theory to enhance compression performance by…

Computer Vision and Pattern Recognition · Computer Science 2023-08-31 Lv Tang , Xinfeng Zhang , Gai Zhang , Xiaoqi Ma

Bi-Directional Deep Contextual Video Compression

Deep video compression has made remarkable process in recent years, with the majority of advancements concentrated on P-frame coding. Although efforts to enhance B-frame coding are ongoing, their compression performance is still far behind…

Computer Vision and Pattern Recognition · Computer Science 2025-01-22 Xihua Sheng , Li Li , Dong Liu , Shiqi Wang

Learned Video Compression for YUV 4:2:0 Content Using Flow-based Conditional Inter-frame Coding

This paper proposes a learning-based video compression framework for variable-rate coding on YUV 4:2:0 content. Most existing learning-based video compression models adopt the traditional hybrid-based coding architecture, which involves…

Image and Video Processing · Electrical Eng. & Systems 2022-10-18 Yung-Han Ho , Chih-Hsuan Lin , Peng-Yu Chen , Mu-Jung Chen , Chih-Peng Chang , Wen-Hsiao Peng , Hsueh-Ming Hang

Neural Video Compression with In-Loop Contextual Filtering and Out-of-Loop Reconstruction Enhancement

This paper explores the application of enhancement filtering techniques in neural video compression. Specifically, we categorize these techniques into in-loop contextual filtering and out-of-loop reconstruction enhancement based on whether…

Image and Video Processing · Electrical Eng. & Systems 2025-09-05 Yaojun Wu , Chaoyi Lin , Yiming Wang , Semih Esenlik , Zhaobin Zhang , Kai Zhang , Li Zhang

Accurate and Fast Compressed Video Captioning

Existing video captioning approaches typically require to first sample video frames from a decoded video and then conduct a subsequent process (e.g., feature extraction and/or captioning model learning). In this pipeline, manual frame…

Computer Vision and Pattern Recognition · Computer Science 2024-01-04 Yaojie Shen , Xin Gu , Kai Xu , Heng Fan , Longyin Wen , Libo Zhang

End-to-End Learning for Video Frame Compression with Self-Attention

One of the core components of conventional (i.e., non-learned) video codecs consists of predicting a frame from a previously-decoded frame, by leveraging temporal correlations. In this paper, we propose an end-to-end learned system for…

Image and Video Processing · Electrical Eng. & Systems 2020-04-22 Nannan Zou , Honglei Zhang , Francesco Cricri , Hamed R. Tavakoli , Jani Lainema , Emre Aksu , Miska Hannuksela , Esa Rahtu

Parameter-Efficient Instance-Adaptive Neural Video Compression

Learning-based Neural Video Codecs (NVCs) have emerged as a compelling alternative to standard video codecs, demonstrating promising performance, and simple and easily maintainable pipelines. However, NVCs often fall short of compression…

Image and Video Processing · Electrical Eng. & Systems 2024-12-02 Hyunmo Yang , Seungjun Oh , Eunbyung Park

LCCM-VC: Learned Conditional Coding Modes for Video Compression

End-to-end learning-based video compression has made steady progress over the last several years. However, unlike learning-based image coding, which has already surpassed its handcrafted counterparts, learning-based video coding still has…

Image and Video Processing · Electrical Eng. & Systems 2023-04-20 Hadi Hadizadeh , Ivan V. Bajić

Context-aware Deep Feature Compression for High-speed Visual Tracking

We propose a new context-aware correlation filter based tracking framework to achieve both high computational speed and state-of-the-art performance among real-time trackers. The major contribution to the high computational speed lies in…

Computer Vision and Pattern Recognition · Computer Science 2020-10-21 Jongwon Choi , Hyung Jin Chang , Tobias Fischer , Sangdoo Yun , Kyuewang Lee , Jiyeoup Jeong , Yiannis Demiris , Jin Young Choi

Instance-Adaptive Video Compression: Improving Neural Codecs by Training on the Test Set

We introduce a video compression algorithm based on instance-adaptive learning. On each video sequence to be transmitted, we finetune a pretrained compression model. The optimal parameters are transmitted to the receiver along with the…

Image and Video Processing · Electrical Eng. & Systems 2023-06-26 Ties van Rozendaal , Johann Brehmer , Yunfan Zhang , Reza Pourreza , Auke Wiggers , Taco S. Cohen