Related papers: Versatile Learned Video Compression

M-LVC: Multiple Frames Prediction for Learned Video Compression

We propose an end-to-end learned video compression scheme for low-latency scenarios. Previous methods are limited in using the previous one frame as reference. Our method introduces the usage of the previous multiple frames as references.…

Image and Video Processing · Electrical Eng. & Systems 2021-08-02 Jianping Lin , Dong Liu , Houqiang Li , Feng Wu

L-LBVC: Long-Term Motion Estimation and Prediction for Learned Bi-Directional Video Compression

Recently, learned video compression (LVC) has shown superior performance under low-delay configuration. However, the performance of learned bi-directional video compression (LBVC) still lags behind traditional bi-directional coding. The…

Computer Vision and Pattern Recognition · Computer Science 2025-04-04 Yongqi Zhai , Luyang Tang , Wei Jiang , Jiayu Yang , Ronggang Wang

DVC: An End-to-end Deep Video Compression Framework

Conventional video compression approaches use the predictive coding architecture and encode the corresponding motion information and residual information. In this paper, taking advantage of both classical architecture in the conventional…

Image and Video Processing · Electrical Eng. & Systems 2019-04-09 Guo Lu , Wanli Ouyang , Dong Xu , Xiaoyun Zhang , Chunlei Cai , Zhiyong Gao

Advancing Learned Video Compression with In-loop Frame Prediction

Recent years have witnessed an increasing interest in end-to-end learned video compression. Most previous works explore temporal redundancy by detecting and compressing a motion map to warp the reference frame towards the target frame. Yet,…

Image and Video Processing · Electrical Eng. & Systems 2022-11-21 Ren Yang , Radu Timofte , Luc Van Gool

MMVC: Learned Multi-Mode Video Compression with Block-based Prediction Mode Selection and Density-Adaptive Entropy Coding

Learning-based video compression has been extensively studied over the past years, but it still has limitations in adapting to various motion patterns and entropy models. In this paper, we propose multi-mode video compression (MMVC), a…

Image and Video Processing · Electrical Eng. & Systems 2023-04-06 Bowen Liu , Yu Chen , Rakesh Chowdary Machineni , Shiyu Liu , Hun-Seok Kim

FVC: A New Framework towards Deep Video Compression in Feature Space

Learning based video compression attracts increasing attention in the past few years. The previous hybrid coding approaches rely on pixel space operations to reduce spatial and temporal redundancy, which may suffer from inaccurate motion…

Image and Video Processing · Electrical Eng. & Systems 2021-08-24 Zhihao Hu , Guo Lu , Dong Xu

Uni-LVC: A Unified Method for Intra- and Inter-Mode Learned Video Compression

Recent advances in learned video compression (LVC) have led to significant performance gains, with codecs such as DCVC-RT surpassing the H.266/VVC low-delay mode in compression efficiency. However, existing LVCs still exhibit key…

Image and Video Processing · Electrical Eng. & Systems 2026-03-09 Yichi Zhang , Ruoyu Yang , Fengqing Zhu

VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression

Neural Radiance Field (NeRF)-based volumetric video has revolutionized visual media by delivering photorealistic Free-Viewpoint Video (FVV) experiences that provide audiences with unprecedented immersion and interactivity. However, the…

Image and Video Processing · Electrical Eng. & Systems 2024-12-17 Qiang Hu , Houqiang Zhong , Zihan Zheng , Xiaoyun Zhang , Zhengxue Cheng , Li Song , Guangtao Zhai , Yanfeng Wang

Learned Video Compression via Heterogeneous Deformable Compensation Network

Learned video compression has recently emerged as an essential research topic in developing advanced video compression technologies, where motion compensation is considered one of the most challenging issues. In this paper, we propose a…

Image and Video Processing · Electrical Eng. & Systems 2023-06-30 Huairui Wang , Zhenzhong Chen , Chang Wen Chen

LMVC: An End-to-End Learned Multiview Video Coding Framework

Multiview video is a key data source for volumetric video, enabling immersive 3D scene reconstruction but posing significant challenges in storage and transmission due to its massive data volume. Recently, deep learning-based end-to-end…

Computer Vision and Pattern Recognition · Computer Science 2025-09-05 Xihua Sheng , Yingwen Zhang , Long Xu , Shiqi Wang

Emerging Advances in Learned Video Compression: Models, Systems and Beyond

Video compression is a fundamental topic in the visual intelligence, bridging visual signal sensing/capturing and high-level visual analytics. The broad success of artificial intelligence (AI) technology has enriched the horizon of video…

Image and Video Processing · Electrical Eng. & Systems 2025-05-01 Chuanmin Jia , Feng Ye , Siwei Ma , Wen Gao , Huifang Sun , Leonardo Chiariglione

VNVC: A Versatile Neural Video Coding Framework for Efficient Human-Machine Vision

Almost all digital videos are coded into compact representations before being transmitted. Such compact representations need to be decoded back to pixels before being displayed to humans and - as usual - before being enhanced/analyzed by…

Image and Video Processing · Electrical Eng. & Systems 2023-11-03 Xihua Sheng , Li Li , Dong Liu , Houqiang Li

End-to-end Optimized Video Compression with MV-Residual Prediction

We present an end-to-end trainable framework for P-frame compression in this paper. A joint motion vector (MV) and residual prediction network MV-Residual is designed to extract the ensembled features of motion representations and residual…

Image and Video Processing · Electrical Eng. & Systems 2020-05-28 XiangJi Wu , Ziwen Zhang , Jie Feng , Lei Zhou , Junmin Wu

LCCM-VC: Learned Conditional Coding Modes for Video Compression

End-to-end learning-based video compression has made steady progress over the last several years. However, unlike learning-based image coding, which has already surpassed its handcrafted counterparts, learning-based video coding still has…

Image and Video Processing · Electrical Eng. & Systems 2023-04-20 Hadi Hadizadeh , Ivan V. Bajić

Learning Cross-Scale Weighted Prediction for Efficient Neural Video Compression

Neural video codecs have demonstrated great potential in video transmission and storage applications. Existing neural hybrid video coding approaches rely on optical flow or Gaussian-scale flow for prediction, which cannot support…

Image and Video Processing · Electrical Eng. & Systems 2023-07-19 Zongyu Guo , Runsen Feng , Zhizheng Zhang , Xin Jin , Zhibo Chen

Variable Rate Video Compression using a Hybrid Recurrent Convolutional Learning Framework

In recent years, neural network-based image compression techniques have been able to outperform traditional codecs and have opened the gates for the development of learning-based video codecs. However, to take advantage of the high temporal…

Image and Video Processing · Electrical Eng. & Systems 2020-08-25 Aishwarya Jadhav

High Visual-Fidelity Learned Video Compression

With the growing demand for video applications, many advanced learned video compression methods have been developed, outperforming traditional methods in terms of objective quality metrics such as PSNR. Existing methods primarily focus on…

Image and Video Processing · Electrical Eng. & Systems 2023-10-10 Meng Li , Yibo Shi , Jing Wang , Yunqi Huang

End-to-End Rate-Distortion Optimized Learned Hierarchical Bi-Directional Video Compression

Conventional video compression (VC) methods are based on motion compensated transform coding, and the steps of motion estimation, mode and quantization parameter selection, and entropy coding are optimized individually due to the…

Image and Video Processing · Electrical Eng. & Systems 2021-12-20 M. Akın Yılmaz , A. Murat Tekalp

Conditional Coding for Flexible Learned Video Compression

This paper introduces a novel framework for end-to-end learned video coding. Image compression is generalized through conditional coding to exploit information from reference frames, allowing to process intra and inter frames with the same…

Image and Video Processing · Electrical Eng. & Systems 2021-04-29 Théo Ladune , Pierrick Philippe , Wassim Hamidouche , Lu Zhang , Olivier Déforges

M3-CVC: Controllable Video Compression with Multimodal Generative Models

Traditional and neural video codecs commonly encounter limitations in controllability and generality under ultra-low-bitrate coding scenarios. To overcome these challenges, we propose M3-CVC, a controllable video compression framework…

Image and Video Processing · Electrical Eng. & Systems 2024-12-30 Rui Wan , Qi Zheng , Yibo Fan