Related papers: Scene Matters: Model-based Deep Video Compression

MMVC: Learned Multi-Mode Video Compression with Block-based Prediction Mode Selection and Density-Adaptive Entropy Coding

Learning-based video compression has been extensively studied over the past years, but it still has limitations in adapting to various motion patterns and entropy models. In this paper, we propose multi-mode video compression (MMVC), a…

Image and Video Processing · Electrical Eng. & Systems 2023-04-06 Bowen Liu , Yu Chen , Rakesh Chowdary Machineni , Shiyu Liu , Hun-Seok Kim

FVC: A New Framework towards Deep Video Compression in Feature Space

Learning based video compression attracts increasing attention in the past few years. The previous hybrid coding approaches rely on pixel space operations to reduce spatial and temporal redundancy, which may suffer from inaccurate motion…

Image and Video Processing · Electrical Eng. & Systems 2021-08-24 Zhihao Hu , Guo Lu , Dong Xu

DMVC: Multi-Camera Video Compression Network aimed at Improving Deep Learning Accuracy

We introduce a cutting-edge video compression framework tailored for the age of ubiquitous video data, uniquely designed to serve machine learning applications. Unlike traditional compression methods that prioritize human visual perception,…

Computer Vision and Pattern Recognition · Computer Science 2024-10-25 Huan Cui , Qing Li , Hanling Wang , Yong jiang

DVC: An End-to-end Deep Video Compression Framework

Conventional video compression approaches use the predictive coding architecture and encode the corresponding motion information and residual information. In this paper, taking advantage of both classical architecture in the conventional…

Image and Video Processing · Electrical Eng. & Systems 2019-04-09 Guo Lu , Wanli Ouyang , Dong Xu , Xiaoyun Zhang , Chunlei Cai , Zhiyong Gao

A Preprocessing Framework for Video Machine Vision under Compression

There has been a growing trend in compressing and transmitting videos from terminals for machine vision tasks. Nevertheless, most video coding optimization method focus on minimizing distortion according to human perceptual metrics,…

Multimedia · Computer Science 2025-12-18 Fei Zhao , Mengxi Guo , Shijie Zhao , Junlin Li , Li Zhang , Xiaodong Xie

SMC++: Masked Learning of Unsupervised Video Semantic Compression

Most video compression methods focus on human visual perception, neglecting semantic preservation. This leads to severe semantic loss during the compression, hampering downstream video analysis tasks. In this paper, we propose a Masked…

Computer Vision and Pattern Recognition · Computer Science 2025-10-14 Yuan Tian , Xiaoyue Ling , Cong Geng , Qiang Hu , Guo Lu , Guangtao Zhai

Multiscale Motion-Aware and Spatial-Temporal-Channel Contextual Coding Network for Learned Video Compression

Recently, learned video compression has achieved exciting performance. Following the traditional hybrid prediction coding framework, most learned methods generally adopt the motion estimation motion compensation (MEMC) method to remove…

Image and Video Processing · Electrical Eng. & Systems 2023-10-20 Yiming Wang , Qian Huang , Bin Tang , Huashan Sun , Xing Li

Towards Practical Real-Time Neural Video Compression

We introduce a practical real-time neural video codec (NVC) designed to deliver high compression ratio, low latency and broad versatility. In practice, the coding speed of NVCs depends on 1) computational costs, and 2) non-computational…

Image and Video Processing · Electrical Eng. & Systems 2025-03-19 Zhaoyang Jia , Bin Li , Jiahao Li , Wenxuan Xie , Linfeng Qi , Houqiang Li , Yan Lu

Improving Video Compression With Deep Visual-Attention Models

Recent advances in deep learning have markedly improved the quality of visual-attention modelling. In this work we apply these advances to video compression. We propose a compression method that uses a saliency model to adaptively compress…

Computer Vision and Pattern Recognition · Computer Science 2019-07-25 Vitaliy Lyudvichenko , Mikhail Erofeev , Alexander Ploshkin , Dmitriy Vatolin

Enhancing Neural Video Compression of Static Scenes with Positive-Incentive Noise

Static scene videos, such as surveillance feeds and videotelephony streams, constitute a dominant share of storage consumption and network traffic. However, both traditional standardized codecs and neural video compression (NVC) methods…

Image and Video Processing · Electrical Eng. & Systems 2026-03-30 Cheng Yuan , Zhenyu Jia , Jiawei Shao , Xuelong Li

Emerging Advances in Learned Video Compression: Models, Systems and Beyond

Video compression is a fundamental topic in the visual intelligence, bridging visual signal sensing/capturing and high-level visual analytics. The broad success of artificial intelligence (AI) technology has enriched the horizon of video…

Image and Video Processing · Electrical Eng. & Systems 2025-05-01 Chuanmin Jia , Feng Ye , Siwei Ma , Wen Gao , Huifang Sun , Leonardo Chiariglione

High-Efficiency Neural Video Compression via Hierarchical Predictive Learning

The enhanced Deep Hierarchical Video Compression-DHVC 2.0-has been introduced. This single-model neural video codec operates across a broad range of bitrates, delivering not only superior compression performance to representative methods…

Image and Video Processing · Electrical Eng. & Systems 2024-10-04 Ming Lu , Zhihao Duan , Wuyang Cong , Dandan Ding , Fengqing Zhu , Zhan Ma

Content-Adaptive Inference for State-of-the-art Learned Video Compression

While the BD-rate performance of recent learned video codec models in both low-delay and random-access modes exceed that of respective modes of traditional codecs on average over common benchmarks, the performance improvements for…

Image and Video Processing · Electrical Eng. & Systems 2025-10-13 Ahmet Bilican , M. Akın Yılmaz , A. Murat Tekalp

M3-CVC: Controllable Video Compression with Multimodal Generative Models

Traditional and neural video codecs commonly encounter limitations in controllability and generality under ultra-low-bitrate coding scenarios. To overcome these challenges, we propose M3-CVC, a controllable video compression framework…

Image and Video Processing · Electrical Eng. & Systems 2024-12-30 Rui Wan , Qi Zheng , Yibo Fan

Neural Video Compression with Context Modulation

Efficient video coding is highly dependent on exploiting the temporal redundancy, which is usually achieved by extracting and leveraging the temporal context in the emerging conditional coding-based neural video codec (NVC). Although the…

Image and Video Processing · Electrical Eng. & Systems 2025-05-21 Chuanbo Tang , Zhuoyuan Li , Yifan Bian , Li Li , Dong Liu

Diffusion-aided Extreme Video Compression with Lightweight Semantics Guidance

Modern video codecs and learning-based approaches struggle for semantic reconstruction at extremely low bit-rates due to reliance on low-level spatiotemporal redundancies. Generative models, especially diffusion models, offer a new paradigm…

Image and Video Processing · Electrical Eng. & Systems 2026-02-06 Maojun Zhang , Haotian Wu , Richeng Jin , Deniz Gunduz , Krystian Mikolajczyk

I$^2$VC: A Unified Framework for Intra- & Inter-frame Video Compression

Video compression aims to reconstruct seamless frames by encoding the motion and residual information from existing frames. Previous neural video compression methods necessitate distinct codecs for three types of frames (I-frame, P-frame…

Image and Video Processing · Electrical Eng. & Systems 2024-06-04 Meiqin Liu , Chenming Xu , Yukai Gu , Chao Yao , Yao Zhao

Deep Generative Video Compression

The usage of deep generative models for image compression has led to impressive performance gains over classical codecs while neural video compression is still in its infancy. Here, we propose an end-to-end, deep generative modeling…

Computer Vision and Pattern Recognition · Computer Science 2019-11-05 Jun Han , Salvator Lombardo , Christopher Schroers , Stephan Mandt

Parameter-Efficient Instance-Adaptive Neural Video Compression

Learning-based Neural Video Codecs (NVCs) have emerged as a compelling alternative to standard video codecs, demonstrating promising performance, and simple and easily maintainable pipelines. However, NVCs often fall short of compression…

Image and Video Processing · Electrical Eng. & Systems 2024-12-02 Hyunmo Yang , Seungjun Oh , Eunbyung Park

Neural Video Coding using Multiscale Motion Compensation and Spatiotemporal Context Model

Over the past two decades, traditional block-based video coding has made remarkable progress and spawned a series of well-known standards such as MPEG-4, H.264/AVC and H.265/HEVC. On the other hand, deep neural networks (DNNs) have shown…

Image and Video Processing · Electrical Eng. & Systems 2020-07-10 Haojie Liu , Ming Lu , Zhan Ma , Fan Wang , Zhihuang Xie , Xun Cao , Yao Wang