Related papers: Predictive Coding For Animation-Based Video Compre…

A new way of video compression via forward-referencing using deep learning

To exploit high temporal correlations in video frames of the same scene, the current frame is predicted from the already-encoded reference frames using block-based motion estimation and compensation techniques. While this approach can…

Computer Vision and Pattern Recognition · Computer Science 2022-08-16 S. M. A. K. Rajin , M. Murshed , M. Paul , S. W. Teng , J. Ma

Variable Rate Video Compression using a Hybrid Recurrent Convolutional Learning Framework

In recent years, neural network-based image compression techniques have been able to outperform traditional codecs and have opened the gates for the development of learning-based video codecs. However, to take advantage of the high temporal…

Image and Video Processing · Electrical Eng. & Systems 2020-08-25 Aishwarya Jadhav

Ultra-low bitrate video conferencing using deep image animation

In this work we propose a novel deep learning approach for ultra-low bitrate video compression for video conferencing applications. To address the shortcomings of current video compression paradigms when the available bandwidth is extremely…

Computer Vision and Pattern Recognition · Computer Science 2020-12-02 Goluck Konuko , Giuseppe Valenzise , Stéphane Lathuilière

A Preprocessing Framework for Video Machine Vision under Compression

There has been a growing trend in compressing and transmitting videos from terminals for machine vision tasks. Nevertheless, most video coding optimization method focus on minimizing distortion according to human perceptual metrics,…

Multimedia · Computer Science 2025-12-18 Fei Zhao , Mengxi Guo , Shijie Zhao , Junlin Li , Li Zhang , Xiaodong Xie

A Hybrid Deep Animation Codec for Low-bitrate Video Conferencing

Deep generative models, and particularly facial animation schemes, can be used in video conferencing applications to efficiently compress a video through a sparse set of keypoints, without the need to transmit dense motion vectors. While…

Multimedia · Computer Science 2022-07-28 Goluck Konuko , Stéphane Lathuilière , Giuseppe Valenzise

Prediction-Aware Quality Enhancement of VVC Using CNN

The upcoming video coding standard, Versatile Video Coding (VVC), has shown great improvement compared to its predecessor, High Efficiency Video Coding (HEVC), in terms of bitrate saving. Despite its substantial performance, compressed…

Image and Video Processing · Electrical Eng. & Systems 2021-12-09 Fatemeh Nasiri , Wassim Hamidouche , Luce Morin , Nicolas Dhollande , Gildas Cocherel

Bidirectional Learned Facial Animation Codec for Low Bitrate Talking Head Videos

Existing deep facial animation coding techniques efficiently compress talking head videos by applying deep generative models. Instead of compressing the entire video sequence, these methods focus on compressing only the keyframe and the…

Image and Video Processing · Electrical Eng. & Systems 2025-03-14 Riku Takahashi , Ryugo Morita , Fuma Kimishima , Kosuke Iwama , Jinjia Zhou

ProGVC: Progressive-based Generative Video Compression via Auto-Regressive Context Modeling

Perceptual video compression leverages generative priors to reconstruct realistic textures and motions at low bitrates. However, existing perceptual codecs often lack native support for variable bitrate and progressive delivery, and their…

Computer Vision and Pattern Recognition · Computer Science 2026-03-19 Daowen Li , Ruixiao Dong , Ying Chen , Kai Li , Ding Ding , Li Li

Audio-Visual Driven Compression for Low-Bitrate Talking Head Videos

Talking head video compression has advanced with neural rendering and keypoint-based methods, but challenges remain, especially at low bit rates, including handling large head movements, suboptimal lip synchronization, and distorted facial…

Image and Video Processing · Electrical Eng. & Systems 2025-06-17 Riku Takahashi , Ryugo Morita , Jinjia Zhou

Position Dependent Prediction Combination For Intra-Frame Video Coding

Intra-frame prediction in the High Efficiency Video Coding (HEVC) standard can be empirically improved by applying sets of recursive two-dimensional filters to the predicted values. However, this approach does not allow (or complicates…

Image and Video Processing · Electrical Eng. & Systems 2025-05-30 Amir Said , Xin Zhao , Marta Karczewicz , Jianle Chen , Feng Zou

Diffusion-aided Extreme Video Compression with Lightweight Semantics Guidance

Modern video codecs and learning-based approaches struggle for semantic reconstruction at extremely low bit-rates due to reliance on low-level spatiotemporal redundancies. Generative models, especially diffusion models, offer a new paradigm…

Image and Video Processing · Electrical Eng. & Systems 2026-02-06 Maojun Zhang , Haotian Wu , Richeng Jin , Deniz Gunduz , Krystian Mikolajczyk

High-Efficiency Neural Video Compression via Hierarchical Predictive Learning

The enhanced Deep Hierarchical Video Compression-DHVC 2.0-has been introduced. This single-model neural video codec operates across a broad range of bitrates, delivering not only superior compression performance to representative methods…

Image and Video Processing · Electrical Eng. & Systems 2024-10-04 Ming Lu , Zhihao Duan , Wuyang Cong , Dandan Ding , Fengqing Zhu , Zhan Ma

AlphaVC: High-Performance and Efficient Learned Video Compression

Recently, learned video compression has drawn lots of attention and show a rapid development trend with promising results. However, the previous works still suffer from some criticial issues and have a performance gap with traditional…

Computer Vision and Pattern Recognition · Computer Science 2022-08-01 Yibo Shi , Yunying Ge , Jing Wang , Jue Mao

HEVC Inter Coding Using Deep Recurrent Neural Networks and Artificial Reference Pictures

The efficiency of motion compensated prediction in modern video codecs highly depends on the available reference pictures. Occlusions and non-linear motion pose challenges for the motion compensation and often result in high bit rates for…

Multimedia · Computer Science 2018-12-06 Felix Haub , Thorsten Laude , Jörn Ostermann

Prediction of Transformed (DCT) Video Coding Residual for Video Compression

Video compression has been investigated by means of analysis-synthesis, and more particularly by means of inpainting. The first part of our approach has been to develop the inpainting of DCT coefficients in an image. This has shown good…

Information Theory · Computer Science 2014-04-17 Matthieu Moinard , Isabelle Amonou , Pierre Duhamel , Patrice Brault

Regression-based Intra-prediction for Image and Video Coding

By utilizing previously known areas in an image, intra-prediction techniques can find a good estimate of the current block. This allows the encoder to store only the error between the original block and the generated estimate, thus leading…

Multimedia · Computer Science 2016-05-13 Carlo Noel Ochotorena , Yukihiko Yamashita

Generative Compression for Face Video: A Hybrid Scheme

As the latest video coding standard, versatile video coding (VVC) has shown its ability in retaining pixel quality. To excavate more compression potential for video conference scenarios under ultra-low bitrate, this paper proposes a bitrate…

Image and Video Processing · Electrical Eng. & Systems 2023-03-21 Anni Tang , Yan Huang , Jun Ling , Zhiyu Zhang , Yiwei Zhang , Rong Xie , Li Song

End-to-End Learning for Video Frame Compression with Self-Attention

One of the core components of conventional (i.e., non-learned) video codecs consists of predicting a frame from a previously-decoded frame, by leveraging temporal correlations. In this paper, we propose an end-to-end learned system for…

Image and Video Processing · Electrical Eng. & Systems 2020-04-22 Nannan Zou , Honglei Zhang , Francesco Cricri , Hamed R. Tavakoli , Jani Lainema , Emre Aksu , Miska Hannuksela , Esa Rahtu

Key-Point Sequence Lossless Compression for Intelligent Video Analysis

Feature coding has been recently considered to facilitate intelligent video analysis for urban computing. Instead of raw videos, extracted features in the front-end are encoded and transmitted to the back-end for further processing. In this…

Multimedia · Computer Science 2020-09-11 Weiyao Lin , Xiaoyi He , Wenrui Dai , John See , Tushar Shinde , Hongkai Xiong , Lingyu Duan

Video Coding for Machine: Compact Visual Representation Compression for Intelligent Collaborative Analytics

Video Coding for Machines (VCM) is committed to bridging to an extent separate research tracks of video/image compression and feature compression, and attempts to optimize compactness and efficiency jointly from a unified perspective of…

Computer Vision and Pattern Recognition · Computer Science 2021-10-19 Wenhan Yang , Haofeng Huang , Yueyu Hu , Ling-Yu Duan , Jiaying Liu