Related papers: GIViC: Generative Implicit Video Compression

DiV-INR: Extreme Low-Bitrate Diffusion Video Compression with INR Conditioning

We present a perceptually-driven video compression framework integrating implicit neural representations (INRs) and pre-trained video diffusion models to address the extremely low bitrate regime (<0.05 bpp). Our approach exploits the…

Image and Video Processing · Electrical Eng. & Systems 2026-04-10 Eren Çetin, Lucas Relic, Yuanyi Xue, Markus Gross, Christopher Schroers, Roberto Azevedo

Immersive Video Compression using Implicit Neural Representations

Recent work on implicit neural representations (INRs) has evidenced their potential for efficiently representing and encoding conventional video content. In this paper we, for the first time, extend their application to immersive…

Image and Video Processing · Electrical Eng. & Systems 2024-11-22 Ho Man Kwan, Fan Zhang, Andrew Gower, David Bull

NVRC: Neural Video Representation Compression

Recent advances in implicit neural representation (INR)-based video coding have demonstrated its potential to compete with both conventional and other learning-based approaches. With INR methods, a neural network is trained to overfit a…

Computer Vision and Pattern Recognition · Computer Science 2026-01-27 Ho Man Kwan, Ge Gao, Fan Zhang, Andrew Gower, David Bull

PNVC: Towards Practical INR-based Video Compression

Neural video compression has recently demonstrated significant potential to compete with conventional video codecs in terms of rate-quality performance. These learned video codecs are however associated with various issues related to…

Computer Vision and Pattern Recognition · Computer Science 2024-09-04 Ge Gao, Ho Man Kwan, Fan Zhang, David Bull

Generative Latent Video Compression

Perceptual optimization is widely recognized as essential for neural compression, yet balancing the rate-distortion-perception tradeoff remains challenging. This difficulty is especially pronounced in video compression, where frame-wise…

Image and Video Processing · Electrical Eng. & Systems 2025-10-14 Zongyu Guo, Zhaoyang Jia, Jiahao Li, Xiaoyi Zhang, Bin Li, Yan Lu

Rethinking Generative Human Video Coding with Implicit Motion Transformation

Beyond traditional hybrid-based video codec, generative video codec could achieve promising compression performance by evolving high-dimensional signals into compact feature representations for bitstream compactness at the encoder side and…

Computer Vision and Pattern Recognition · Computer Science 2025-06-13 Bolin Chen, Ru-Ling Liao, Jie Chen, Yan Ye

ProGIC: Progressive and Lightweight Generative Image Compression with Residual Vector Quantization

Recent advances in generative image compression (GIC) have delivered remarkable improvements in perceptual quality. However, many GICs rely on large-scale and rigid models, which severely constrain their utility for flexible transmission…

Computer Vision and Pattern Recognition · Computer Science 2026-05-25 Hao Cao, Chengbin Liang, Wenqi Guo, Zhijin Qin, Jungong Han

RQAT-INR: Improved Implicit Neural Image Compression

Deep variational autoencoders for image and video compression have gained significant attraction in the recent years, due to their potential to offer competitive or better compression rates compared to the decades long traditional codecs…

Computer Vision and Pattern Recognition · Computer Science 2023-03-07 Bharath Bhushan Damodaran, Muhammet Balcilar, Franck Galpin, Pierre Hellier

ProGVC: Progressive-based Generative Video Compression via Auto-Regressive Context Modeling

Perceptual video compression leverages generative priors to reconstruct realistic textures and motions at low bitrates. However, existing perceptual codecs often lack native support for variable bitrate and progressive delivery, and their…

Computer Vision and Pattern Recognition · Computer Science 2026-03-19 Daowen Li, Ruixiao Dong, Ying Chen, Kai Li, Ding Ding, Li Li

Free-GVC: Towards Training-Free Extreme Generative Video Compression with Temporal Coherence

Building on recent advances in video generation, generative video compression has emerged as a new paradigm for achieving visually pleasing reconstructions. However, existing methods exhibit limited exploitation of temporal correlations,…

Computer Vision and Pattern Recognition · Computer Science 2026-02-11 Xiaoyue Ling, Chuqin Zhou, Chunyi Li, Yunuo Chen, Yuan Tian, Guo Lu, Wenjun Zhang

Implicit-explicit Integrated Representations for Multi-view Video Compression

With the increasing consumption of 3D displays and virtual reality, multi-view video has become a promising format. However, its high resolution and multi-camera shooting result in a substantial increase in data volume, making storage and…

Computer Vision and Pattern Recognition · Computer Science 2023-11-30 Chen Zhu, Guo Lu, Bing He, Rong Xie, Li Song

HiNeRV: Video Compression with Hierarchical Encoding-based Neural Representation

Learning-based video compression is currently a popular research topic, offering the potential to compete with conventional standard video codecs. In this context, Implicit Neural Representations (INRs) have previously been used to…

Image and Video Processing · Electrical Eng. & Systems 2024-06-11 Ho Man Kwan, Ge Gao, Fan Zhang, Andrew Gower, David Bull

Beyond GFVC: A Progressive Face Video Compression Framework with Adaptive Visual Tokens

Recently, deep generative models have greatly advanced the progress of face video coding towards promising rate-distortion performance and diverse application functionalities. Beyond traditional hybrid video coding paradigms, Generative…

Image and Video Processing · Electrical Eng. & Systems 2024-10-14 Bolin Chen, Shanzhi Yin, Zihan Zhang, Jie Chen, Ru-Ling Liao, Lingyu Zhu, Shiqi Wang, Yan Ye

Generative Neural Video Compression via Video Diffusion Prior

We present GNVC-VD, the first DiT-based generative neural video compression framework built upon an advanced video generation foundation model, where spatio-temporal latent compression and sequence-level generative refinement are unified…

Computer Vision and Pattern Recognition · Computer Science 2026-02-24 Qi Mao, Hao Cheng, Tinghan Yang, Libiao Jin, Siwei Ma

Generative Video Compression with One-Dimensional Latent Representation

Recent advancements in generative video codec (GVC) typically encode video into a 2D latent grid and employ high-capacity generative decoders for reconstruction. However, this paradigm still leaves two key challenges in fully exploiting…

Computer Vision and Pattern Recognition · Computer Science 2026-03-17 Zihan Zheng, Zhaoyang Jia, Naifu Xue, Jiahao Li, Bin Li, Zongyu Guo, Xiaoyi Zhang, Zhenghao Chen, Houqiang Li, Yan Lu

Interactive Face Video Coding: A Generative Compression Framework

In this paper, we propose a novel framework for Interactive Face Video Coding (IFVC), which allows humans to interact with the intrinsic visual representations instead of the signals. The proposed solution enjoys several distinct…

Computer Vision and Pattern Recognition · Computer Science 2025-09-19 Bolin Chen, Zhao Wang, Binzhe Li, Shurun Wang, Shiqi Wang, Yan Ye

Modality-Agnostic Variational Compression of Implicit Neural Representations

We introduce a modality-agnostic neural compression algorithm based on a functional view of data and parameterised as an Implicit Neural Representation (INR). Bridging the gap between latent coding and sparsity, we obtain compact latent…

Machine Learning · Statistics 2023-04-10 Jonathan Richard Schwarz, Jihoon Tack, Yee Whye Teh, Jaeho Lee, Jinwoo Shin

Generative Video Compression: Towards 0.01% Compression Rate for Video Transmission

Whether a video can be compressed at an extreme compression rate as low as 0.01%? To this end, we achieve the compression rate as 0.02% at some cases by introducing Generative Video Compression (GVC), a new framework that redefines the…

Image and Video Processing · Electrical Eng. & Systems 2026-02-03 Xiangyu Chen, Jixiang Luo, Jingyu Xu, Fangqiu Yi, Chi Zhang, Xuelong Li

NIRVANA: Neural Implicit Representations of Videos with Adaptive Networks and Autoregressive Patch-wise Modeling

Implicit Neural Representations (INR) have recently shown to be powerful tool for high-quality video compression. However, existing works are limiting as they do not explicitly exploit the temporal redundancy in videos, leading to a long…

Computer Vision and Pattern Recognition · Computer Science 2023-01-02 Shishira R Maiya, Sharath Girish, Max Ehrlich, Hanyu Wang, Kwot Sin Lee, Patrick Poirson, Pengxiang Wu, Chen Wang, Abhinav Shrivastava

GIVT: Generative Infinite-Vocabulary Transformers

We introduce Generative Infinite-Vocabulary Transformers (GIVT) which generate vector sequences with real-valued entries, instead of discrete tokens from a finite vocabulary. To this end, we propose two surprisingly simple modifications to…

Computer Vision and Pattern Recognition · Computer Science 2024-07-18 Michael Tschannen, Cian Eastwood, Fabian Mentzer