Related papers: Interactive Face Video Coding: A Generative Compre…

Beyond GFVC: A Progressive Face Video Compression Framework with Adaptive Visual Tokens

Recently, deep generative models have greatly advanced the progress of face video coding towards promising rate-distortion performance and diverse application functionalities. Beyond traditional hybrid video coding paradigms, Generative…

Image and Video Processing · Electrical Eng. & Systems 2024-10-14 Bolin Chen , Shanzhi Yin , Zihan Zhang , Jie Chen , Ru-Ling Liao , Lingyu Zhu , Shiqi Wang , Yan Ye

Generative Models at the Frontier of Compression: A Survey on Generative Face Video Coding

The rise of deep generative models has greatly advanced video compression, reshaping the paradigm of face video coding through their powerful capability for semantic-aware representation and lifelike synthesis. Generative Face Video Coding…

Computer Vision and Pattern Recognition · Computer Science 2025-06-10 Bolin Chen , Shanzhi Yin , Goluck Konuko , Giuseppe Valenzise , Zihan Zhang , Shiqi Wang , Yan Ye

Compressing Human Body Video with Interactive Semantics: A Generative Approach

In this paper, we propose to compress human body video with interactive semantics, which can facilitate video coding to be interactive and controllable by manipulating semantic-level representations embedded in the coded bitstream. In…

Image and Video Processing · Electrical Eng. & Systems 2025-05-23 Bolin Chen , Shanzhi Yin , Hanwei Zhu , Lingyu Zhu , Zihan Zhang , Jie Chen , Ru-Ling Liao , Shiqi Wang , Yan Ye

Generative Face Video Coding Techniques and Standardization Efforts: A Review

Generative Face Video Coding (GFVC) techniques can exploit the compact representation of facial priors and the strong inference capability of deep generative models, achieving high-quality face video communication in ultra-low bandwidth…

Computer Vision and Pattern Recognition · Computer Science 2023-11-07 Bolin Chen , Jie Chen , Shiqi Wang , Yan Ye

Audio-Visual Cross-Modal Compression for Generative Face Video Coding

Generative face video coding (GFVC) is vital for modern applications like video conferencing, yet existing methods primarily focus on video motion while neglecting the significant bitrate contribution of audio. Despite the well-established…

Image and Video Processing · Electrical Eng. & Systems 2025-12-18 Youmin Xu , Mengxi Guo , Shijie Zhao , Weiqi Li , Junlin Li , Li Zhang , Jian Zhang

Generative Compression for Face Video: A Hybrid Scheme

As the latest video coding standard, versatile video coding (VVC) has shown its ability in retaining pixel quality. To excavate more compression potential for video conference scenarios under ultra-low bitrate, this paper proposes a bitrate…

Image and Video Processing · Electrical Eng. & Systems 2023-03-21 Anni Tang , Yan Huang , Jun Ling , Zhiyu Zhang , Yiwei Zhang , Rong Xie , Li Song

Rethinking Generative Human Video Coding with Implicit Motion Transformation

Beyond traditional hybrid-based video codec, generative video codec could achieve promising compression performance by evolving high-dimensional signals into compact feature representations for bitstream compactness at the encoder side and…

Computer Vision and Pattern Recognition · Computer Science 2025-06-13 Bolin Chen , Ru-Ling Liao , Jie Chen , Yan Ye

Multi-Reference Generative Face Video Compression with Contrastive Learning

Generative face video coding (GFVC) has been demonstrated as a potential approach to low-latency, low bitrate video conferencing. GFVC frameworks achieve an extreme gain in coding efficiency with over 70% bitrate savings when compared to…

Multimedia · Computer Science 2024-09-04 Goluck Konuko , Giuseppe Valenzise

VNVC: A Versatile Neural Video Coding Framework for Efficient Human-Machine Vision

Almost all digital videos are coded into compact representations before being transmitted. Such compact representations need to be decoded back to pixels before being displayed to humans and - as usual - before being enhanced/analyzed by…

Image and Video Processing · Electrical Eng. & Systems 2023-11-03 Xihua Sheng , Li Li , Dong Liu , Houqiang Li

FVC: A New Framework towards Deep Video Compression in Feature Space

Learning based video compression attracts increasing attention in the past few years. The previous hybrid coding approaches rely on pixel space operations to reduce spatial and temporal redundancy, which may suffer from inaccurate motion…

Image and Video Processing · Electrical Eng. & Systems 2021-08-24 Zhihao Hu , Guo Lu , Dong Xu

Standardizing Generative Face Video Compression using Supplemental Enhancement Information

This paper proposes a Generative Face Video Compression (GFVC) approach using Supplemental Enhancement Information (SEI), where a series of compact spatial and temporal representations of a face video signal (e.g., 2D/3D keypoints, facial…

Computer Vision and Pattern Recognition · Computer Science 2025-09-19 Bolin Chen , Yan Ye , Jie Chen , Ru-Ling Liao , Shanzhi Yin , Shiqi Wang , Kaifa Yang , Yue Li , Yiling Xu , Ye-Kui Wang , Shiv Gehlot , Guan-Ming Su , Peng Yin , Sean McCarthy , Gary J. Sullivan

A Hybrid Deep Animation Codec for Low-bitrate Video Conferencing

Deep generative models, and particularly facial animation schemes, can be used in video conferencing applications to efficiently compress a video through a sparse set of keypoints, without the need to transmit dense motion vectors. While…

Multimedia · Computer Science 2022-07-28 Goluck Konuko , Stéphane Lathuilière , Giuseppe Valenzise

I$^2$VC: A Unified Framework for Intra- & Inter-frame Video Compression

Video compression aims to reconstruct seamless frames by encoding the motion and residual information from existing frames. Previous neural video compression methods necessitate distinct codecs for three types of frames (I-frame, P-frame…

Image and Video Processing · Electrical Eng. & Systems 2024-06-04 Meiqin Liu , Chenming Xu , Yukai Gu , Chao Yao , Yao Zhao

Semantic Face Compression for Metaverse: A Compact 3D Descriptor Based Approach

In this letter, we envision a new metaverse communication paradigm for virtual avatar faces, and develop the semantic face compression with compact 3D facial descriptors. The fundamental principle is that the communication of virtual avatar…

Computer Vision and Pattern Recognition · Computer Science 2023-11-23 Binzhe Li , Bolin Chen , Zhao Wang , Shiqi Wang , Yan Ye

Implicit-explicit Integrated Representations for Multi-view Video Compression

With the increasing consumption of 3D displays and virtual reality, multi-view video has become a promising format. However, its high resolution and multi-camera shooting result in a substantial increase in data volume, making storage and…

Computer Vision and Pattern Recognition · Computer Science 2023-11-30 Chen Zhu , Guo Lu , Bing He , Rong Xie , Li Song

A Lightweight Dual-Mode Optimization for Generative Face Video Coding

Generative Face Video Coding (GFVC) achieves superior rate-distortion performance by leveraging the strong inference capabilities of deep generative models. However, its practical deployment is hindered by large model parameters and high…

Computer Vision and Pattern Recognition · Computer Science 2025-08-20 Zihan Zhang , Shanzhi Yin , Bolin Chen , Ru-Ling Liao , Shiqi Wang , Yan Ye

Perceptual Quality Assessment of Face Video Compression: A Benchmark and An Effective Method

Recent years have witnessed an exponential increase in the demand for face video compression, and the success of artificial intelligence has expanded the boundaries beyond traditional hybrid video coding. Generative coding approaches have…

Image and Video Processing · Electrical Eng. & Systems 2023-10-31 Yixuan Li , Bolin Chen , Baoliang Chen , Meng Wang , Shiqi Wang , Weisi Lin

Predictive Coding For Animation-Based Video Compression

We address the problem of efficiently compressing video for conferencing-type applications. We build on recent approaches based on image animation, which can achieve good reconstruction quality at very low bitrate by representing face…

Computer Vision and Pattern Recognition · Computer Science 2023-07-11 Goluck Konuko , Stéphane Lathuilière , Giuseppe Valenzise

M3-CVC: Controllable Video Compression with Multimodal Generative Models

Traditional and neural video codecs commonly encounter limitations in controllability and generality under ultra-low-bitrate coding scenarios. To overcome these challenges, we propose M3-CVC, a controllable video compression framework…

Image and Video Processing · Electrical Eng. & Systems 2024-12-30 Rui Wan , Qi Zheng , Yibo Fan

FAIVConf: Face enhancement for AI-based Video Conference with Low Bit-rate

Recently, high-quality video conferencing with fewer transmission bits has become a very hot and challenging problem. We propose FAIVConf, a specially designed video compression framework for video conferencing, based on the effective…

Image and Video Processing · Electrical Eng. & Systems 2022-07-12 Zhengang Li , Sheng Lin , Shan Liu , Songnan Li , Xue Lin , Wei Wang , Wei Jiang