Related papers: Multi-Reference Generative Face Video Compression …

200 papers

Recently, deep generative models have greatly advanced the progress of face video coding towards promising rate-distortion performance and diverse application functionalities. Beyond traditional hybrid video coding paradigms, Generative…

Image and Video Processing · Electrical Eng. & Systems 2024-10-14 Bolin Chen, Shanzhi Yin, Zihan Zhang, Jie Chen, Ru-Ling Liao, Lingyu Zhu, Shiqi Wang, Yan Ye

The rise of deep generative models has greatly advanced video compression, reshaping the paradigm of face video coding through their powerful capability for semantic-aware representation and lifelike synthesis. Generative Face Video Coding…

Computer Vision and Pattern Recognition · Computer Science 2025-06-10 Bolin Chen, Shanzhi Yin, Goluck Konuko, Giuseppe Valenzise, Zihan Zhang, Shiqi Wang, Yan Ye

Generative face video coding (GFVC) is vital for modern applications like video conferencing, yet existing methods primarily focus on video motion while neglecting the significant bitrate contribution of audio. Despite the well-established…

Image and Video Processing · Electrical Eng. & Systems 2025-12-18 Youmin Xu, Mengxi Guo, Shijie Zhao, Weiqi Li, Junlin Li, Li Zhang, Jian Zhang

Perceptual optimization is widely recognized as essential for neural compression, yet balancing the rate-distortion-perception tradeoff remains challenging. This difficulty is especially pronounced in video compression, where frame-wise…

Image and Video Processing · Electrical Eng. & Systems 2025-10-14 Zongyu Guo, Zhaoyang Jia, Jiahao Li, Xiaoyi Zhang, Bin Li, Yan Lu

Generative Face Video Coding (GFVC) achieves superior rate-distortion performance by leveraging the strong inference capabilities of deep generative models. However, its practical deployment is hindered by large model parameters and high…

Computer Vision and Pattern Recognition · Computer Science 2025-08-20 Zihan Zhang, Shanzhi Yin, Bolin Chen, Ru-Ling Liao, Shiqi Wang, Yan Ye

In this paper, we propose a novel framework for Interactive Face Video Coding (IFVC), which allows humans to interact with the intrinsic visual representations instead of the signals. The proposed solution enjoys several distinct…

Computer Vision and Pattern Recognition · Computer Science 2025-09-19 Bolin Chen, Zhao Wang, Binzhe Li, Shurun Wang, Shiqi Wang, Yan Ye

Generative Face Video Coding (GFVC) techniques can exploit the compact representation of facial priors and the strong inference capability of deep generative models, achieving high-quality face video communication in ultra-low bandwidth…

Computer Vision and Pattern Recognition · Computer Science 2023-11-07 Bolin Chen, Jie Chen, Shiqi Wang, Yan Ye

As the latest video coding standard, versatile video coding (VVC) has shown its ability in retaining pixel quality. To excavate more compression potential for video conference scenarios under ultra-low bitrate, this paper proposes a bitrate…

Image and Video Processing · Electrical Eng. & Systems 2023-03-21 Anni Tang, Yan Huang, Jun Ling, Zhiyu Zhang, Yiwei Zhang, Rong Xie, Li Song

This paper proposes a Generative Face Video Compression (GFVC) approach using Supplemental Enhancement Information (SEI), where a series of compact spatial and temporal representations of a face video signal (e.g., 2D/3D keypoints, facial…

Computer Vision and Pattern Recognition · Computer Science 2025-09-19 Bolin Chen, Yan Ye, Jie Chen, Ru-Ling Liao, Shanzhi Yin, Shiqi Wang, Kaifa Yang, Yue Li, Yiling Xu, Ye-Kui Wang, Shiv Gehlot, Guan-Ming Su, Peng Yin, Sean McCarthy, Gary J. Sullivan

Deep generative models, and particularly facial animation schemes, can be used in video conferencing applications to efficiently compress a video through a sparse set of keypoints, without the need to transmit dense motion vectors. While…

Multimedia · Computer Science 2022-07-28 Goluck Konuko, Stéphane Lathuilière, Giuseppe Valenzise

Recent advancements in generative video codec (GVC) typically encode video into a 2D latent grid and employ high-capacity generative decoders for reconstruction. However, this paradigm still leaves two key challenges in fully exploiting…

Computer Vision and Pattern Recognition · Computer Science 2026-03-17 Zihan Zheng, Zhaoyang Jia, Naifu Xue, Jiahao Li, Bin Li, Zongyu Guo, Xiaoyi Zhang, Zhenghao Chen, Houqiang Li, Yan Lu

Learning based video compression attracts increasing attention in the past few years. The previous hybrid coding approaches rely on pixel space operations to reduce spatial and temporal redundancy, which may suffer from inaccurate motion…

Image and Video Processing · Electrical Eng. & Systems 2021-08-24 Zhihao Hu, Guo Lu, Dong Xu

Existing deep facial animation coding techniques efficiently compress talking head videos by applying deep generative models. Instead of compressing the entire video sequence, these methods focus on compressing only the keyframe and the…

Image and Video Processing · Electrical Eng. & Systems 2025-03-14 Riku Takahashi, Ryugo Morita, Fuma Kimishima, Kosuke Iwama, Jinjia Zhou

We address the problem of efficiently compressing video for conferencing-type applications. We build on recent approaches based on image animation, which can achieve good reconstruction quality at very low bitrate by representing face…

Computer Vision and Pattern Recognition · Computer Science 2023-07-11 Goluck Konuko, Stéphane Lathuilière, Giuseppe Valenzise

Perceptual video compression leverages generative priors to reconstruct realistic textures and motions at low bitrates. However, existing perceptual codecs often lack native support for variable bitrate and progressive delivery, and their…

Computer Vision and Pattern Recognition · Computer Science 2026-03-19 Daowen Li, Ruixiao Dong, Ying Chen, Kai Li, Ding Ding, Li Li

The Animation-based Generative Codec (AGC) is an emerging paradigm for talking-face video compression. However, deploying its intricate decoder on resource and power-constrained edge devices presents challenges due to numerous parameters,…

Computer Vision and Pattern Recognition · Computer Science 2025-11-13 Rui Wan, Qi Zheng, Ruoyu Zhang, Bu Chen, Jiaming Liu, Min Li, Minge Jing, Jinjia Zhou, Yibo Fan

Recently deep learning-based image compression has shown the potential to outperform traditional codecs. However, most existing methods train multiple networks for multiple bit rates, which increase the implementation complexity. In this…

Computer Vision and Pattern Recognition · Computer Science 2021-01-01 Mohammad Akbari, Jie Liang, Jingning Han, Chengjie Tu

Beyond traditional hybrid-based video codec, generative video codec could achieve promising compression performance by evolving high-dimensional signals into compact feature representations for bitstream compactness at the encoder side and…

Computer Vision and Pattern Recognition · Computer Science 2025-06-13 Bolin Chen, Ru-Ling Liao, Jie Chen, Yan Ye

Building on recent advances in video generation, generative video compression has emerged as a new paradigm for achieving visually pleasing reconstructions. However, existing methods exhibit limited exploitation of temporal correlations,…

Computer Vision and Pattern Recognition · Computer Science 2026-02-11 Xiaoyue Ling, Chuqin Zhou, Chunyi Li, Yunuo Chen, Yuan Tian, Guo Lu, Wenjun Zhang

Most existing approaches for image and video compression perform transform coding in the pixel space to reduce redundancy. However, due to the misalignment between the pixel-space distortion and human perception, such schemes often face the…

Image and Video Processing · Electrical Eng. & Systems 2025-05-23 Linfeng Qi, Zhaoyang Jia, Jiahao Li, Bin Li, Houqiang Li, Yan Lu