Related papers: Video Coding Using Learned Latent GAN Compression

Video2StyleGAN: Encoding Video in Latent Space for Manipulation

Many recent works have been proposed for face image editing by leveraging the latent space of pretrained GANs. However, few attempts have been made to directly apply them to videos, because 1) they do not guarantee temporal consistency, 2)…

Computer Vision and Pattern Recognition · Computer Science 2022-06-28 Jiyang Yu , Jingen Liu , Jing Huang , Wei Zhang , Tao Mei

Scalable Face Image Coding via StyleGAN Prior: Towards Compression for Human-Machine Collaborative Vision

The accelerated proliferation of visual content and the rapid development of machine vision technologies bring significant challenges in delivering visual data on a gigantic scale, which shall be effectively represented to satisfy both…

Computer Vision and Pattern Recognition · Computer Science 2023-12-27 Qi Mao , Chongyu Wang , Meng Wang , Shiqi Wang , Ruijie Chen , Libiao Jin , Siwei Ma

End-to-End Learning for Video Frame Compression with Self-Attention

One of the core components of conventional (i.e., non-learned) video codecs consists of predicting a frame from a previously-decoded frame, by leveraging temporal correlations. In this paper, we propose an end-to-end learned system for…

Image and Video Processing · Electrical Eng. & Systems 2020-04-22 Nannan Zou , Honglei Zhang , Francesco Cricri , Hamed R. Tavakoli , Jani Lainema , Emre Aksu , Miska Hannuksela , Esa Rahtu

Feature-Style Encoder for Style-Based GAN Inversion

We propose a novel architecture for GAN inversion, which we call Feature-Style encoder. The style encoder is key for the manipulation of the obtained latent codes, while the feature encoder is crucial for optimal image reconstruction. Our…

Computer Vision and Pattern Recognition · Computer Science 2022-02-07 Xu Yao , Alasdair Newson , Yann Gousseau , Pierre Hellier

Learned Disentangled Latent Representations for Scalable Image Coding for Humans and Machines

As an increasing amount of image and video content will be analyzed by machines, there is demand for a new codec paradigm that is capable of compressing visual input primarily for the purpose of computer vision inference, while secondarily…

Image and Video Processing · Electrical Eng. & Systems 2023-01-12 Ezgi Ozyilkan , Mateen Ulhaq , Hyomin Choi , Fabien Racape

Conditional Entropy Coding for Efficient Video Compression

We propose a very simple and efficient video compression framework that only focuses on modeling the conditional entropy between frames. Unlike prior learning-based approaches, we reduce complexity by not performing any form of explicit…

Image and Video Processing · Electrical Eng. & Systems 2020-08-24 Jerry Liu , Shenlong Wang , Wei-Chiu Ma , Meet Shah , Rui Hu , Pranaab Dhawan , Raquel Urtasun

Expressive Talking Head Video Encoding in StyleGAN2 Latent-Space

While the recent advances in research on video reenactment have yielded promising results, the approaches fall short in capturing the fine, detailed, and expressive facial features (e.g., lip-pressing, mouth puckering, mouth gaping, and…

Computer Vision and Pattern Recognition · Computer Science 2023-02-15 Trevine Oorloff , Yaser Yacoob

Perceptual Learned Video Compression with Recurrent Conditional GAN

This paper proposes a Perceptual Learned Video Compression (PLVC) approach with recurrent conditional GAN. We employ the recurrent auto-encoder-based compression network as the generator, and most importantly, we propose a recurrent…

Image and Video Processing · Electrical Eng. & Systems 2022-05-03 Ren Yang , Radu Timofte , Luc Van Gool

Perceptual Quality Improvement in Videoconferencing using Keyframes-based GAN

In the latest years, videoconferencing has taken a fundamental role in interpersonal relations, both for personal and business purposes. Lossy video compression algorithms are the enabling technology for videoconferencing, as they reduce…

Computer Vision and Pattern Recognition · Computer Science 2023-11-09 Lorenzo Agnolucci , Leonardo Galteri , Marco Bertini , Alberto Del Bimbo

Interpreting the Latent Space of GANs for Semantic Face Editing

Despite the recent advance of Generative Adversarial Networks (GANs) in high-fidelity image synthesis, there lacks enough understanding of how GANs are able to map a latent code sampled from a random distribution to a photo-realistic image.…

Computer Vision and Pattern Recognition · Computer Science 2020-04-01 Yujun Shen , Jinjin Gu , Xiaoou Tang , Bolei Zhou

Collaborative Learning for Faster StyleGAN Embedding

The latent code of the recent popular model StyleGAN has learned disentangled representations thanks to the multi-layer style-based generator. Embedding a given image back to the latent space of StyleGAN enables wide interesting semantic…

Computer Vision and Pattern Recognition · Computer Science 2020-07-06 Shanyan Guan , Ying Tai , Bingbing Ni , Feida Zhu , Feiyue Huang , Xiaokang Yang

Generative Latent Video Compression

Perceptual optimization is widely recognized as essential for neural compression, yet balancing the rate-distortion-perception tradeoff remains challenging. This difficulty is especially pronounced in video compression, where frame-wise…

Image and Video Processing · Electrical Eng. & Systems 2025-10-14 Zongyu Guo , Zhaoyang Jia , Jiahao Li , Xiaoyi Zhang , Bin Li , Yan Lu

Neural Video Compression using GANs for Detail Synthesis and Propagation

We present the first neural video compression method based on generative adversarial networks (GANs). Our approach significantly outperforms previous neural and non-neural video compression methods in a user study, setting a new…

Image and Video Processing · Electrical Eng. & Systems 2022-07-13 Fabian Mentzer , Eirikur Agustsson , Johannes Ballé , David Minnen , Nick Johnston , George Toderici

Video Compression Coding via Colorization: A Generative Adversarial Network (GAN)-Based Approach

Under the limited storage, computing and network bandwidth resources, the video compression coding technology plays an important role for visual communication. To efficiently compress raw video data, a colorization-based video compression…

Image and Video Processing · Electrical Eng. & Systems 2019-12-24 Zhaoqing Pan , Feng Yuan , Jianjun Lei , Sam Kwong

One-Shot Face Video Re-enactment using Hybrid Latent Spaces of StyleGAN2

While recent research has progressively overcome the low-resolution constraint of one-shot face video re-enactment with the help of StyleGAN's high-fidelity portrait generation, these approaches rely on at least one of the following:…

Computer Vision and Pattern Recognition · Computer Science 2023-02-16 Trevine Oorloff , Yaser Yacoob

Adversarial Video Compression Guided by Soft Edge Detection

We propose a video compression framework using conditional Generative Adversarial Networks (GANs). We rely on two encoders: one that deploys a standard video codec and another which generates low-level maps via a pipeline of down-sampling,…

Image and Video Processing · Electrical Eng. & Systems 2018-11-28 Sungsoo Kim , Jin Soo Park , Christos G. Bampis , Jaeseong Lee , Mia K. Markey , Alexandros G. Dimakis , Alan C. Bovik

Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative Semantics

Implicit Neural Networks (INRs) have emerged as powerful representations to encode all forms of data, including images, videos, audios, and scenes. With video, many INRs for video have been proposed for the compression task, and recent…

Computer Vision and Pattern Recognition · Computer Science 2024-08-06 Shishira R Maiya , Anubhav Gupta , Matthew Gwilliam , Max Ehrlich , Abhinav Shrivastava

StyleVideoGAN: A Temporal Generative Model using a Pretrained StyleGAN

Generative adversarial models (GANs) continue to produce advances in terms of the visual quality of still images, as well as the learning of temporal correlations. However, few works manage to combine these two interesting capabilities for…

Computer Vision and Pattern Recognition · Computer Science 2021-12-01 Gereon Fox , Ayush Tewari , Mohamed Elgharib , Christian Theobalt

Enhanced Invertible Encoding for Learned Image Compression

Although deep learning based image compression methods have achieved promising progress these days, the performance of these methods still cannot match the latest compression standard Versatile Video Coding (VVC). Most of the recent…

Image and Video Processing · Electrical Eng. & Systems 2021-08-29 Yueqi Xie , Ka Leong Cheng , Qifeng Chen

Modelling Latent Dynamics of StyleGAN using Neural ODEs

In this paper, we propose to model the video dynamics by learning the trajectory of independently inverted latent codes from GANs. The entire sequence is seen as discrete-time observations of a continuous trajectory of the initial latent…

Computer Vision and Pattern Recognition · Computer Science 2023-04-25 Weihao Xia , Yujiu Yang , Jing-Hao Xue