Related papers: Video Coding Using Learned Latent GAN Compression

200 papers

Many recent works have been proposed for face image editing by leveraging the latent space of pretrained GANs. However, few attempts have been made to directly apply them to videos, because 1) they do not guarantee temporal consistency, 2)…

Computer Vision and Pattern Recognition · Computer Science 2022-06-28 Jiyang Yu, Jingen Liu, Jing Huang, Wei Zhang, Tao Mei

The accelerated proliferation of visual content and the rapid development of machine vision technologies bring significant challenges in delivering visual data on a gigantic scale, which shall be effectively represented to satisfy both…

Computer Vision and Pattern Recognition · Computer Science 2023-12-27 Qi Mao, Chongyu Wang, Meng Wang, Shiqi Wang, Ruijie Chen, Libiao Jin, Siwei Ma

One of the core components of conventional (i.e., non-learned) video codecs consists of predicting a frame from a previously-decoded frame, by leveraging temporal correlations. In this paper, we propose an end-to-end learned system for…

Image and Video Processing · Electrical Eng. & Systems 2020-04-22 Nannan Zou, Honglei Zhang, Francesco Cricri, Hamed R. Tavakoli, Jani Lainema, Emre Aksu, Miska Hannuksela, Esa Rahtu

We propose a novel architecture for GAN inversion, which we call Feature-Style encoder. The style encoder is key for the manipulation of the obtained latent codes, while the feature encoder is crucial for optimal image reconstruction. Our…

Computer Vision and Pattern Recognition · Computer Science 2022-02-07 Xu Yao, Alasdair Newson, Yann Gousseau, Pierre Hellier

As an increasing amount of image and video content will be analyzed by machines, there is demand for a new codec paradigm that is capable of compressing visual input primarily for the purpose of computer vision inference, while secondarily…

Image and Video Processing · Electrical Eng. & Systems 2023-01-12 Ezgi Ozyilkan, Mateen Ulhaq, Hyomin Choi, Fabien Racape

We propose a very simple and efficient video compression framework that only focuses on modeling the conditional entropy between frames. Unlike prior learning-based approaches, we reduce complexity by not performing any form of explicit…

Image and Video Processing · Electrical Eng. & Systems 2020-08-24 Jerry Liu, Shenlong Wang, Wei-Chiu Ma, Meet Shah, Rui Hu, Pranaab Dhawan, Raquel Urtasun

While the recent advances in research on video reenactment have yielded promising results, the approaches fall short in capturing the fine, detailed, and expressive facial features (e.g., lip-pressing, mouth puckering, mouth gaping, and…

Computer Vision and Pattern Recognition · Computer Science 2023-02-15 Trevine Oorloff, Yaser Yacoob

This paper proposes a Perceptual Learned Video Compression (PLVC) approach with recurrent conditional GAN. We employ the recurrent auto-encoder-based compression network as the generator, and most importantly, we propose a recurrent…

Image and Video Processing · Electrical Eng. & Systems 2022-05-03 Ren Yang, Radu Timofte, Luc Van Gool

In the latest years, videoconferencing has taken a fundamental role in interpersonal relations, both for personal and business purposes. Lossy video compression algorithms are the enabling technology for videoconferencing, as they reduce…

Computer Vision and Pattern Recognition · Computer Science 2023-11-09 Lorenzo Agnolucci, Leonardo Galteri, Marco Bertini, Alberto Del Bimbo

Despite the recent advance of Generative Adversarial Networks (GANs) in high-fidelity image synthesis, there lacks enough understanding of how GANs are able to map a latent code sampled from a random distribution to a photo-realistic image.…

Computer Vision and Pattern Recognition · Computer Science 2020-04-01 Yujun Shen, Jinjin Gu, Xiaoou Tang, Bolei Zhou

The latent code of the recent popular model StyleGAN has learned disentangled representations thanks to the multi-layer style-based generator. Embedding a given image back to the latent space of StyleGAN enables wide interesting semantic…

Computer Vision and Pattern Recognition · Computer Science 2020-07-06 Shanyan Guan, Ying Tai, Bingbing Ni, Feida Zhu, Feiyue Huang, Xiaokang Yang

Perceptual optimization is widely recognized as essential for neural compression, yet balancing the rate-distortion-perception tradeoff remains challenging. This difficulty is especially pronounced in video compression, where frame-wise…

Image and Video Processing · Electrical Eng. & Systems 2025-10-14 Zongyu Guo, Zhaoyang Jia, Jiahao Li, Xiaoyi Zhang, Bin Li, Yan Lu

We present the first neural video compression method based on generative adversarial networks (GANs). Our approach significantly outperforms previous neural and non-neural video compression methods in a user study, setting a new…

Image and Video Processing · Electrical Eng. & Systems 2022-07-13 Fabian Mentzer, Eirikur Agustsson, Johannes Ballé, David Minnen, Nick Johnston, George Toderici

Under the limited storage, computing and network bandwidth resources, the video compression coding technology plays an important role for visual communication. To efficiently compress raw video data, a colorization-based video compression…

Image and Video Processing · Electrical Eng. & Systems 2019-12-24 Zhaoqing Pan, Feng Yuan, Jianjun Lei, Sam Kwong

While recent research has progressively overcome the low-resolution constraint of one-shot face video re-enactment with the help of StyleGAN's high-fidelity portrait generation, these approaches rely on at least one of the following:…

Computer Vision and Pattern Recognition · Computer Science 2023-02-16 Trevine Oorloff, Yaser Yacoob

We propose a video compression framework using conditional Generative Adversarial Networks (GANs). We rely on two encoders: one that deploys a standard video codec and another which generates low-level maps via a pipeline of down-sampling,…

Image and Video Processing · Electrical Eng. & Systems 2018-11-28 Sungsoo Kim, Jin Soo Park, Christos G. Bampis, Jaeseong Lee, Mia K. Markey, Alexandros G. Dimakis, Alan C. Bovik

Implicit Neural Networks (INRs) have emerged as powerful representations to encode all forms of data, including images, videos, audios, and scenes. With video, many INRs for video have been proposed for the compression task, and recent…

Computer Vision and Pattern Recognition · Computer Science 2024-08-06 Shishira R Maiya, Anubhav Gupta, Matthew Gwilliam, Max Ehrlich, Abhinav Shrivastava

Generative adversarial models (GANs) continue to produce advances in terms of the visual quality of still images, as well as the learning of temporal correlations. However, few works manage to combine these two interesting capabilities for…

Computer Vision and Pattern Recognition · Computer Science 2021-12-01 Gereon Fox, Ayush Tewari, Mohamed Elgharib, Christian Theobalt

Although deep learning based image compression methods have achieved promising progress these days, the performance of these methods still cannot match the latest compression standard Versatile Video Coding (VVC). Most of the recent…

Image and Video Processing · Electrical Eng. & Systems 2021-08-29 Yueqi Xie, Ka Leong Cheng, Qifeng Chen

In this paper, we propose to model the video dynamics by learning the trajectory of independently inverted latent codes from GANs. The entire sequence is seen as discrete-time observations of a continuous trajectory of the initial latent…

Computer Vision and Pattern Recognition · Computer Science 2023-04-25 Weihao Xia, Yujiu Yang, Jing-Hao Xue