Related papers: Style Quantization for Data-Efficient GAN Training

Feature Quantization Improves GAN Training

The instability in GAN training has been a long-standing problem despite remarkable research efforts. We identify that instability issues stem from difficulties of performing feature matching with mini-batch statistics, due to a fragile…

Machine Learning · Computer Science 2020-07-16 Yang Zhao , Chunyuan Li , Ping Yu , Jianfeng Gao , Changyou Chen

Post-training Model Quantization Using GANs for Synthetic Data Generation

Quantization is a widely adopted technique for deep neural networks to reduce the memory and computational resources required. However, when quantized, most models would need a suitable calibration process to keep their performance intact,…

Computer Vision and Pattern Recognition · Computer Science 2023-05-11 Athanasios Masouris , Mansi Sharma , Adrian Boguszewski , Alexander Kozlov , Zhuo Wu , Raymond Lo

Quantization of Generative Adversarial Networks for Efficient Inference: a Methodological Study

Generative adversarial networks (GANs) have an enormous potential impact on digital content creation, e.g., photo-realistic digital avatars, semantic content editing, and quality enhancement of speech and images. However, the performance of…

Artificial Intelligence · Computer Science 2021-09-01 Pavel Andreev , Alexander Fritzler , Dmitry Vetrov

Analyzing and Improving the Image Quality of StyleGAN

The style-based GAN architecture (StyleGAN) yields state-of-the-art results in data-driven unconditional generative image modeling. We expose and analyze several of its characteristic artifacts, and propose changes in both model…

Computer Vision and Pattern Recognition · Computer Science 2020-03-25 Tero Karras , Samuli Laine , Miika Aittala , Janne Hellsten , Jaakko Lehtinen , Timo Aila

GenQ: Quantization in Low Data Regimes with Generative Synthetic Data

In the realm of deep neural network deployment, low-bit quantization presents a promising avenue for enhancing computational efficiency. However, it often hinges on the availability of training data to mitigate quantization errors, a…

Computer Vision and Pattern Recognition · Computer Science 2024-09-18 Yuhang Li , Youngeun Kim , Donghyun Lee , Souvik Kundu , Priyadarshini Panda

Stylized Projected GAN: A Novel Architecture for Fast and Realistic Image Generation

Generative Adversarial Networks are used for generating the data using a generator and a discriminator, GANs usually produce high-quality images, but training GANs in an adversarial setting is a difficult task. GANs require high computation…

Computer Vision and Pattern Recognition · Computer Science 2023-08-01 Md Nurul Muttakin , Malik Shahid Sultan , Robert Hoehndorf , Hernando Ombao

Latent Style-based Quantum GAN for high-quality Image Generation

Quantum generative modeling is among the promising candidates for achieving a practical advantage in data analysis. Nevertheless, one key challenge is to generate large-size images comparable to those generated by their classical…

Quantum Physics · Physics 2024-06-06 Su Yeon Chang , Supanut Thanasilp , Bertrand Le Saux , Sofia Vallecorsa , Michele Grossi

StyleVideoGAN: A Temporal Generative Model using a Pretrained StyleGAN

Generative adversarial models (GANs) continue to produce advances in terms of the visual quality of still images, as well as the learning of temporal correlations. However, few works manage to combine these two interesting capabilities for…

Computer Vision and Pattern Recognition · Computer Science 2021-12-01 Gereon Fox , Ayush Tewari , Mohamed Elgharib , Christian Theobalt

LASERS: LAtent Space Encoding for Representations with Sparsity for Generative Modeling

Learning compact and meaningful latent space representations has been shown to be very useful in generative modeling tasks for visual data. One particular example is applying Vector Quantization (VQ) in variational autoencoders (VQ-VAEs,…

Machine Learning · Computer Science 2024-09-18 Xin Li , Anand Sarwate

Multi-level Latent Space Structuring for Generative Control

Truncation is widely used in generative models for improving the quality of the generated samples, at the expense of reducing their diversity. We propose to leverage the StyleGAN generative architecture to devise a new truncation technique,…

Computer Vision and Pattern Recognition · Computer Science 2022-02-15 Oren Katzir , Vicky Perepelook , Dani Lischinski , Daniel Cohen-Or

Selectively increasing the diversity of GAN-generated samples

Generative Adversarial Networks (GANs) are powerful models able to synthesize data samples closely resembling the distribution of real data, yet the diversity of those generated samples is limited due to the so-called mode collapse…

Computer Vision and Pattern Recognition · Computer Science 2023-06-26 Jan Dubiński , Kamil Deja , Sandro Wenzel , Przemysław Rokita , Tomasz Trzciński

Scaling Image Tokenizers with Grouped Spherical Quantization

Vision tokenizers have gained a lot of attraction due to their scalability and compactness; previous works depend on old-school GAN-based hyperparameters, biased comparisons, and a lack of comprehensive analysis of the scaling behaviours.…

Computer Vision and Pattern Recognition · Computer Science 2024-12-05 Jiangtao Wang , Zhen Qin , Yifan Zhang , Vincent Tao Hu , Björn Ommer , Rania Briq , Stefan Kesselheim

RAND: Robustness Aware Norm Decay For Quantized Seq2seq Models

With the rapid increase in the size of neural networks, model compression has become an important area of research. Quantization is an effective technique at decreasing the model size, memory access, and compute load of large models.…

Audio and Speech Processing · Electrical Eng. & Systems 2023-05-26 David Qiu , David Rim , Shaojin Ding , Oleg Rybakov , Yanzhang He

GANCompress: GAN-Enhanced Neural Image Compression with Binary Spherical Quantization

The exponential growth of visual data in digital communications has intensified the need for efficient compression techniques that balance rate-distortion performance with computational feasibility. While recent neural compression…

Image and Video Processing · Electrical Eng. & Systems 2025-05-21 Karthik Sivakoti

StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis

Text-to-image synthesis has recently seen significant progress thanks to large pretrained language models, large-scale training data, and the introduction of scalable model families such as diffusion and autoregressive models. However, the…

Machine Learning · Computer Science 2023-01-24 Axel Sauer , Tero Karras , Samuli Laine , Andreas Geiger , Timo Aila

Super-Resolution through StyleGAN Regularized Latent Search: A Realism-Fidelity Trade-off

This paper addresses the problem of super-resolution: constructing a highly resolved (HR) image from a low resolved (LR) one. Recent unsupervised approaches search the latent space of a StyleGAN pre-trained on HR images, for the image that…

Computer Vision and Pattern Recognition · Computer Science 2023-11-29 Marzieh Gheisari , Auguste Genovesio

Robust GAN inversion

Recent advancements in real image editing have been attributed to the exploration of Generative Adversarial Networks (GANs) latent space. However, the main challenge of this procedure is GAN inversion, which aims to map the image to the…

Computer Vision and Pattern Recognition · Computer Science 2023-09-01 Egor Sevriugov , Ivan Oseledets

SC2GAN: Rethinking Entanglement by Self-correcting Correlated GAN Space

Generative Adversarial Networks (GANs) can synthesize realistic images, with the learned latent space shown to encode rich semantic information with various interpretable directions. However, due to the unstructured nature of the learned…

Computer Vision and Pattern Recognition · Computer Science 2023-10-11 Zikun Chen , Han Zhao , Parham Aarabi , Ruowei Jiang

AE-StyleGAN: Improved Training of Style-Based Auto-Encoders

StyleGANs have shown impressive results on data generation and manipulation in recent years, thanks to its disentangled style latent space. A lot of efforts have been made in inverting a pretrained generator, where an encoder is trained ad…

Computer Vision and Pattern Recognition · Computer Science 2021-10-19 Ligong Han , Sri Harsha Musunuri , Martin Renqiang Min , Ruijiang Gao , Yu Tian , Dimitris Metaxas

Style Intervention: How to Achieve Spatial Disentanglement with Style-based Generators?

Generative Adversarial Networks (GANs) with style-based generators (e.g. StyleGAN) successfully enable semantic control over image synthesis, and recent studies have also revealed that interpretable image translations could be obtained by…

Computer Vision and Pattern Recognition · Computer Science 2020-11-20 Yunfan Liu , Qi Li , Zhenan Sun , Tieniu Tan