Related papers: Generating Images with Sparse Representations

A Diffusion-Based Generative Prior Approach to Sparse-view Computed Tomography

The reconstruction of X-rays CT images from sparse or limited-angle geometries is a highly challenging task. The lack of data typically results in artifacts in the reconstructed image and may even lead to object distortions. For this…

Computer Vision and Pattern Recognition · Computer Science 2026-02-12 Davide Evangelista , Pasquale Cascarano , Elena Loli Piccolomini

Enhancing sparse representation of color images by cross channel transformation

Transformations for enhancing sparsity in the approximation of color images by 2D atomic decomposition are discussed. The sparsity is firstly considered with respect to the most significant coefficients in the wavelet decomposition of the…

Image and Video Processing · Electrical Eng. & Systems 2021-05-17 Laura Rebollo-Neira , Aurelien Inacio

Hierarchical Autoregressive Image Models with Auxiliary Decoders

Autoregressive generative models of images tend to be biased towards capturing local structure, and as a result they often produce samples which are lacking in terms of large-scale coherence. To address this, we propose two methods to learn…

Computer Vision and Pattern Recognition · Computer Science 2019-10-09 Jeffrey De Fauw , Sander Dieleman , Karen Simonyan

Image-Adaptive GAN based Reconstruction

In the recent years, there has been a significant improvement in the quality of samples produced by (deep) generative models such as variational auto-encoders and generative adversarial networks. However, the representation capabilities of…

Image and Video Processing · Electrical Eng. & Systems 2026-03-31 Shady Abu Hussein , Tom Tirer , Raja Giryes

Improved Image Generation via Sparse Modeling

The interest of the deep learning community in image synthesis has grown massively in recent years. Nowadays, deep generative methods, and especially Generative Adversarial Networks (GANs), are leading to state-of-the-art performance,…

Computer Vision and Pattern Recognition · Computer Science 2022-05-16 Roy Ganz , Michael Elad

Efficient Progressive Image Compression with Variance-aware Masking

Learned progressive image compression is gaining momentum as it allows improved image reconstruction as more bits are decoded at the receiver. We propose a progressive image compression method in which an image is first represented as a…

Computer Vision and Pattern Recognition · Computer Science 2025-01-13 Alberto Presta , Enzo Tartaglione , Attilio Fiandrotti , Marco Grangetto , Pamela Cosman

High-Resolution Complex Scene Synthesis with Transformers

The use of coarse-grained layouts for controllable synthesis of complex scene images via deep generative models has recently gained popularity. However, results of current approaches still fall short of their promise of high-resolution…

Computer Vision and Pattern Recognition · Computer Science 2021-05-14 Manuel Jahn , Robin Rombach , Björn Ommer

Wavelet-based Variational Autoencoders for High-Resolution Image Generation

Variational Autoencoders (VAEs) are powerful generative models capable of learning compact latent representations. However, conventional VAEs often generate relatively blurry images due to their assumption of an isotropic Gaussian latent…

Computer Vision and Pattern Recognition · Computer Science 2025-04-21 Andrew Kiruluta

VideoGPT: Video Generation using VQ-VAE and Transformers

We present VideoGPT: a conceptually simple architecture for scaling likelihood based generative modeling to natural videos. VideoGPT uses VQ-VAE that learns downsampled discrete latent representations of a raw video by employing 3D…

Computer Vision and Pattern Recognition · Computer Science 2021-09-16 Wilson Yan , Yunzhi Zhang , Pieter Abbeel , Aravind Srinivas

Lossy Image Compression with Conditional Diffusion Models

This paper outlines an end-to-end optimized lossy image compression framework using diffusion generative models. The approach relies on the transform coding paradigm, where an image is mapped into a latent space for entropy coding and, from…

Image and Video Processing · Electrical Eng. & Systems 2024-01-03 Ruihan Yang , Stephan Mandt

Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization

Existing vector quantization (VQ) based autoregressive models follow a two-stage generation paradigm that first learns a codebook to encode images as discrete codes, and then completes generation based on the learned codebook. However, they…

Computer Vision and Pattern Recognition · Computer Science 2023-05-22 Mengqi Huang , Zhendong Mao , Zhuowei Chen , Yongdong Zhang

Wavelets to the Rescue: Improving Sample Quality of Latent Variable Deep Generative Models

Variational Autoencoders (VAE) are probabilistic deep generative models underpinned by elegant theory, stable training processes, and meaningful manifold representations. However, they produce blurry images due to a lack of explicit…

Computer Vision and Pattern Recognition · Computer Science 2019-11-15 Prashnna K Gyawali , Rudra Saha , Linwei Wang , VSR Veeravasarapu , Maneesh Singh

PixelTransformer: Sample Conditioned Signal Generation

We propose a generative model that can infer a distribution for the underlying spatial signal conditioned on sparse samples e.g. plausible images given a few observed pixels. In contrast to sequential autoregressive generative models, our…

Computer Vision and Pattern Recognition · Computer Science 2021-03-30 Shubham Tulsiani , Abhinav Gupta

Learning Consistent Deep Generative Models from Sparse Data via Prediction Constraints

We develop a new framework for learning variational autoencoders and other deep generative models that balances generative and discriminative goals. Our framework optimizes model parameters to maximize a variational lower bound on the…

Machine Learning · Computer Science 2020-12-15 Gabriel Hope , Madina Abdrakhmanova , Xiaoyin Chen , Michael C. Hughes , Michael C. Hughes , Erik B. Sudderth

Image Generation From Small Datasets via Batch Statistics Adaptation

Thanks to the recent development of deep generative models, it is becoming possible to generate high-quality images with both fidelity and diversity. However, the training of such generative models requires a large dataset. To reduce the…

Computer Vision and Pattern Recognition · Computer Science 2019-10-24 Atsuhiro Noguchi , Tatsuya Harada

PT43D: A Probabilistic Transformer for Generating 3D Shapes from Single Highly-Ambiguous RGB Images

Generating 3D shapes from single RGB images is essential in various applications such as robotics. Current approaches typically target images containing clear and complete visual descriptions of the object, without considering common…

Computer Vision and Pattern Recognition · Computer Science 2024-11-05 Yiheng Xiong , Angela Dai

Fast Compressive Sensing Recovery Using Generative Models with Structured Latent Variables

Deep learning models have significantly improved the visual quality and accuracy on compressive sensing recovery. In this paper, we propose an algorithm for signal reconstruction from compressed measurements with image priors captured by a…

Machine Learning · Computer Science 2020-03-20 Shaojie Xu , Sihan Zeng , Justin Romberg

Unpriortized Autoencoder For Image Generation

In this paper, we treat the image generation task using an autoencoder, a representative latent model. Unlike many studies regularizing the latent variable's distribution by assuming a manually specified prior, we approach the image…

Machine Learning · Computer Science 2021-08-27 Jaeyoung Yoo , Hojun Lee , Nojun Kwak

Generative Image Compression by Estimating Gradients of the Rate-variable Feature Distribution

While learned image compression (LIC) focuses on efficient data transmission, generative image compression (GIC) extends this framework by integrating generative modeling to produce photo-realistic reconstructed images. In this paper, we…

Image and Video Processing · Electrical Eng. & Systems 2025-05-28 Minghao Han , Weiyi You , Jinhua Zhang , Leheng Zhang , Ce Zhu , Shuhang Gu

Noise-robust latent vector reconstruction in ptychography using deep generative models

Computational imaging is increasingly vital for a broad spectrum of applications, ranging from biological to material sciences. This includes applications where the object is known and sufficiently sparse, allowing it to be described with a…

Image and Video Processing · Electrical Eng. & Systems 2024-01-17 Jacob Seifert , Yifeng Shao , Allard P. Mosk