Related papers: Parallel Multiscale Autoregressive Density Estimat…

PixelCNN Models with Auxiliary Variables for Natural Image Modeling

We study probabilistic models of natural images and extend the autoregressive family of PixelCNN architectures by incorporating auxiliary variables. Subsequently, we describe two new generative image models that exploit different image…

Computer Vision and Pattern Recognition · Computer Science 2017-07-04 Alexander Kolesnikov, Christoph H. Lampert

Conditional Image Generation with PixelCNN Decoders

This work explores conditional image generation with a new image density model based on the PixelCNN architecture. The model can be conditioned on any vector, including descriptive labels or tags, or latent embeddings created by other…

Computer Vision and Pattern Recognition · Computer Science 2016-06-21 Aaron van den Oord, Nal Kalchbrenner, Oriol Vinyals, Lasse Espeholt, Alex Graves, Koray Kavukcuoglu

Spatial PixelCNN: Generating Images from Patches

In this paper we propose Spatial PixelCNN, a conditional autoregressive model that generates images from small patches. By conditioning on a grid of pixel coordinates and global features extracted from a Variational Autoencoder (VAE), we…

Computer Vision and Pattern Recognition · Computer Science 2017-12-05 Nader Akoury, Anh Nguyen

PixelSNAIL: An Improved Autoregressive Generative Model

Autoregressive generative models consistently achieve the best results in density estimation tasks involving high dimensional data, such as images or audio. They pose density estimation as a sequence modeling task, where a recurrent neural…

Machine Learning · Computer Science 2017-12-29 Xi Chen, Nikhil Mishra, Mostafa Rohaninejad, Pieter Abbeel

PixelRNN: In-pixel Recurrent Neural Networks for End-to-end-optimized Perception with Neural Sensors

Conventional image sensors digitize high-resolution images at fast frame rates, producing a large amount of data that needs to be transmitted off the sensor for further processing. This is challenging for perception systems operating on…

Computer Vision and Pattern Recognition · Computer Science 2023-04-13 Haley M. So, Laurie Bose, Piotr Dudek, Gordon Wetzstein

PixelCNN++: Improving the PixelCNN with Discretized Logistic Mixture Likelihood and Other Modifications

PixelCNNs are a recently proposed class of powerful generative models with tractable likelihood. Here we discuss our implementation of PixelCNNs which we make available at https://github.com/openai/pixel-cnn. Our implementation contains a…

Machine Learning · Computer Science 2017-01-24 Tim Salimans, Andrej Karpathy, Xi Chen, Diederik P. Kingma

PixelNN: Example-based Image Synthesis

We present a simple nearest-neighbor (NN) approach that synthesizes high-frequency photorealistic images from an "incomplete" signal such as a low-resolution image, a surface normal map, or edges. Current state-of-the-art deep generative…

Computer Vision and Pattern Recognition · Computer Science 2017-08-18 Aayush Bansal, Yaser Sheikh, Deva Ramanan

Pixel Recurrent Neural Networks

Modeling the distribution of natural images is a landmark problem in unsupervised learning. This task requires an image model that is at once expressive, tractable and scalable. We present a deep neural network that sequentially predicts…

Computer Vision and Pattern Recognition · Computer Science 2016-08-22 Aaron van den Oord, Nal Kalchbrenner, Koray Kavukcuoglu

Pixel Recursive Super Resolution

We present a pixel recursive super resolution model that synthesizes realistic details into images while enhancing their resolution. A low resolution image may correspond to multiple plausible high resolution images, thus modeling the super…

Computer Vision and Pattern Recognition · Computer Science 2017-03-23 Ryan Dahl, Mohammad Norouzi, Jonathon Shlens

Few-shot Autoregressive Density Estimation: Towards Learning to Learn Distributions

Deep autoregressive models have shown state-of-the-art performance in density estimation for natural images on large-scale datasets such as ImageNet. However, such models require many thousands of gradient-based weight updates and unique…

Neural and Evolutionary Computing · Computer Science 2018-03-01 Scott Reed, Yutian Chen, Thomas Paine, Aäron van den Oord, S. M. Ali Eslami, Danilo Rezende, Oriol Vinyals, Nando de Freitas

Efficient Convolutional Neural Networks for Pixelwise Classification on Heterogeneous Hardware Systems

This work presents and analyzes three convolutional neural network (CNN) models for efficient pixelwise classification of images. When using convolutional neural networks to classify single pixels in patches of a whole image, a lot of…

Computer Vision and Pattern Recognition · Computer Science 2015-09-14 Fabian Tschopp

Generating High Fidelity Images with Subscale Pixel Networks and Multidimensional Upscaling

The unconditional generation of high fidelity images is a longstanding benchmark for testing the performance of image decoders. Autoregressive image models have been able to generate small images unconditionally, but the extension of these…

Computer Vision and Pattern Recognition · Computer Science 2018-12-06 Jacob Menick, Nal Kalchbrenner

An attempt to generate new bridge types from latent space of PixelCNN

Try to generate new bridge types using generative artificial intelligence technology. Using symmetric structured image dataset of three-span beam bridge, arch bridge, cable-stayed bridge and suspension bridge, based on Python programming…

Machine Learning · Computer Science 2024-01-12 Hongjun Zhang

CNN+CNN: Convolutional Decoders for Image Captioning

Image captioning is a challenging task that combines the field of computer vision and natural language processing. A variety of approaches have been proposed to achieve the goal of automatically describing an image, and recurrent neural…

Computer Vision and Pattern Recognition · Computer Science 2018-05-24 Qingzhong Wang, Antoni B. Chan

Conditional Image Synthesis With Auxiliary Classifier GANs

Synthesizing high resolution photorealistic images has been a long-standing challenge in machine learning. In this paper we introduce new methods for the improved training of generative adversarial networks (GANs) for image synthesis. We…

Machine Learning · Statistics 2017-07-24 Augustus Odena, Christopher Olah, Jonathon Shlens

Class Balanced PixelNet for Neurological Image Segmentation

In this paper, we propose an automatic brain tumor segmentation approach (e.g., PixelNet) using a pixel-level convolutional neural network (CNN). The model extracts features from multiple convolutional layers and concatenates them to form a…

Image and Video Processing · Electrical Eng. & Systems 2022-04-26 Mobarakol Islam, Hongliang Ren

Comparison of Neuronal Attention Models

Recent models for image processing are using the Convolutional neural network (CNN) which requires a pixel per pixel analysis of the input image. This method works well. However, it is time-consuming if we have large images. To increase the…

Machine Learning · Computer Science 2019-12-10 Mohamed Karim Belaid

Bayesian Volumetric Autoregressive generative models for better semisupervised learning

Deep generative models are rapidly gaining traction in medical imaging. Nonetheless, most generative architectures struggle to capture the underlying probability distributions of volumetric data, exhibit convergence problems, and offer no…

Machine Learning · Computer Science 2019-07-29 Guilherme Pombo, Robert Gray, Tom Varsavsky, John Ashburner, Parashkev Nachev

Scene Parsing with Integration of Parametric and Non-parametric Models

We adopt Convolutional Neural Networks (CNNs) to be our parametric model to learn discriminative features and classifiers for local patch classification. Based on the occurrence frequency distribution of classes, an ensemble of CNNs…

Computer Vision and Pattern Recognition · Computer Science 2016-04-21 Bing Shuai, Zhen Zuo, Gang Wang, Bing Wang

Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation

We present Locality-aware Parallel Decoding (LPD) to accelerate autoregressive image generation. Traditional autoregressive image generation relies on next-patch prediction, a memory-bound process that leads to high latency. Existing works…

Computer Vision and Pattern Recognition · Computer Science 2026-03-12 Zhuoyang Zhang, Luke J. Huang, Chengyue Wu, Shang Yang, Kelly Peng, Yao Lu, Song Han