English
Related papers

Related papers: Diffusion bridges vector quantized Variational Aut…

200 papers

Learning useful representations without supervision remains a key challenge in machine learning. In this paper, we propose a simple yet powerful generative model that learns such discrete representations. Our model, the Vector…

Machine Learning · Computer Science 2018-05-31 Aaron van den Oord , Oriol Vinyals , Koray Kavukcuoglu

Deep neural networks with discrete latent variables offer the promise of better symbolic reasoning, and learning abstractions that are more useful to new tasks. There has been a surge in interest in discrete latent variable models, however,…

Machine Learning · Computer Science 2018-07-23 Aurko Roy , Ashish Vaswani , Arvind Neelakantan , Niki Parmar

In this paper we demonstrate methods for reliable and efficient training of discrete representation using Vector-Quantized Variational Auto-Encoder models (VQ-VAEs). Discrete latent variable models have been shown to learn nontrivial…

As a widely recognized approach to deep generative modeling, Variational Auto-Encoders (VAEs) still face challenges with the quality of generated images, often presenting noticeable blurriness. This issue stems from the unrealistic…

Machine Learning · Computer Science 2023-05-22 Georgios Batzolis , Jan Stanczuk , Carola-Bibiane Schönlieb

Learning deep discrete latent presentations offers a promise of better symbolic and summarized abstractions that are more useful to subsequent downstream tasks. Inspired by the seminal Vector Quantized Variational Auto-Encoder (VQ-VAE),…

Machine Learning · Computer Science 2023-06-21 Tung-Long Vuong , Trung Le , He Zhao , Chuanxia Zheng , Mehrtash Harandi , Jianfei Cai , Dinh Phung

Diffusion-based representation learning has achieved substantial attention due to its promising capabilities in latent representation and sample generation. Recent studies have employed an auxiliary encoder to identify a corresponding…

Machine Learning · Computer Science 2025-03-11 Yeongmin Kim , Kwanghyeon Lee , Minsang Park , Byeonghu Na , Il-Chul Moon

We explore the use of Vector Quantized Variational AutoEncoder (VQ-VAE) models for large scale image generation. To this end, we scale and enhance the autoregressive priors used in VQ-VAE to generate synthetic samples of much higher…

Machine Learning · Computer Science 2019-06-04 Ali Razavi , Aaron van den Oord , Oriol Vinyals

Image generative models can learn the distributions of the training data and consequently generate examples by sampling from these distributions. However, when the training dataset is corrupted with outliers, generative models will likely…

Machine Learning · Computer Science 2022-09-21 Chieh-Hsin Lai , Dongmian Zou , Gilad Lerman

Variational Autoencoders (VAEs) are well-established as a principled approach to probabilistic unsupervised learning with neural networks. Typically, an encoder network defines the parameters of a Gaussian distributed latent space from…

Machine Learning · Computer Science 2025-05-16 Alan Jeffares , Liyuan Liu

An important task in quantum generative machine learning is to model the probability distribution of measurements of many-body quantum systems. Classical generative models, such as generative adversarial networks (GANs) and variational…

Quantum Physics · Physics 2023-05-19 Anantha Rao , Dhiraj Madan , Anupama Ray , Dhinakaran Vinayagamurthy , M. S. Santhanam

The vector quantization is a widely used method to map continuous representation to discrete space and has important application in tokenization for generative mode, bottlenecking information and many other tasks in machine learning. Vector…

Machine Learning · Computer Science 2024-10-15 Mingyuan Yan , Jiawei Wu , Rushi Shah , Dianbo Liu

Paradoxically, a Variational Autoencoder (VAE) could be pushed in two opposite directions, utilizing powerful decoder model for generating realistic images but collapsing the learned representation, or increasing regularization coefficient…

Machine Learning · Computer Science 2022-03-30 Trung Ngo , Najwa Laabid , Ville Hautamäki , Merja Heinäniemi

We present the vector quantized diffusion (VQ-Diffusion) model for text-to-image generation. This method is based on a vector quantized variational autoencoder (VQ-VAE) whose latent space is modeled by a conditional variant of the recently…

Computer Vision and Pattern Recognition · Computer Science 2022-03-04 Shuyang Gu , Dong Chen , Jianmin Bao , Fang Wen , Bo Zhang , Dongdong Chen , Lu Yuan , Baining Guo

The integration of Vector Quantised Variational AutoEncoder (VQ-VAE) with autoregressive models as generation part has yielded high-quality results on image generation. However, the autoregressive models will strictly follow the progressive…

Computer Vision and Pattern Recognition · Computer Science 2024-03-01 Minghui Hu , Yujie Wang , Tat-Jen Cham , Jianfei Yang , P. N. Suganthan

Recent advancements in learning Discrete Representations as opposed to continuous ones have led to state of art results in tasks that involve Language, Audio and Vision. Some latent factors such as words, phonemes and shapes are better…

Machine Learning · Computer Science 2020-04-14 Iordanis Fostiropoulos

Sampling trajectories from a distribution followed by ranking them based on a specified cost function is a common approach in autonomous driving. Typically, the sampling distribution is hand-crafted (e.g a Gaussian, or a grid). Recently,…

Robotics · Computer Science 2024-04-26 Simon Idoko , Basant Sharma , Arun Kumar Singh

Variational autoencoders (VAE) represent a popular, flexible form of deep generative model that can be stochastically fit to samples from a given random process using an information-theoretic variational bound on the true underlying…

Machine Learning · Computer Science 2019-10-08 Bin Dai , Yu Wang , John Aston , Gang Hua , David Wipf

Most visual generative models compress images into a latent space before applying diffusion or autoregressive modelling. Yet, existing approaches such as VAEs and foundation model aligned encoders implicitly constrain the latent space…

Computer Vision and Pattern Recognition · Computer Science 2025-12-09 Sen Ye , Jianning Pei , Mengde Xu , Shuyang Gu , Chunyu Wang , Liwei Wang , Han Hu

Data-driven flow-field reconstruction typically relies on autoencoder architectures that compress high-dimensional states into low-dimensional latent representations. However, classical approaches such as variational autoencoders (VAEs)…

Machine Learning · Computer Science 2026-01-14 AmirPouya Hemmasian , Amir Barati Farimani

Generative learning models in medical research are crucial in developing training data for deep learning models and advancing diagnostic tools, but the problem of high-quality, diverse images is an open topic of research. Quantum-enhanced…

Quantum Physics · Physics 2025-08-14 Kübra Yeter-Aydeniz , Nora M. Bauer , Pranay Jain , Max Masnick
‹ Prev 1 2 3 10 Next ›