Related papers: Deep Quantization: Encoding Convolutional Activati…

Quantum Down Sampling Filter for Variational Auto-encoder

Variational autoencoders (VAEs) are fundamental for generative modeling and image reconstruction, yet their performance often struggles to maintain high fidelity in reconstructions. This study introduces a hybrid model, quantum variational…

Computer Vision and Pattern Recognition · Computer Science 2025-03-10 Farina Riaz, Fakhar Zaman, Hajime Suzuki, Sharif Abuadbba, David Nguyen

Deep Feature Consistent Variational Autoencoder

We present a novel method for constructing Variational Autoencoder (VAE). Instead of using pixel-by-pixel loss, we enforce deep feature consistency between the input and the output of a VAE, which ensures the VAE's output to preserve the…

Computer Vision and Pattern Recognition · Computer Science 2024-03-21 Xianxu Hou, Linlin Shen, Ke Sun, Guoping Qiu

Deep FisherNet for Object Classification

Despite the great success of convolutional neural networks (CNN) for the image classification task on datasets like Cifar and ImageNet, CNN's representation power is still somewhat limited in dealing with object images that have large…

Computer Vision and Pattern Recognition · Computer Science 2016-08-02 Peng Tang, Xinggang Wang, Baoguang Shi, Xiang Bai, Wenyu Liu, Zhuowen Tu

VP-VAE: Rethinking Vector Quantization via Adaptive Vector Perturbation

Vector Quantized Variational Autoencoders (VQ-VAEs) are fundamental to modern generative modeling, yet they often suffer from training instability and "codebook collapse" due to the inherent coupling of representation learning and discrete…

Machine Learning · Computer Science 2026-02-20 Linwei Zhai, Han Ding, Mingzhi Lin, Cui Zhao, Fei Wang, Ge Wang, Wang Zhi, Wei Xi

Supervised Vector Quantized Variational Autoencoder for Learning Interpretable Global Representations

Learning interpretable representations of data remains a central challenge in deep learning. When training a deep generative model, the observed data are often associated with certain categorical labels, and, in parallel with learning to…

Machine Learning · Computer Science 2019-10-01 Yifan Xue, Michael Ding, Xinghua Lu

One Layer Is Enough: Adapting Pretrained Visual Encoders for Image Generation

Visual generative models (e.g., diffusion models) typically operate in compressed latent spaces to balance training efficiency and sample quality. In parallel, there has been growing interest in leveraging high-quality pre-trained visual…

Computer Vision and Pattern Recognition · Computer Science 2025-12-17 Yuan Gao, Chen Chen, Tianrong Chen, Jiatao Gu

Wavelets to the Rescue: Improving Sample Quality of Latent Variable Deep Generative Models

Variational Autoencoders (VAE) are probabilistic deep generative models underpinned by elegant theory, stable training processes, and meaningful manifold representations. However, they produce blurry images due to a lack of explicit…

Computer Vision and Pattern Recognition · Computer Science 2019-11-15 Prashnna K Gyawali, Rudra Saha, Linwei Wang, VSR Veeravasarapu, Maneesh Singh

Guided Variational Autoencoder for Disentanglement Learning

We propose an algorithm, guided variational autoencoder (Guided-VAE), that is able to learn a controllable generative model by performing latent representation disentanglement learning. The learning objective is achieved by providing…

Computer Vision and Pattern Recognition · Computer Science 2020-04-06 Zheng Ding, Yifan Xu, Weijian Xu, Gaurav Parmar, Yang Yang, Max Welling, Zhuowen Tu

NeVAE: A Deep Generative Model for Molecular Graphs

Deep generative models have been praised for their ability to learn smooth latent representation of images, text, and audio, which can then be used to generate new, plausible data. However, current generative models are unable to work with…

Machine Learning · Computer Science 2019-09-09 Bidisha Samanta, Abir De, Gourhari Jana, Pratim Kumar Chattaraj, Niloy Ganguly, Manuel Gomez-Rodriguez

Deep Fishing: Gradient Features from Deep Nets

Convolutional Networks (ConvNets) have recently improved image recognition performance thanks to end-to-end learning of deep feed-forward models from raw pixels. Deep learning is a marked departure from the previous state of the art, the…

Computer Vision and Pattern Recognition · Computer Science 2015-07-24 Albert Gordo, Adrien Gaidon, Florent Perronnin

Theory and Experiments on Vector Quantized Autoencoders

Deep neural networks with discrete latent variables offer the promise of better symbolic reasoning, and learning abstractions that are more useful to new tasks. There has been a surge in interest in discrete latent variable models, however,…

Machine Learning · Computer Science 2018-07-23 Aurko Roy, Ashish Vaswani, Arvind Neelakantan, Niki Parmar

Variational Auto-Decoder: A Method for Neural Generative Modeling from Incomplete Data

Learning a generative model from partial data (data with missingness) is a challenging area of machine learning research. We study a specific implementation of the Auto-Encoding Variational Bayes (AEVB) algorithm, named in this paper as a…

Machine Learning · Computer Science 2021-01-05 Amir Zadeh, Yao-Chong Lim, Paul Pu Liang, Louis-Philippe Morency

A Quaternion-Valued Variational Autoencoder

Deep probabilistic generative models have achieved incredible success in many fields of application. Among such models, variational autoencoders (VAEs) have proved their ability in modeling a generative process by learning a latent…

Machine Learning · Computer Science 2022-12-16 Eleonora Grassucci, Danilo Comminiello, Aurelio Uncini

Improving Variational Autoencoder with Deep Feature Consistent and Generative Adversarial Training

We present a new method for improving the performances of variational autoencoder (VAE). In addition to enforcing the deep feature consistent principle thus ensuring the VAE output and its corresponding input images to have similar deep…

Computer Vision and Pattern Recognition · Computer Science 2019-06-06 Xianxu Hou, Ke Sun, Linlin Shen, Guoping Qiu

Consistency Regularization for Variational Auto-Encoders

Variational auto-encoders (VAEs) are a powerful approach to unsupervised learning. They enable scalable approximate posterior inference in latent-variable models using variational inference (VI). A VAE posits a variational family…

Machine Learning · Computer Science 2022-06-08 Samarth Sinha, Adji B. Dieng

Diagnosing and Enhancing VAE Models

Although variational autoencoders (VAEs) represent a widely influential deep generative model, many aspects of the underlying energy function remain poorly understood. In particular, it is commonly believed that Gaussian encoder/decoder…

Machine Learning · Computer Science 2019-10-31 Bin Dai, David Wipf

Deep clustering with fusion autoencoder

Embracing the deep learning techniques for representation learning in clustering research has attracted broad attention in recent years, yielding a newly developed clustering paradigm, viz. the deep clustering (DC). Typically, the DC models…

Machine Learning · Computer Science 2022-01-17 Shuai Chang

Variational Deep Embedding: An Unsupervised and Generative Approach to Clustering

Clustering is among the most fundamental tasks in computer vision and machine learning. In this paper, we propose Variational Deep Embedding (VaDE), a novel unsupervised generative clustering approach within the framework of Variational…

Computer Vision and Pattern Recognition · Computer Science 2017-06-29 Zhuxi Jiang, Yin Zheng, Huachun Tan, Bangsheng Tang, Hanning Zhou

DeepCoder: Semi-parametric Variational Autoencoders for Automatic Facial Action Coding

Human face exhibits an inherent hierarchy in its representations (i.e., holistic facial expressions can be encoded via a set of facial action units (AUs) and their intensity). Variational (deep) auto-encoders (VAE) have shown great results…

Computer Vision and Pattern Recognition · Computer Science 2017-08-08 Dieu Linh Tran, Robert Walecki, Ognjen Rudovic, Stefanos Eleftheriadis, Bjørn Schuller, Maja Pantic

Vector Quantized Wasserstein Auto-Encoder

Learning deep discrete latent representations offers a promise of better symbolic and summarized abstractions that are more useful to subsequent downstream tasks. Inspired by the seminal Vector Quantized Variational Auto-Encoder (VQ-VAE),…

Machine Learning · Computer Science 2023-06-21 Tung-Long Vuong, Trung Le, He Zhao, Chuanxia Zheng, Mehrtash Harandi, Jianfei Cai, Dinh Phung