Related papers: A variational autoencoder-based nonnegative matrix…

A Variational Autoencoder for Probabilistic Non-Negative Matrix Factorisation

We introduce and demonstrate the variational autoencoder (VAE) for probabilistic non-negative matrix factorisation (PAE-NMF). We design a network which can perform non-negative matrix factorisation (NMF) and add in aspects of a VAE to make…

Machine Learning · Computer Science 2019-06-17 Steven Squires , Adam Prügel Bennett , Mahesan Niranjan

A Bayesian Permutation training deep representation learning method for speech enhancement with variational autoencoder

Recently, variational autoencoder (VAE), a deep representation learning (DRL) model, has been used to perform speech enhancement (SE). However, to the best of our knowledge, current VAE-based SE methods only apply VAE to the model speech…

Audio and Speech Processing · Electrical Eng. & Systems 2022-01-25 Yang Xiang , Jesper Lisby Højvang , Morten Højfeldt Rasmussen , Mads Græsbøll Christensen

Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization

This paper presents a statistical method of single-channel speech enhancement that uses a variational autoencoder (VAE) as a prior distribution on clean speech. A standard approach to speech enhancement is to train a deep neural network…

Sound · Computer Science 2019-03-12 Yoshiaki Bando , Masato Mimura , Katsutoshi Itoyama , Kazuyoshi Yoshii , Tatsuya Kawahara

Variational Autoencoder for Personalized Pathological Speech Enhancement

The generalizability of speech enhancement (SE) models across speaker conditions remains largely unexplored, despite its critical importance for broader applicability. This paper investigates the performance of the hybrid variational…

Audio and Speech Processing · Electrical Eng. & Systems 2025-03-19 Mingchi Hou , Ina Kodrasi

Can We Trust Deep Speech Prior?

Recently, speech enhancement (SE) based on deep speech prior has attracted much attention, such as the variational auto-encoder with non-negative matrix factorization (VAE-NMF) architecture. Compared to conventional approaches that…

Sound · Computer Science 2020-11-05 Ying Shi , Haolin Chen , Zhiyuan Tang , Lantian Li , Dong Wang , Jiqing Han

A variance modeling framework based on variational autoencoders for speech enhancement

In this paper we address the problem of enhancing speech signals in noisy mixtures using a source separation approach. We explore the use of neural networks as an alternative to a popular speech variance model based on supervised…

Sound · Computer Science 2019-02-06 Simon Leglaive , Laurent Girin , Radu Horaud

Supervised Dictionary Learning by a Variational Bayesian Group Sparse Nonnegative Matrix Factorization

Nonnegative matrix factorization (NMF) with group sparsity constraints is formulated as a probabilistic graphical model and, assuming some observed data have been generated by the model, a feasible variational Bayesian algorithm is derived…

Computer Vision and Pattern Recognition · Computer Science 2014-05-28 Ivan Ivek

Unsupervised speech enhancement with deep dynamical generative speech and noise models

This work builds on a previous work on unsupervised speech enhancement using a dynamical variational autoencoder (DVAE) as the clean speech model and non-negative matrix factorization (NMF) as the noise model. We propose to replace the NMF…

Audio and Speech Processing · Electrical Eng. & Systems 2023-06-14 Xiaoyu Lin , Simon Leglaive , Laurent Girin , Xavier Alameda-Pineda

A Recurrent Variational Autoencoder for Speech Enhancement

This paper presents a generative approach to speech enhancement based on a recurrent variational autoencoder (RVAE). The deep generative speech model is trained using clean speech signals only, and it is combined with a nonnegative matrix…

Machine Learning · Computer Science 2020-02-11 Simon Leglaive , Xavier Alameda-Pineda , Laurent Girin , Radu Horaud

A Benchmark of Dynamical Variational Autoencoders applied to Speech Spectrogram Modeling

The Variational Autoencoder (VAE) is a powerful deep generative model that is now extensively used to represent high-dimensional complex data via a low-dimensional latent space learned in an unsupervised manner. In the original VAE model,…

Sound · Computer Science 2021-06-15 Xiaoyu Bie , Laurent Girin , Simon Leglaive , Thomas Hueber , Xavier Alameda-Pineda

Variational Autoencoders for Collaborative Filtering

We extend variational autoencoders (VAEs) to collaborative filtering for implicit feedback. This non-linear probabilistic model enables us to go beyond the limited modeling capacity of linear factor models which still largely dominate…

Machine Learning · Statistics 2018-02-19 Dawen Liang , Rahul G. Krishnan , Matthew D. Hoffman , Tony Jebara

Deep Variational Inference Without Pixel-Wise Reconstruction

Variational autoencoders (VAEs), that are built upon deep neural networks have emerged as popular generative models in computer vision. Most of the work towards improving variational autoencoders has focused mainly on making the…

Machine Learning · Statistics 2016-11-17 Siddharth Agrawal , Ambedkar Dukkipati

Unscented Autoencoder

The Variational Autoencoder (VAE) is a seminal approach in deep generative modeling with latent variables. Interpreting its reconstruction process as a nonlinear transformation of samples from the latent posterior distribution, we apply the…

Machine Learning · Computer Science 2023-06-09 Faris Janjoš , Lars Rosenbaum , Maxim Dolgov , J. Marius Zöllner

Audio-visual Speech Enhancement Using Conditional Variational Auto-Encoders

Variational auto-encoders (VAEs) are deep generative latent variable models that can be used for learning the distribution of complex data. VAEs have been successfully used to learn a probabilistic prior over speech signals, which is then…

Sound · Computer Science 2020-12-18 Mostafa Sadeghi , Simon Leglaive , Xavier Alameda-PIneda , Laurent Girin , Radu Horaud

Variational Autoencoder for Speech Enhancement with a Noise-Aware Encoder

Recently, a generative variational autoencoder (VAE) has been proposed for speech enhancement to model speech statistics. However, this approach only uses clean speech in the training phase, making the estimation particularly sensitive to…

Audio and Speech Processing · Electrical Eng. & Systems 2021-05-18 Huajian Fang , Guillaume Carbajal , Stefan Wermter , Timo Gerkmann

Improving Variational Autoencoder with Deep Feature Consistent and Generative Adversarial Training

We present a new method for improving the performances of variational autoencoder (VAE). In addition to enforcing the deep feature consistent principle thus ensuring the VAE output and its corresponding input images to have similar deep…

Computer Vision and Pattern Recognition · Computer Science 2019-06-06 Xianxu Hou , Ke Sun , Linlin Shen , Guoping Qiu

Variational autoencoder for inference of nonlinear mixed effect models based on ordinary differential equations

We propose a variational autoencoder (VAE) approach for parameter estimation in nonlinear mixed-effects models based on ordinary differential equations (NLME-ODEs) using longitudinal data from multiple subjects. In moderate dimensions,…

Methodology · Statistics 2026-02-11 Zhe Li , Mélanie Prague , Rodolphe Thiébaut , Quentin Clairon

Deep Feature Consistent Variational Autoencoder

We present a novel method for constructing Variational Autoencoder (VAE). Instead of using pixel-by-pixel loss, we enforce deep feature consistency between the input and the output of a VAE, which ensures the VAE's output to preserve the…

Computer Vision and Pattern Recognition · Computer Science 2024-03-21 Xianxu Hou , Linlin Shen , Ke Sun , Guoping Qiu

Tackling Over-pruning in Variational Autoencoders

Variational autoencoders (VAE) are directed generative models that learn factorial latent variables. As noted by Burda et al. (2015), these models exhibit the problem of factor over-pruning where a significant number of stochastic factors…

Machine Learning · Computer Science 2017-08-08 Serena Yeung , Anitha Kannan , Yann Dauphin , Li Fei-Fei

Variational autoencoder-based neural network model compression

Variational Autoencoders (VAEs), as a form of deep generative model, have been widely used in recent years, and shown great great peformance in a number of different domains, including image generation and anomaly detection, etc.. This…

Machine Learning · Computer Science 2024-08-28 Liang Cheng , Peiyuan Guan , Amir Taherkordi , Lei Liu , Dapeng Lan