English
Related papers

Related papers: Problems using deep generative models for probabil…

200 papers

Despite substantial progress in signal source separation, results for richly structured data continue to contain perceptible artifacts. In contrast, recent deep generative models can produce authentic samples in a variety of domains that…

Machine Learning · Computer Science 2020-09-22 Vivek Jayaram , John Thickstun

A prominent theory of affective response to music revolves around the concepts of surprisal and expectation. In prior work, this idea has been operationalized in the form of probabilistic models of music which allow for precise computation…

Sound · Computer Science 2023-10-06 Ninon Lizé Masclef , T. Anderson Keller

Sound modelling is the process of developing algorithms that generate sound under parametric control. There are a few distinct approaches that have been developed historically including modelling the physics of sound production and…

Audio and Speech Processing · Electrical Eng. & Systems 2020-11-26 M. Huzaifah , L. Wyse

State-of-the-art under-determined audio source separation systems rely on supervised end-end training of carefully tailored neural network architectures operating either in the time or the spectral domain. However, these methods are…

Audio and Speech Processing · Electrical Eng. & Systems 2020-05-29 Vivek Narayanaswamy , Jayaraman J. Thiagarajan , Rushil Anirudh , Andreas Spanias

Neural network based generative models with discriminative components are a powerful approach for semi-supervised learning. However, these techniques a) cannot account for model uncertainty in the estimation of the model's discriminative…

Machine Learning · Statistics 2017-06-30 Jonathan Gordon , José Miguel Hernández-Lobato

Generative diffusion models showed high success in many fields with a powerful theoretical background. They convert the data distribution to noise and remove the noise back to obtain a similar distribution. Many existing reviews focused on…

Machine Learning · Computer Science 2024-09-19 Melike Nur Yeğin , Mehmet Fatih Amasyalı

Accurately interpreting cardiac auscultation signals plays a crucial role in diagnosing and managing cardiovascular diseases. However, the paucity of labelled data inhibits classification models' training. Researchers have turned to…

Sound · Computer Science 2025-06-18 Leigh Abbott , Milan Marocchi , Matthew Fynn , Yue Rong , Sven Nordholm

Supervised deep learning approaches to underdetermined audio source separation achieve state-of-the-art performance but require a dataset of mixtures along with their corresponding isolated source signals. Such datasets can be extremely…

Extracting individual elements from music mixtures is a valuable tool for music production and practice. While neural networks optimized to mask or transform mixture spectrograms into the individual source(s) have been the leading approach,…

Sound · Computer Science 2025-11-26 Genís Plaja-Roglans , Yun-Ning Hung , Xavier Serra , Igor Pereira

Despite exceptional predictive performance of Deep sequence models (DSMs), the main concern of their deployment centers around the lack of uncertainty awareness. In contrast, probabilistic models quantify the uncertainty associated with…

Machine Learning · Computer Science 2026-03-03 Wenlong Chen

Most generative models of audio directly generate samples in one of two domains: time or frequency. While sufficient to express any signal, these representations are inefficient, as they do not utilize existing knowledge of how sound is…

Machine Learning · Computer Science 2020-01-15 Jesse Engel , Lamtharn Hantrakul , Chenjie Gu , Adam Roberts

Separating the individual elements in a musical mixture is an essential process for music analysis and practice. While this is generally addressed using neural networks optimized to mask or transform the time-frequency representation of a…

Sound · Computer Science 2025-11-27 Genís Plaja-Roglans , Yun-Ning Hung , Xavier Serra , Igor Pereira

Current generative models are able to generate high-quality artefacts but have been shown to struggle with compositional reasoning, which can be defined as the ability to generate complex structures from simpler elements. In this paper, we…

Machine Learning · Computer Science 2024-08-20 Giovanni Bindi , Philippe Esling

Generalization is a main issue for current audio deepfake detectors, which struggle to provide reliable results on out-of-distribution data. Given the speed at which more and more accurate synthesis methods are developed, it is very…

Sound · Computer Science 2024-07-02 Alessandro Pianese , Davide Cozzolino , Giovanni Poggi , Luisa Verdoliva

Modeling uncertainty in deep neural networks, despite recent important advances, is still an open problem. Bayesian neural networks are a powerful solution, where the prior over network weights is a design choice, often a normal…

Machine Learning · Statistics 2019-10-29 Raanan Y. Rohekar , Yaniv Gurwicz , Shami Nisimov , Gal Novik

Audio source separation is the process of separating a mixture (e.g. a pop band recording) into isolated sounds from individual sources (e.g. just the lead vocals). Deep learning models are the state-of-the-art in source separation, given…

Audio and Speech Processing · Electrical Eng. & Systems 2020-07-28 Alisa Liu , Prem Seetharaman , Bryan Pardo

This paper aims to apply a new deep learning approach to the task of generating raw audio files. It is based on diffusion models, a recent type of deep generative model. This new type of method has recently shown outstanding results with…

Sound · Computer Science 2023-07-21 Svetlana Pavlova

Diffusion probabilistic models have been successfully used to generate data from noise. However, most diffusion models are computationally expensive and difficult to interpret with a lack of theoretical justification. Random feature models…

Machine Learning · Statistics 2025-08-11 Esha Saha , Giang Tran

The recent literature on deep learning offers new tools to learn a rich probability distribution over high dimensional data such as images or sounds. In this work we investigate the possibility of learning the prior distribution over neural…

Machine Learning · Statistics 2017-12-19 Alexandre Lacoste , Thomas Boquet , Negar Rostamzadeh , Boris Oreshkin , Wonchang Chung , David Krueger

State of the art audio source separation models rely on supervised data-driven approaches, which can be expensive in terms of labeling resources. On the other hand, approaches for training these models without any direct supervision are…

Machine Learning · Computer Science 2022-04-04 Michele Mancusi , Emilian Postolache , Giorgio Mariani , Marco Fumero , Andrea Santilli , Luca Cosmo , Emanuele Rodolà
‹ Prev 1 2 3 10 Next ›