Related papers: Audio Decoding by Inverse Problem Solving

Removing Structured Noise with Diffusion Models

Solving ill-posed inverse problems requires careful formulation of prior beliefs over the signals of interest and an accurate description of their manifestation into noisy measurements. Handcrafted signal priors based on e.g. sparsity are…

Machine Learning · Computer Science 2025-08-14 Tristan S. W. Stevens , Hans van Gorp , Faik C. Meral , Junseob Shin , Jason Yu , Jean-Luc Robert , Ruud J. G. van Sloun

Solving Inverse Problems with a Flow-based Noise Model

We study image inverse problems with a normalizing flow prior. Our formulation views the solution as the maximum a posteriori estimate of the image conditioned on the measurements. This formulation allows us to use noise models with…

Machine Learning · Computer Science 2021-07-02 Jay Whang , Qi Lei , Alexandros G. Dimakis

Unsupervised Single-Channel Audio Separation with Diffusion Source Priors

Single-channel audio separation aims to separate individual sources from a single-channel mixture. Most existing methods rely on supervised learning with synthetically generated paired data. However, obtaining high-quality paired data in…

Audio and Speech Processing · Electrical Eng. & Systems 2025-12-24 Runwu Shi , Chang Li , Jiang Wang , Rui Zhang , Nabeela Khan , Benjamin Yen , Takeshi Ashizawa , Kazuhiro Nakadai

Injecting Measurement Information Yields a Fast and Noise-Robust Diffusion-Based Inverse Problem Solver

Diffusion models have been firmly established as principled zero-shot solvers for linear and nonlinear inverse problems, owing to their powerful image prior and iterative sampling algorithm. These approaches often rely on Tweedie's formula,…

Machine Learning · Computer Science 2026-04-29 Jonathan Patsenker , Henry Li , Myeongseob Ko , Ruoxi Jia , Yuval Kluger

Diffusion Posterior Sampling for General Noisy Inverse Problems

Diffusion models have been recently studied as powerful generative inverse problem solvers, owing to their high quality reconstructions and the ease of combining existing iterative solvers. However, most works focus on solving simple linear…

Machine Learning · Statistics 2025-10-06 Hyungjin Chung , Jeongsol Kim , Michael T. Mccann , Marc L. Klasky , Jong Chul Ye

Bayesian Source Separation and Localization

The problem of mixed signals occurs in many different contexts; one of the most familiar being acoustics. The forward problem in acoustics consists of finding the sound pressure levels at various detectors resulting from sound signals…

Data Analysis, Statistics and Probability · Physics 2007-05-23 Kevin H. Knuth

Tweedie Moment Projected Diffusions For Inverse Problems

Diffusion generative models unlock new possibilities for inverse problems as they allow for the incorporation of strong empirical priors in scientific inference. Recently, diffusion models are repurposed for solving inverse problems using…

Computation · Statistics 2024-09-26 Benjamin Boys , Mark Girolami , Jakiw Pidstrigach , Sebastian Reich , Alan Mosca , O. Deniz Akyildiz

High-Fidelity Noise Reduction with Differentiable Signal Processing

Noise reduction techniques based on deep learning have demonstrated impressive performance in enhancing the overall quality of recorded speech. While these approaches are highly performant, their application in audio engineering can be…

Sound · Computer Science 2023-10-18 Christian J. Steinmetz , Thomas Walther , Joshua D. Reiss

Token-Based Audio Inpainting via Discrete Diffusion

Audio inpainting seeks to restore missing segments in degraded recordings. Previous diffusion-based methods exhibit impaired performance when the missing region is large. We introduce the first approach that applies discrete diffusion over…

Sound · Computer Science 2026-02-18 Tali Dror , Iftach Shoham , Moshe Buchris , Oren Gal , Haim Permuter , Gilad Katz , Eliya Nachmani

Audio declipping performance enhancement via crossfading

Some audio declipping methods produce waveforms that do not fully respect the physical process of clipping, which is why we refer to them as inconsistent. This letter reports what effect on perception it has if the solution by inconsistent…

Audio and Speech Processing · Electrical Eng. & Systems 2023-03-08 Pavel Záviška , Pavel Rajmic , Ondřej Mokrý

Enhancing Diffusion Posterior Sampling for Inverse Problems by Integrating Crafted Measurements

Diffusion models have emerged as a powerful foundation model for visual generations. With an appropriate sampling process, it can effectively serve as a generative prior for solving general inverse problems. Current posterior sampling-based…

Computer Vision and Pattern Recognition · Computer Science 2025-07-01 Shijie Zhou , Huaisheng Zhu , Rohan Sharma , Jiayi Chen , Ruiyi Zhang , Kaiyi Ji , Changyou Chen

Diffusion Models for Solving Inverse Problems via Posterior Sampling with Piecewise Guidance

Diffusion models are powerful tools for sampling from high-dimensional distributions by progressively transforming pure noise into structured data through a denoising process. When equipped with a guidance mechanism, these models can also…

Machine Learning · Computer Science 2026-05-04 Saeed Mohseni-Sehdeh , Walid Saad , Kei Sakaguchi , Tao Yu

Speech Denoising by Accumulating Per-Frequency Modeling Fluctuations

We present a method for audio denoising that combines processing done in both the time domain and the time-frequency domain. Given a noisy audio clip, the method trains a deep neural network to fit this signal. Since the fitting is only…

Sound · Computer Science 2020-06-11 Michael Michelashvili , Lior Wolf

Guided Diffusion Sampling on Function Spaces with Applications to PDEs

We propose a general framework for conditional sampling in PDE-based inverse problems, targeting the recovery of whole solutions from extremely sparse or noisy measurements. This is accomplished by a function-space diffusion model and…

Machine Learning · Computer Science 2026-02-06 Jiachen Yao , Abbas Mammadov , Julius Berner , Gavin Kerrigan , Jong Chul Ye , Kamyar Azizzadenesheli , Anima Anandkumar

An Encoder-Decoder Based Audio Captioning System With Transfer and Reinforcement Learning

Automated audio captioning aims to use natural language to describe the content of audio data. This paper presents an audio captioning system with an encoder-decoder architecture, where the decoder predicts words based on audio features…

Audio and Speech Processing · Electrical Eng. & Systems 2021-08-06 Xinhao Mei , Qiushi Huang , Xubo Liu , Gengyun Chen , Jingqian Wu , Yusong Wu , Jinzheng Zhao , Shengchen Li , Tom Ko , H Lilian Tang , Xi Shao , Mark D. Plumbley , Wenwu Wang

Improving Diffusion Inverse Problem Solving with Decoupled Noise Annealing

Diffusion models have recently achieved success in solving Bayesian inverse problems with learned data priors. Current methods build on top of the diffusion sampling process, where each denoising step makes small modifications to samples…

Machine Learning · Computer Science 2025-08-19 Bingliang Zhang , Wenda Chu , Julius Berner , Chenlin Meng , Anima Anandkumar , Yang Song

Diffusion models for audio semantic communication

Directly sending audio signals from a transmitter to a receiver across a noisy channel may absorb consistent bandwidth and be prone to errors when trying to recover the transmitted bits. On the contrary, the recent semantic communication…

Sound · Computer Science 2023-09-15 Eleonora Grassucci , Christian Marinoni , Andrea Rodriguez , Danilo Comminiello

Noise-robust voice conversion with domain adversarial training

Voice conversion has made great progress in the past few years under the studio-quality test scenario in terms of speech quality and speaker similarity. However, in real applications, test speech from source speaker or target speaker can be…

Sound · Computer Science 2022-01-27 Hongqiang Du , Lei Xie , Haizhou Li

A Study on Speech Enhancement Based on Diffusion Probabilistic Model

Diffusion probabilistic models have demonstrated an outstanding capability to model natural images and raw audio waveforms through a paired diffusion and reverse processes. The unique property of the reverse process (namely, eliminating…

Audio and Speech Processing · Electrical Eng. & Systems 2021-11-23 Yen-Ju Lu , Yu Tsao , Shinji Watanabe

Solving Inverse Problems with Latent Diffusion Models via Hard Data Consistency

Diffusion models have recently emerged as powerful generative priors for solving inverse problems. However, training diffusion models in the pixel space are both data-intensive and computationally demanding, which restricts their…

Computer Vision and Pattern Recognition · Computer Science 2024-04-17 Bowen Song , Soo Min Kwon , Zecheng Zhang , Xinyu Hu , Qing Qu , Liyue Shen