Related papers: Learning to Denoise Historical Music

Deep sound-field denoiser: optically-measured sound-field denoising using deep neural network

This paper proposes a deep sound-field denoiser, a deep neural network (DNN) based denoising of optically measured sound-field images. Sound-field imaging using optical methods has gained considerable attention due to its ability to achieve…

Signal Processing · Electrical Eng. & Systems 2023-09-22 Kenji Ishikawa, Daiki Takeuchi, Noboru Harada, Takehiro Moriya

Enhancing and Learning Denoiser without Clean Reference

Recent studies on learning-based image denoising have achieved promising performance on various noise reduction tasks. Most of these deep denoisers are trained either under the supervision of clean references, or unsupervised on synthetic…

Image and Video Processing · Electrical Eng. & Systems 2021-03-30 Rui Zhao, Daniel P. K. Lun, Kin-Man Lam

Fast, Accurate Manifold Denoising by Tunneling Riemannian Optimization

Learned denoisers play a fundamental role in various signal generation (e.g., diffusion models) and reconstruction (e.g., compressed sensing) architectures, whose success derives from their ability to leverage low-dimensional structure in…

Machine Learning · Computer Science 2025-08-14 Shiyu Wang, Mariam Avagyan, Yihan Shen, Arnaud Lamy, Tingran Wang, Szabolcs Márka, Zsuzsa Márka, John Wright

STFT spectral loss for training a neural speech waveform model

This paper proposes a new loss using short-time Fourier transform (STFT) spectra for the aim of training a high-performance neural speech waveform model that predicts raw continuous speech waveform samples directly. Not only amplitude…

Audio and Speech Processing · Electrical Eng. & Systems 2018-10-31 Shinji Takaki, Toru Nakashika, Xin Wang, Junichi Yamagishi

Denoising without access to clean data using a partitioned autoencoder

Training a denoising autoencoder neural network requires access to truly clean data, a requirement which is often impractical. To remedy this, we introduce a method to train an autoencoder using only noisy data, having examples with and…

Neural and Evolutionary Computing · Computer Science 2015-09-24 Dan Stowell, Richard E. Turner

Time Domain Neural Audio Style Transfer

A recently published method for audio style transfer has shown how to extend the process of image style transfer to audio. This method synthesizes audio "content" and "style" independently using the magnitudes of a short time Fourier…

Sound · Computer Science 2017-12-01 Parag K. Mital

Model-based STFT phase recovery for audio source separation

For audio source separation applications, it is common to estimate the magnitude of the short-time Fourier transform (STFT) of each source. In order to further synthesize time-domain signals, it is necessary to recover the phase of the…

Sound · Computer Science 2018-02-28 Paul Magron, Roland Badeau, Bertrand David

An auditory cortex model for sound processing

The reconstruction mechanisms built by the human auditory system during sound reconstruction are still a matter of debate. The purpose of this study is to refine the auditory cortex model introduced in [9], and inspired by the geometrical…

Analysis of PDEs · Mathematics 2021-03-09 Rand Asswad, Ugo Boscain, Giuseppina Turco, Dario Prandi, Ludovic Sacchelli

Comparison of Time-Frequency Representations for Environmental Sound Classification using Convolutional Neural Networks

Recent successful applications of convolutional neural networks (CNNs) to audio classification and speech recognition have motivated the search for better input representations for more efficient training. Visual displays of an audio…

Computer Vision and Pattern Recognition · Computer Science 2017-06-23 M. Huzaifah

Bayesian Reconstruction of Fourier Pairs

In a number of data-driven applications such as detection of arrhythmia, interferometry or audio compression, observations are acquired indistinctly in the time or frequency domains: temporal observations allow us to study the spectral…

Signal Processing · Electrical Eng. & Systems 2020-11-10 Felipe Tobar, Lerko Araya-Hernández, Pablo Huijse, Petar M. Djurić

2-Shots in the Dark: Low-Light Denoising with Minimal Data Acquisition

Raw images taken in low-light conditions are very noisy due to low photon count and sensor noise. Learning-based denoisers have the potential to reconstruct high-quality images. For training, however, these denoisers require large paired…

Computer Vision and Pattern Recognition · Computer Science 2025-12-04 Liying Lu, Raphaël Achddou, Sabine Süsstrunk

Enhancing Traffic Prediction with Learnable Filter Module

Modeling future traffic conditions often relies heavily on complex spatial-temporal neural networks to capture spatial and temporal correlations, which can overlook the inherent noise in the data. This noise, often manifesting as unexpected…

Machine Learning · Computer Science 2023-10-26 Yuanshao Zhu, Yongchao Ye, Xiangyu Zhao, James J. Q. Yu

DenoiSeg: Joint Denoising and Segmentation

Microscopy image analysis often requires the segmentation of objects, but training data for this task is typically scarce and hard to obtain. Here we propose DenoiSeg, a new method that can be trained end-to-end on only a few annotated…

Computer Vision and Pattern Recognition · Computer Science 2020-06-12 Tim-Oliver Buchholz, Mangal Prakash, Alexander Krull, Florian Jug

Audio-noise Power Spectral Density Estimation Using Long Short-term Memory

We propose a method using a long short-term memory (LSTM) network to estimate the noise power spectral density (PSD) of single-channel audio signals represented in the short time Fourier transform (STFT) domain. An LSTM network common to…

Signal Processing · Electrical Eng. & Systems 2020-11-11 Xiaofei Li, Simon Leglaive, Laurent Girin, Radu Horaud

Speech Denoising Without Clean Training Data: A Noise2Noise Approach

This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio-denoising methods by showing that it is possible to train deep speech denoising networks using only noisy speech samples.…

Sound · Computer Science 2021-09-21 Madhav Mahesh Kashyap, Anuj Tambwekar, Krishnamoorthy Manohara, S Natarajan

Sines, Transient, Noise Neural Modeling of Piano Notes

This paper introduces a novel method for emulating piano sounds. We propose to exploit the sines, transient, and noise decomposition to design a differentiable spectral modeling synthesizer replicating piano notes. Three sub-modules learn…

Sound · Computer Science 2025-02-04 Riccardo Simionato, Stefano Fasciani

Consensus Neural Network for Medical Imaging Denoising with Only Noisy Training Samples

Deep neural networks have been proved efficient for medical image denoising. Current training methods require both noisy and clean images. However, clean images cannot be acquired for many practical medical applications due to naturally…

Image and Video Processing · Electrical Eng. & Systems 2019-06-11 Dufan Wu, Kuang Gong, Kyungsang Kim, Quanzheng Li

Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders

Generative models in vision have seen rapid progress due to algorithmic improvements and the availability of high-quality image datasets. In this paper, we offer contributions in both these areas to enable similar progress in audio…

Machine Learning · Computer Science 2017-04-06 Jesse Engel, Cinjon Resnick, Adam Roberts, Sander Dieleman, Douglas Eck, Karen Simonyan, Mohammad Norouzi

Complex Image-Generative Diffusion Transformer for Audio Denoising

The audio denoising technique has captured widespread attention in the deep neural network field. Recently, the audio denoising problem has been converted into an image generation task, and deep learning-based approaches have been applied…

Sound · Computer Science 2024-06-14 Junhui Li, Pu Wang, Jialu Li, Youshan Zhang

Denoising deterministic networks using iterative Fourier transforms

We detail a novel Fourier-based approach (IterativeFT) for identifying deterministic network structure in the presence of both edge pruning and Gaussian noise. This technique involves the iterative execution of forward and inverse 2D…

Signal Processing · Electrical Eng. & Systems 2026-02-03 H. Robert Frost