English
Related papers

Related papers: Learning to Denoise Historical Music

200 papers

Convolutional Neural Network (CNN) has been widely used in unstructured datasets, one of which is image denoising. Image denoising is a noisy image reconstruction process that aims to reduce additional noise that occurs from the noisy image…

Computer Vision and Pattern Recognition · Computer Science 2025-02-14 Bintang Pradana Erlangga Putra , Heri Prasetyo , Esti Suryani

The human auditory system is able to distinguish the vocal source of thousands of speakers, yet not much is known about what features the auditory system uses to do this. Fourier Transforms are capable of capturing the pitch and harmonic…

Machine Learning · Statistics 2016-10-28 Shariq Mobin , Joan Bruna

Spectral characterization of noise environments that lead to the decoherence of qubits is critical to developing robust quantum technologies. While dynamical decoupling offers one of the most successful approaches to characterize noise…

Quantum Physics · Physics 2024-05-21 Arian Vezvaee , Nanako Shitara , Shuo Sun , Andrés Montoya-Castillo

We apply basic statistical reasoning to signal reconstruction by machine learning -- learning to map corrupted observations to clean signals -- with a simple and powerful conclusion: it is possible to learn to restore images by only looking…

Computer Vision and Pattern Recognition · Computer Science 2018-10-30 Jaakko Lehtinen , Jacob Munkberg , Jon Hasselgren , Samuli Laine , Tero Karras , Miika Aittala , Timo Aila

Diffusion models have been used for probabilistic time series forecasting and show strong potential. However, fixed noise schedules often produce intermediate states that are hard to invert and a terminal state that deviates from the near…

Machine Learning · Computer Science 2026-03-03 Jintao Zhang , Zirui Liu , Mingyue Cheng , Xianquan Wang , Zhiding Liu , Qi Liu

This paper proposes a delayed subband LSTM network for online monaural (single-channel) speech enhancement. The proposed method is developed in the short time Fourier transform (STFT) domain. Online processing requires frame-by-frame signal…

Sound · Computer Science 2023-12-13 Xiaofei Li , Radu Horaud

With recent research advancements, deep learning models are becoming attractive and powerful choices for speech enhancement in real-time applications. While state-of-the-art models can achieve outstanding results in terms of speech quality…

Audio and Speech Processing · Electrical Eng. & Systems 2021-05-20 Sebastian Braun , Hannes Gamper , Chandan K. A. Reddy , Ivan Tashev

Disentangling and recovering physical attributes, such as shape and material, from a few waveform examples is a challenging inverse problem in audio signal processing, with numerous applications in musical acoustics as well as structural…

Sound · Computer Science 2020-07-21 Han Han , Vincent Lostanlen

In this paper, we investigate a deep learning approach for speech denoising through an efficient ensemble of specialist neural networks. By splitting up the speech denoising task into non-overlapping subproblems and introducing a…

Audio and Speech Processing · Electrical Eng. & Systems 2020-08-11 Aswin Sivaraman , Minje Kim

A new algorithm is developed to jointly recover a temporal sequence of images from noisy and under-sampled Fourier data. Specifically, we consider the case where each data set is missing vital information that prevents its (individual)…

Numerical Analysis · Mathematics 2022-05-13 Yao Xiao , Jan Glaubitz , Anne Gelb , Guohui Song

Medical image denoising is considered among the most challenging vision tasks. Despite the real-world implications, existing denoising methods have notable drawbacks as they often generate visual artifacts when applied to heterogeneous…

Image and Video Processing · Electrical Eng. & Systems 2025-03-11 S M A Sharif , Rizwan Ali Naqvi , Woong-Kee Loh

A convolution neural network (CNN) based classification method for broadband DOA estimation is proposed, where the phase component of the short-time Fourier transform coefficients of the received microphone signals are directly fed into the…

Sound · Computer Science 2019-12-18 Soumitro Chakrabarty , Emanuël. A. P. Habets

De-noising plays a crucial role in the post-processing of spectra. Machine learning-based methods show good performance in extracting intrinsic information from noisy data, but often require a high-quality training set that is typically…

Materials Science · Physics 2023-05-16 Dongchen Huang , Junde Liu , Tian Qian , Yi-feng Yang

The performance of deep learning models for music source separation heavily depends on training data quality. However, datasets are often corrupted by difficult-to-detect artifacts such as audio bleeding and label noise. Since the type and…

Audio and Speech Processing · Electrical Eng. & Systems 2025-10-20 Azalea Gui , Woosung Choi , Junghyun Koo , Kazuki Shimada , Takashi Shibuya , Joan Serrà , Wei-Hsiang Liao , Yuki Mitsufuji

We propose a new framework for manifold denoising based on processing in the graph Fourier frequency domain, derived from the spectral decomposition of the discrete graph Laplacian. Our approach uses the Spectral Graph Wavelet transform in…

Machine Learning · Computer Science 2016-11-30 Shay Deutsch , Antonio Ortega , Gerard Medioni

Lack of training data presents a grand challenge to scaling out spoken language understanding (SLU) to low-resource languages. Although various data augmentation approaches have been proposed to synthesize training data in low-resource…

Computation and Language · Computer Science 2021-09-06 Yingmei Guo , Linjun Shou , Jian Pei , Ming Gong , Mingxing Xu , Zhiyong Wu , Daxin Jiang

A deep convolutional neural network has been developed to denoise atomic-resolution TEM image datasets of nanoparticles acquired using direct electron counting detectors, for applications where the image signal is severely limited by shot…

The short-time Fourier transform (STFT) is widely used for analyzing non-stationary signals. However, its performance is highly sensitive to its parameters, and manual or heuristic tuning often yields suboptimal results. To overcome this…

Sound · Computer Science 2025-06-27 Maxime Leiber , Yosra Marnissi , Axel Barrau , Sylvain Meignen , Laurent Massoulié

We propose a fully-convolutional neural-network architecture for image denoising which is simple yet powerful. Its structure allows to exploit the gradual nature of the denoising process, in which shallow layers handle local noise…

Computer Vision and Pattern Recognition · Computer Science 2018-10-17 Tal Remez , Or Litany , Raja Giryes , Alex M. Bronstein

This paper describes a preliminary approach to algorithmically reproduce the archetypical structure adopted by humans to classify sounds. In particular, we propose an approach to predict the human perceived chaos/order level in a sound and…

Sound · Computer Science 2020-03-11 Eric Guizzo , Alberto Novello