English
Related papers

Related papers: Learning to Denoise Historical Music

200 papers

Recently, we proposed short-time Fourier transform (STFT)-based loss functions for training a neural speech waveform model. In this paper, we generalize the above framework and propose a training scheme for such models based on spectral…

Audio and Speech Processing · Electrical Eng. & Systems 2019-04-09 Shinji Takaki , Hirokazu Kameoka , Junichi Yamagishi

We study the problem of few-shot learning-based denoising where the training set contains just a handful of clean and noisy samples. A solution to mitigate the small training set issue is to pre-train a denoising model with small training…

Computer Vision and Pattern Recognition · Computer Science 2019-11-27 Leslie Casas , Attila Klimmek , Gustavo Carneiro , Nassir Navab , Vasileios Belagiannis

In traditional speech denoising tasks, clean audio signals are often used as the training target, but absolutely clean signals are collected from expensive recording equipment or in studios with the strict environments. To overcome this…

Audio and Speech Processing · Electrical Eng. & Systems 2023-01-20 Jiasong Wu , Qingchun Li , Guanyu Yang , Lei Li , Lotfi Senhadji , Huazhong Shu

Traditional NMF-based signal decomposition relies on the factorization of spectral data, which is typically computed by means of short-time frequency transform. In this paper we propose to relax the choice of a pre-fixed transform and learn…

Machine Learning · Computer Science 2017-12-18 Dylan Fagot , Cédric Févotte , Herwig Wendt

Deep neural networks provide state-of-the-art performance for image denoising, where the goal is to recover a near noise-free image from a noisy observation. The underlying principle is that neural networks trained on large datasets have…

Information Theory · Computer Science 2019-04-09 Reinhard Heckel , Wen Huang , Paul Hand , Vladislav Voroninski

Music performance synthesis aims to synthesize a musical score into a natural performance. In this paper, we borrow recent advances in text-to-speech synthesis and present the Deep Performer -- a novel system for score-to-audio music…

Sound · Computer Science 2022-02-22 Hao-Wen Dong , Cong Zhou , Taylor Berg-Kirkpatrick , Julian McAuley

In this paper, we present two variations of an algorithm for signal reconstruction from one-bit or two-bit noisy observations of the discrete Fourier transform (DFT). The one-bit observations of the DFT correspond to the sign of its real…

Signal Processing · Electrical Eng. & Systems 2022-05-25 Mohak Goyal , Animesh Kumar

We present a model for capturing musical features and creating novel sequences of music, called the Convolutional Variational Recurrent Neural Network. To generate sequential data, the model uses an encoder-decoder architecture with latent…

Sound · Computer Science 2018-10-09 Eunjeong Stella Koh , Shlomo Dubnov , Dustin Wright

Transformers (Vaswani et al., 2017) have brought a remarkable improvement in the performance of neural machine translation (NMT) systems but they could be surprisingly vulnerable to noise. In this work, we try to investigate how noise…

Computation and Language · Computer Science 2021-09-13 Peyman Passban , Puneeth S. M. Saladi , Qun Liu

In this paper we propose a scalable version of a state-of-the-art deterministic time-invariant feature extraction approach based on consecutive changes of basis and nonlinearities, namely, the scattering network. The first focus of the…

Machine Learning · Statistics 2017-07-20 Randall Balestriero , Herve Glotin

In this work we propose a method for learning wavelet filters directly from data. We accomplish this by framing the discrete wavelet transform as a modified convolutional neural network. We introduce an autoencoder wavelet transform network…

Machine Learning · Computer Science 2018-02-09 Daniel Recoskie , Richard Mann

In this paper we introduce a method for significantly improving the signal to noise ratio in financial data. The approach relies on combining a target variable with different context variables and use auto-encoders (AEs) to learn…

Statistical Finance · Quantitative Finance 2024-08-13 Matthias J. Feiler

Deep learning approaches in image processing predominantly resort to supervised learning. A majority of methods for image denoising are no exception to this rule and hence demand pairs of noisy and corresponding clean images. Only recently…

Image and Video Processing · Electrical Eng. & Systems 2020-10-02 Priyatham Kattakinda , A. N. Rajagopalan

Automatic music transcription (AMT) is the problem of analyzing an audio recording of a musical piece and detecting notes that are being played. AMT is a challenging problem, particularly when it comes to polyphonic music. The goal of AMT…

Sound · Computer Science 2025-05-08 Yohannis Telila , Tommaso Cucinotta , Davide Bacciu

Many applications of cross-modal music retrieval are related to connecting sheet music images to audio recordings. A typical and recent approach to this is to learn, via deep neural networks, a joint embedding space that correlates short…

Sound · Computer Science 2023-09-22 Luis Carvalho , Gerhard Widmer

This paper introduces a quantum-inspired denoising framework that integrates the Quantum Fourier Transform (QFT) into classical audio enhancement pipelines. Unlike conventional Fast Fourier Transform (FFT) based methods, QFT provides a…

Sound · Computer Science 2025-09-08 Rajeshwar Tripathi , Sahil Tomar , Sandeep Kumar , Monika Aggarwal

Convolutional Neural Network (CNN) recognition rates drop in the presence of noise. We demonstrate a novel method of counteracting this drop in recognition rate by adjusting the biases of the neurons in the convolutional layers according to…

Computer Vision and Pattern Recognition · Computer Science 2017-02-06 James R. Geraci , Parichay Kapoor

Microstructure imaging is crucial in materials science, but experimental images often introduce noise that obscures critical structural details. This study presents a novel deep learning approach for robust microstructure image denoising,…

Materials Science · Physics 2025-07-03 Owais Ahmad , Albert Linda , Saumya Ranjan Jha , Somnath Bhowmick

Lifelong audio feature extraction involves learning new sound classes incrementally, which is essential for adapting to new data distributions over time. However, optimizing the model only on new data can lead to catastrophic forgetting of…

Audio and Speech Processing · Electrical Eng. & Systems 2024-02-08 Xilin Jiang , Yinghao Aaron Li , Nima Mesgarani

Deep learning algorithms, especially Transformer-based models, have achieved significant performance by capturing long-range dependencies and historical information. However, the power of convolution has not been fully investigated.…

Machine Learning · Computer Science 2023-12-29 Zhihao Yu , Liantao Ma , Yasha Wang , Junfeng Zhao