Related papers: Learning to Denoise Historical Music

200 papers

Music discovery services let users identify songs from short mobile recordings. These solutions are often based on Audio Fingerprinting, and rely more specifically on the extraction of spectral peaks in order to be robust to a number of…

Sound · Computer Science 2022-12-23 Kamil Akesbi

In the machine learning approach to image denoising, a network is trained to recover a clean image from a noisy one. In this paper, a novel structure is proposed based on training multiple specialized networks, as opposed to existing structures…

Image and Video Processing · Electrical Eng. & Systems 2020-12-01 Seyed Mohsen Hosseini

We studied the ability of deep neural networks (DNNs) to restore missing audio content based on its context, a process usually referred to as audio inpainting. We focused on gaps in the range of tens of milliseconds. The proposed DNN…

Sound · Computer Science 2022-02-21 Andrés Marafioti, Nicki Holighaus, Piotr Majdak, Nathanaël Perraudin

This paper introduces a dual-signal transformation LSTM network (DTLN) for real-time speech enhancement as part of the Deep Noise Suppression Challenge (DNS-Challenge). This approach combines a short-time Fourier transform (STFT) and a…

Audio and Speech Processing · Electrical Eng. & Systems 2020-10-23 Nils L. Westhausen, Bernd T. Meyer

We present a framework based on neural networks to extract music scores directly from polyphonic audio in an end-to-end fashion. Most previous Automatic Music Transcription (AMT) methods seek a piano-roll representation of the pitches, that…

Sound · Computer Science 2019-10-29 Miguel A. Román, Antonio Pertusa, Jorge Calvo-Zaragoza

In recent years, speech enhancement (SE) has achieved impressive progress with the success of deep neural networks (DNNs). However, the DNN approach usually fails to generalize well to unseen environmental noise that is not included in the…

Audio and Speech Processing · Electrical Eng. & Systems 2020-04-09 Haoyu Li, Junichi Yamagishi

In spectroscopic experiments, data acquisition in multi-dimensional phase space may require long acquisition times, owing to the large phase space volume to be covered. In such cases, the limited time available for data acquisition can be a…

Nonnegative matrix factorization (NMF) is a popular method for audio spectral unmixing. While NMF is traditionally applied to off-the-shelf time-frequency representations based on the short-time Fourier or Cosine transforms, the ability to…

Machine Learning · Statistics 2018-11-07 Pierre Ablin, Dylan Fagot, Herwig Wendt, Alexandre Gramfort, Cédric Févotte

We have presented a new and alternative algorithm for noise reduction using the methods of discrete wavelet transform and numerical differentiation of the data. In our method the threshold for reducing noise comes out automatically. The…

This work proposes a learning-based statistical refinement method for improving the denoising results of a given denoiser without knowing the precise noise distribution or accessing clean images or calibration data. While there are many…

Machine Learning · Computer Science 2026-05-07 Rihuan Ke

We propose a method for the blind separation of sounds of musical instruments in audio signals. We describe the individual tones via a parametric model, training a dictionary to capture the relative amplitudes of the harmonics. The model…

Audio and Speech Processing · Electrical Eng. & Systems 2021-08-10 Sören Schulze, Johannes Leuschner, Emily J. King

Real-noise denoising is a challenging task because the statistics of real-noise do not follow the normal distribution, and they are also spatially and temporally changing. In order to cope with various and complex real-noise, we propose a…

Computer Vision and Pattern Recognition · Computer Science 2020-03-17 Yoonsik Kim, Jae Woong Soh, Gu Yong Park, Nam Ik Cho

We apply a Machine Learning technique known as Convolutional Denoising Autoencoder to denoise synthetic images of state-of-the-art radio telescopes, with the goal of detecting the faint, diffused radio sources predicted to characterise the…

Instrumentation and Methods for Astrophysics · Physics 2021-11-03 Claudio Gheller, Franco Vazza

Segmenting audio into homogeneous sections such as music and speech helps us understand the content of audio. It is useful as a pre-processing step to index, store, and modify audio recordings, radio broadcasts and TV programmes. Deep…

Machine learning applied to computer vision and signal processing is achieving results comparable to the human brain on specific tasks due to the great improvements brought by the deep neural networks (DNN). The majority of state-of-the-art…

Computer Vision and Pattern Recognition · Computer Science 2020-06-30 José Augusto Stuchi, Levy Boccato, Romis Attux

We present in this paper PerformanceNet, a neural network model we proposed recently to achieve score-to-audio music generation. The model learns to convert a music piece from the symbolic domain to the audio domain, assigning…

Sound · Computer Science 2019-05-29 Yu-Hua Chen, Bryan Wang, Yi-Hsuan Yang

We propose a method for noise reduction, the task of producing a clean audio signal from a recording corrupted by additive noise. Many common approaches to this problem are based upon applying non-negative matrix factorization to…

Audio and Speech Processing · Electrical Eng. & Systems 2021-10-08 Andrew Sack, Wenzhao Jiang, Michael Perlmutter, Palina Salanevich, Deanna Needell

Real-world image noise removal is a long-standing yet very challenging task in computer vision. The success of deep neural networks in denoising stimulates research on noise generation, aiming at synthesizing more clean-noisy image pairs…

Computer Vision and Pattern Recognition · Computer Science 2020-07-14 Zongsheng Yue, Qian Zhao, Lei Zhang, Deyu Meng

The complex physics involved in atmospheric turbulence makes it very difficult for ground-based astronomy to build accurate scintillation models and develop efficient methodologies to remove this highly structured noise from valuable…

Instrumentation and Methods for Astrophysics · Physics 2022-05-18 Alejandra Rocha-Solache, Iván Rodríguez-Montoya, David Sánchez-Argüelles, Itziar Aretxaga

In this paper, we address the problem of multichannel speech enhancement in the short-time Fourier transform (STFT) domain. A long short-term memory (LSTM) network takes as input a sequence of STFT coefficients associated with a frequency…

Sound · Computer Science 2020-09-24 Xiaofei LI, Radu Horaud