Related papers: Learning to Denoise Historical Music

200 papers

We present a method for audio denoising that combines processing done in both the time domain and the time-frequency domain. Given a noisy audio clip, the method trains a deep neural network to fit this signal. Since the fitting is only…

Sound · Computer Science · 2020-06-11 · Michael Michelashvili, Lior Wolf

Achieving high-performance audio denoising is still a challenging task in real-world applications. Existing time-frequency methods often ignore the quality of generated frequency domain images. This paper converts the audio denoising…

Sound · Computer Science · 2023-10-26 · Youshan Zhang, Jialu Li

A method for musical audio synthesis using autoencoding neural networks is proposed. The autoencoder is trained to compress and reconstruct magnitude short-time Fourier transform frames. The autoencoder produces a spectrogram by activating…

Audio and Speech Processing · Electrical Eng. & Systems · 2020-04-29 · Joseph Colonel, Christopher Curro, Sam Keene

Enhancing the sound quality of historical music recordings is a long-standing problem. This paper presents a novel denoising method based on a fully-convolutional deep neural network. A two-stage U-Net model architecture is designed to…

Audio and Speech Processing · Electrical Eng. & Systems · 2022-02-22 · Eloi Moliner, Vesa Välimäki

Micro-Doppler analysis has become increasingly popular in recent years owing to the ability of the technique to enhance classification strategies. Applications include recognising everyday human activities, distinguishing drones from birds,…

Signal Processing · Electrical Eng. & Systems · 2021-02-16 · Chong Tang, Wenda Li, Shelly Vishwakarma, Karl Woodbridge, Simon Julier, Kevin Chetty

We present a deep neural network to reduce coherent noise in three-dimensional quantitative phase imaging. Inspired by the cycle generative adversarial network, the denoising network was trained to learn a transform between two image…

In this paper, we propose a novel approach for generating music based on an artificial intelligence (AI) system. We analyze the features of music and use them to fit and predict the music. The fractional Fourier transform (FrFT) and the…

Sound · Computer Science · 2026-04-21 · Li Ya, Chen Wei, Li Xiulai, Yu Lei, Deng Xinyi, Chen Chaofan

A recurrent neural network (RNN) is trained to predict sound samples based on audio input augmented by control parameter information for pitch, volume, and instrument identification. During the generative phase following training, audio…

Sound · Computer Science · 2019-03-27 · Lonce Wyse, Muhammad Huzaifah

We propose a novel approach for time-scale modification of audio signals. Unlike traditional methods that rely on the framing technique or the short-time Fourier transform to preserve the frequency during temporal stretching, our neural…

Sound · Computer Science · 2023-10-09 · Ernie Chu, Ju-Ting Chen, Chia-Ping Chen

We present an end-to-end deep learning approach to denoising speech signals by processing the raw waveform directly. Given input audio containing speech corrupted by an additive background signal, the system aims to produce a processed…

Audio and Speech Processing · Electrical Eng. & Systems · 2018-09-18 · Francois G. Germain, Qifeng Chen, Vladlen Koltun

Contemporary speech enhancement predominantly relies on audio transforms that are trained to reconstruct a clean speech waveform. The development of high-performing neural network sound recognition systems has raised the possibility of…

Audio and Speech Processing · Electrical Eng. & Systems · 2025-11-18 · Mark R. Saddler, Andrew Francl, Jenelle Feather, Kaizhi Qian, Yang Zhang, Josh H. McDermott

We present a method for training a neural network to perform image denoising without access to clean training examples or access to paired noisy training examples. Our method requires only a single noisy realization of each training example…

Image and Video Processing · Electrical Eng. & Systems · 2019-10-29 · Nick Moran, Dan Schmidt, Yu Zhong, Patrick Coady

While neural-based text to speech (TTS) models can synthesize natural and intelligible voice, they usually require high-quality speech data, which is costly to collect. In many scenarios, only noisy speech of a target speaker is available,…

Audio and Speech Processing · Electrical Eng. & Systems · 2020-12-21 · Chen Zhang, Yi Ren, Xu Tan, Jinglin Liu, Kejun Zhang, Tao Qin, Sheng Zhao, Tie-Yan Liu

People often listen to music in noisy environments, seeking to isolate themselves from ambient sounds. Indeed, a music signal can mask some of the noise's frequency components due to the effect of simultaneous masking. In this article, we…

Sound · Computer Science · 2025-02-26 · Clémentine Berger, Roland Badeau, Slim Essid

Compared with traditional seismic noise attenuation algorithms that depend on signal models and their corresponding prior assumptions, a deep neural network for noise removal is trained on a large training set, where the inputs are…

Geophysics · Physics · 2019-07-23 · Siwei Yu, Jianwei Ma, Wenlong Wang

In many scientific applications, measured time series are corrupted by noise or distortions. Traditional denoising techniques often fail to recover the signal of interest, particularly when the signal-to-noise ratio is low or when certain…

Machine Learning · Computer Science · 2022-11-02 · Natalie Klein, Amber J. Day, Harris Mason, Michael W. Malone, Sinead A. Williamson

We present a framework to model the perceived quality of audio signals by combining convolutional architectures with ideas from classical signal processing, and describe an approach to enhancing perceived acoustical quality. We demonstrate…

Sound · Computer Science · 2019-12-13 · Prateek Verma, Jonathan Berger

In this paper, we propose a deep-learning-based system for the task of deepfake audio detection. In particular, the raw input audio is first transformed into various spectrograms using three transformation methods of Short-time Fourier…

Sound · Computer Science · 2024-07-03 · Lam Pham, Phat Lam, Truong Nguyen, Huyen Nguyen, Alexander Schindler

We present a neural network for rendering binaural speech from given monaural audio, position, and orientation of the source. Most previous work has focused on synthesizing binaural speech by conditioning the positions and…

Audio and Speech Processing · Electrical Eng. & Systems · 2023-05-02 · Jin Woo Lee, Kyogu Lee

Noise reduction techniques based on deep learning have demonstrated impressive performance in enhancing the overall quality of recorded speech. While these approaches are highly performant, their application in audio engineering can be…

Sound · Computer Science · 2023-10-18 · Christian J. Steinmetz, Thomas Walther, Joshua D. Reiss