Related papers: Learning to Denoise Historical Music

200 papers

In this study, we propose a dense frequency-time attentive network (DeFT-AN) for multichannel speech enhancement. DeFT-AN is a mask estimation network that predicts a complex spectral masking pattern for suppressing the noise and…

Audio and Speech Processing · Electrical Eng. & Systems 2023-03-07 Dongheon Lee, Jung-Woo Choi

We propose a test-time defense mechanism against adversarial attacks: imperceptible image perturbations that significantly alter the predictions of a model. Unlike existing methods that rely on feature filtering or smoothing, which can lead…

Computer Vision and Pattern Recognition · Computer Science 2025-10-06 Dong Lao, Yuxiang Zhang, Haniyeh Ehsani Oskouie, Yangchao Wu, Alex Wong, Stefano Soatto

In this study an Artificial Neural Network was trained to classify musical instruments, using audio samples transformed to the frequency domain. Different features of the sound, in both time and frequency domain, were analyzed and compared…

Sound · Computer Science 2017-05-16 Babak Toghiani-Rizi, Marcus Windmark

Time-frequency (TF) representations provide powerful and intuitive features for the analysis of time series such as audio. But still, generative modeling of audio in the TF domain is a subtle matter. Consequently, neural audio synthesis…

Sound · Computer Science 2019-05-17 Andrés Marafioti, Nicki Holighaus, Nathanaël Perraudin, Piotr Majdak

Music source separation is the task of extracting an estimate of one or more isolated sources or instruments (for example, drums or vocals) from musical audio. The task of music demixing or unmixing considers the case where the musical…

Sound · Computer Science 2021-12-13 Sevag Hanssian

We learn audio representations by solving a novel self-supervised learning task, which consists of predicting the phase of the short-time Fourier transform from its magnitude. A convolutional encoder is used to map the magnitude spectrum of…

Audio and Speech Processing · Electrical Eng. & Systems 2019-10-29 Félix de Chaumont Quitry, Marco Tagliasacchi, Dominik Roblek

We propose an algorithm to denoise speakers from a single microphone in the presence of non-stationary and dynamic noise. Our approach is inspired by the recent success of neural network models separating speakers from other speakers and…

Sound · Computer Science 2018-05-01 Jeff Hetherly, Paul Gamble, Maria Barrios, Cory Stephenson, Karl Ni

Deep learning had already demonstrated its power in medical images, including denoising, classification, segmentation, etc. All these applications are proposed to automatically analyze medical images beforehand, which brings more…

Image and Video Processing · Electrical Eng. & Systems 2020-11-05 Shao-Cheng Wen, Yu-Jen Chen, Zihao Liu, Wujie Wen, Xiaowei Xu, Yiyu Shi, Tsung-Yi Ho, Qianjun Jia, Meiping Huang, Jian Zhuang

The presence of noise is common in signal processing regardless of the signal type. Deep neural networks have shown good performance in noise removal, especially in the image domain. In this work, we consider deep neural networks as a…

Machine Learning · Computer Science 2020-07-07 Leslie Casas, Attila Klimmek, Nassir Navab, Vasileios Belagiannis

We consider audio decoding as an inverse problem and solve it through diffusion posterior sampling. Explicit conditioning functions are developed for input signal measurements provided by an example of a transform domain perceptual audio…

Audio and Speech Processing · Electrical Eng. & Systems 2024-09-13 Pedro J. Villasana T., Lars Villemoes, Janusz Klejsa, Per Hedelin

Despite the success of deep neural networks (DNNs) in image classification tasks, the human-level performance relies on massive training data with high-quality manual annotations, which are expensive and time-consuming to collect. There…

Machine Learning · Computer Science 2019-04-15 Junnan Li, Yongkang Wong, Qi Zhao, Mohan Kankanhalli

Deep learning has been widely adopted to tackle various code-based tasks by building deep code models based on a large amount of code snippets. While these deep code models have achieved great success, even state-of-the-art models suffer…

Software Engineering · Computer Science 2023-08-22 Zhao Tian, Junjie Chen, Xiangyu Zhang

Current self-supervised denoising methods for paired noisy images typically involve mapping one noisy image through the network to the other noisy image. However, after measuring the spectral bias of such methods using our proposed Image…

Computer Vision and Pattern Recognition · Computer Science 2025-10-02 Wang Zhang, Huaqiu Li, Xiaowan Hu, Tao Jiang, Zikang Chen, Haoqian Wang

Tunneling spectroscopy is an important tool for the study of both real-space and momentum-space electronic structure of correlated electron systems. However, such measurements often yield noisy data. Machine learning provides techniques to…

Fourier embedding has shown great promise in removing spectral bias during neural network training. However, it can still suffer from high generalization errors, especially when the labels or measurements are noisy. We demonstrate that…

Machine Learning · Computer Science 2024-09-04 Halyun Jeong, Jihun Han

Recovering a high-quality image from noisy indirect measurements is an important problem with many applications. For such inverse problems, supervised deep convolutional neural network (CNN)-based denoising methods have shown strong…

Image and Video Processing · Electrical Eng. & Systems 2020-09-16 Allard A. Hendriksen, Daniel M. Pelt, K. Joost Batenburg

Transformers have become central to recent advances in audio classification. However, training an audio spectrogram transformer, e.g. AST, from scratch can be resource and time-intensive. Furthermore, the complexity of transformers heavily…

Sound · Computer Science 2024-01-17 Jiu Feng, Mehmet Hamza Erol, Joon Son Chung, Arda Senocak

Audio denoising is critical in signal processing, enhancing intelligibility and fidelity for applications like restoring musical recordings. This paper presents a proof-of-concept for adapting a state-of-the-art neural audio codec, the…

Sound · Computer Science 2025-11-04 Daniel Jimon, Mircea Vaida, Adriana Stan

We propose the Neuralogram -- a deep neural network based representation for understanding audio signals which, as the name suggests, transforms an audio signal to a dense, compact representation based upon embeddings learned via a neural…

Sound · Computer Science 2019-04-11 Prateek Verma, Chris Chafe, Jonathan Berger

We present a representation learning method that learns features at multiple different levels of scale. Working within the unsupervised framework of denoising autoencoders, we observe that when the input is heavily corrupted during…

Machine Learning · Computer Science 2015-04-14 Krzysztof J. Geras, Charles Sutton