Related papers: Learning to Denoise Historical Music

DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement

In this study, we propose a dense frequency-time attentive network (DeFT-AN) for multichannel speech enhancement. DeFT-AN is a mask estimation network that predicts a complex spectral masking pattern for suppressing the noise and…

Audio and Speech Processing · Electrical Eng. & Systems 2023-03-07 Dongheon Lee , Jung-Woo Choi

Test-Time Defense Against Adversarial Attacks via Stochastic Resonance of Latent Ensembles

We propose a test-time defense mechanism against adversarial attacks: imperceptible image perturbations that significantly alter the predictions of a model. Unlike existing methods that rely on feature filtering or smoothing, which can lead…

Computer Vision and Pattern Recognition · Computer Science 2025-10-06 Dong Lao , Yuxiang Zhang , Haniyeh Ehsani Oskouie , Yangchao Wu , Alex Wong , Stefano Soatto

Musical Instrument Recognition Using Their Distinctive Characteristics in Artificial Neural Networks

In this study an Artificial Neural Network was trained to classify musical instruments, using audio samples transformed to the frequency domain. Different features of the sound, in both time and frequency domain, were analyzed and compared…

Sound · Computer Science 2017-05-16 Babak Toghiani-Rizi , Marcus Windmark

Adversarial Generation of Time-Frequency Features with application in audio synthesis

Time-frequency (TF) representations provide powerful and intuitive features for the analysis of time series such as audio. But still, generative modeling of audio in the TF domain is a subtle matter. Consequently, neural audio synthesis…

Sound · Computer Science 2019-05-17 Andrés Marafioti , Nicki Holighaus , Nathanaël Perraudin , Piotr Majdak

Music demixing with the sliCQ transform

Music source separation is the task of extracting an estimate of one or more isolated sources or instruments (for example, drums or vocals) from musical audio. The task of music demixing or unmixing considers the case where the musical…

Sound · Computer Science 2021-12-13 Sevag Hanssian

Learning audio representations via phase prediction

We learn audio representations by solving a novel self-supervised learning task, which consists of predicting the phase of the short-time Fourier transform from its magnitude. A convolutional encoder is used to map the magnitude spectrum of…

Audio and Speech Processing · Electrical Eng. & Systems 2019-10-29 Félix de Chaumont Quitry , Marco Tagliasacchi , Dominik Roblek

Deep Speech Denoising with Vector Space Projections

We propose an algorithm to denoise speakers from a single microphone in the presence of non-stationary and dynamic noise. Our approach is inspired by the recent success of neural network models separating speakers from other speakers and…

Sound · Computer Science 2018-05-01 Jeff Hetherly , Paul Gamble , Maria Barrios , Cory Stephenson , Karl Ni

Do Noises Bother Human and Neural Networks In the Same Way? A Medical Image Analysis Perspective

Deep learning had already demonstrated its power in medical images, including denoising, classification, segmentation, etc. All these applications are proposed to automatically analyze medical images beforehand, which brings more…

Image and Video Processing · Electrical Eng. & Systems 2020-11-05 Shao-Cheng Wen , Yu-Jen Chen , Zihao Liu , Wujie Wen , Xiaowei Xu , Yiyu Shi , Tsung-Yi Ho , Qianjun Jia , Meiping Huang , Jian Zhuang

Adversarial Signal Denoising with Encoder-Decoder Networks

The presence of noise is common in signal processing regardless the signal type. Deep neural networks have shown good performance in noise removal, especially on the image domain. In this work, we consider deep neural networks as a…

Machine Learning · Computer Science 2020-07-07 Leslie Casas , Attila Klimmek , Nassir Navab , Vasileios Belagiannis

Audio Decoding by Inverse Problem Solving

We consider audio decoding as an inverse problem and solve it through diffusion posterior sampling. Explicit conditioning functions are developed for input signal measurements provided by an example of a transform domain perceptual audio…

Audio and Speech Processing · Electrical Eng. & Systems 2024-09-13 Pedro J. Villasana T. , Lars Villemoes , Janusz Klejsa , Per Hedelin

Learning to Learn from Noisy Labeled Data

Despite the success of deep neural networks (DNNs) in image classification tasks, the human-level performance relies on massive training data with high-quality manual annotations, which are expensive and time-consuming to collect. There…

Machine Learning · Computer Science 2019-04-15 Junnan Li , Yongkang Wong , Qi Zhao , Mohan Kankanhalli

On-the-fly Improving Performance of Deep Code Models via Input Denoising

Deep learning has been widely adopted to tackle various code-based tasks by building deep code models based on a large amount of code snippets. While these deep code models have achieved great success, even state-of-the-art models suffer…

Software Engineering · Computer Science 2023-08-22 Zhao Tian , Junjie Chen , Xiangyu Zhang

Measuring and Controlling the Spectral Bias for Self-Supervised Image Denoising

Current self-supervised denoising methods for paired noisy images typically involve mapping one noisy image through the network to the other noisy image. However, after measuring the spectral bias of such methods using our proposed Image…

Computer Vision and Pattern Recognition · Computer Science 2025-10-02 Wang Zhang , Huaqiu Li , Xiaowan Hu , Tao Jiang , Zikang Chen , Haoqian Wang

Self-supervised learning for denoising quasiparticle interference data

Tunneling spectroscopy is an important tool for the study of both real-space and momentum-space electronic structure of correlated electron systems. However, such measurements often yield noisy data. Machine learning provides techniques to…

Superconductivity · Physics 2024-09-16 Ilse S. Kuijf , Willem O. Tromp , Tjerk Benschop , Niño Philip Ramones , Miguel Antonio Sulangi , Evert P. L. van Nieuwenburg , Milan P. Allan

Robust Fourier Neural Networks

Fourier embedding has shown great promise in removing spectral bias during neural network training. However, it can still suffer from high generalization errors, especially when the labels or measurements are noisy. We demonstrate that…

Machine Learning · Computer Science 2024-09-04 Halyun Jeong , Jihun Han

Noise2Inverse: Self-supervised deep convolutional denoising for tomography

Recovering a high-quality image from noisy indirect measurements is an important problem with many applications. For such inverse problems, supervised deep convolutional neural network (CNN)-based denoising methods have shown strong…

Image and Video Processing · Electrical Eng. & Systems 2020-09-16 Allard A. Hendriksen , Daniel M. Pelt , K. Joost Batenburg

From Coarse to Fine: Efficient Training for Audio Spectrogram Transformers

Transformers have become central to recent advances in audio classification. However, training an audio spectrogram transformer, e.g. AST, from scratch can be resource and time-intensive. Furthermore, the complexity of transformers heavily…

Sound · Computer Science 2024-01-17 Jiu Feng , Mehmet Hamza Erol , Joon Son Chung , Arda Senocak

ADNAC: Audio Denoiser using Neural Audio Codec

Audio denoising is critical in signal processing, enhancing intelligibility and fidelity for applications like restoring musical recordings. This paper presents a proof-of-concept for adapting a state-of-the-art neural audio codec, the…

Sound · Computer Science 2025-11-04 Daniel Jimon , Mircea Vaida , Adriana Stan

Neuralogram: A Deep Neural Network Based Representation for Audio Signals

We propose the Neuralogram -- a deep neural network based representation for understanding audio signals which, as the name suggests, transforms an audio signal to a dense, compact representation based upon embeddings learned via a neural…

Sound · Computer Science 2019-04-11 Prateek Verma , Chris Chafe , Jonathan Berger

Scheduled denoising autoencoders

We present a representation learning method that learns features at multiple different levels of scale. Working within the unsupervised framework of denoising autoencoders, we observe that when the input is heavily corrupted during…

Machine Learning · Computer Science 2015-04-14 Krzysztof J. Geras , Charles Sutton