English
Related papers

Related papers: Learning to Denoise Historical Music

200 papers

Falsely annotated samples, also known as noisy labels, can significantly harm the performance of deep learning models. Two main approaches for learning with noisy labels are global noise estimation and data filtering. Global noise…

Machine Learning · Computer Science 2025-07-31 Yuval Grinberg , Nimrod Harel , Jacob Goldberger , Ofir Lindenbaum

Continuous speech can be converted into a discrete sequence by deriving discrete units from the hidden features of self-supervised learned (SSL) speech models. Although SSL models are becoming larger and trained on more data, they are often…

Audio and Speech Processing · Electrical Eng. & Systems 2025-02-06 Jakob Poncelet , Yujun Wang , Hugo Van hamme

A significant research problem of recent interest is the localization of targets like vessels, surgical needles, and tumors in photoacoustic (PA) images. To achieve accurate localization, a high photoacoustic signal-to-noise ratio (SNR) is…

Image and Video Processing · Electrical Eng. & Systems 2021-05-03 Amirsaeed Yazdani , Sumit Agrawal , Kerrick Johnstonbaugh , Sri-Rajasekhar Kothapalli , Vishal Monga

The decomposition of sounds into sines, transients, and noise is a long-standing research problem in audio processing. The current solutions for this three-way separation detect either horizontal and vertical structures or anisotropy and…

Audio and Speech Processing · Electrical Eng. & Systems 2022-12-01 Leonardo Fierro , Vesa Välimäki

Temporal data such as time series can be viewed as discretized measurements of the underlying function. To build a generative model for such data we have to model the stochastic process that governs it. We propose a solution by defining the…

Machine Learning · Computer Science 2023-05-22 Marin Biloš , Kashif Rasul , Anderson Schneider , Yuriy Nevmyvaka , Stephan Günnemann

Modern digital cameras rely on the sequential execution of separate image processing steps to produce realistic images. The first two steps are usually related to denoising and demosaicking where the former aims to reduce noise from the…

Computer Vision and Pattern Recognition · Computer Science 2019-04-02 Filippos Kokkinos , Stamatios Lefkimmiatis

This thesis is presenting a method for generating short musical phrases using a deep convolutional generative adversarial network (DCGAN). To train neural network were used datasets of classical and jazz music MIDI recordings. Our approach…

Sound · Computer Science 2019-12-24 Mateusz Dorobek

Traditionally, music was treated as an analogue signal and was generated manually. In recent years, music is conspicuous to technology which can generate a suite of music automatically without any human intervention. To accomplish this…

Sound · Computer Science 2019-08-06 Sanidhya Mangal , Rahul Modak , Poorva Joshi

Distantly-labeled data can be used to scale up training of statistical models, but it is typically noisy and that noise can vary with the distant labeling technique. In this work, we propose a two-stage procedure for handling this type of…

Computation and Language · Computer Science 2019-05-07 Yasumasa Onoe , Greg Durrett

Recently, denoising methods based on supervised learning have exhibited promising performance. However, their reliance on external datasets containing noisy-clean image pairs restricts their applicability. To address this limitation,…

Computer Vision and Pattern Recognition · Computer Science 2023-07-21 Jaekyun Ko , Sanghwan Lee

One key step in audio signal processing is to transform the raw signal into representations that are efficient for encoding the original information. Traditionally, people transform the audio into spectral representations, as a function of…

Sound · Computer Science 2016-11-30 Shuhui Qu , Juncheng Li , Wei Dai , Samarjit Das

We reconstruct a closed denoised curve from an unstructured and highly noisy 2D point cloud. Our proposed method uses a two- pass approach: Previously recovered manifold connectivity is used for ordering noisy samples along this manifold…

Graphics · Computer Science 2018-08-24 Stefan Ohrhallinger , Michael Wimmer

Denoising diffusion models have recently shown impressive results in generative tasks. By learning powerful priors from huge collections of training images, such models are able to gradually modify complete noise to a clean natural image…

Computer Vision and Pattern Recognition · Computer Science 2023-06-29 Naama Pearl , Yaron Brodsky , Dana Berman , Assaf Zomet , Alex Rav Acha , Daniel Cohen-Or , Dani Lischinski

We propose an ECG denoising method based on a feed forward neural network with three hidden layers. Particulary useful for very noisy signals, this approach uses the available ECG channels to reconstruct a noisy channel. We tested the…

Computational Engineering, Finance, and Science · Computer Science 2012-12-21 Rui Rodrigues , Paula Couto

In recent years, the synchrosqueezing transform (SST) has gained popularity as a method for the analysis of signals that can be broken down into multiple components determined by instantaneous amplitudes and phases. One such version of SST,…

Numerical Analysis · Mathematics 2017-09-20 Alexander Berrian , Naoki Saito

The core challenge of hyperspectral image denoising is striking the right balance between data fidelity and noise prior modeling. Most existing methods place too much emphasis on the intrinsic priors of the image while overlooking diverse…

Computer Vision and Pattern Recognition · Computer Science 2026-04-22 Xuelin Xie , Xiliang Lu , Zhengshan Wang , Yang Zhang , Long Chen

We propose a novel pitch estimation technique called DeepF0, which leverages the available annotated data to directly learns from the raw audio in a data-driven manner. F0 estimation is important in various speech processing and music…

Audio and Speech Processing · Electrical Eng. & Systems 2021-02-15 Satwinder Singh , Ruili Wang , Yuanhang Qiu

Deep convolutional neural networks (CNNs) have been actively adopted in the field of music information retrieval, e.g. genre classification, mood detection, and chord recognition. However, the process of learning and prediction is little…

Machine Learning · Computer Science 2016-07-11 Keunwoo Choi , George Fazekas , Mark Sandler

Ultrafast electron beam X-ray computed tomography produces noisy data due to short measurement times, causing reconstruction artifacts and limiting overall image quality. To counteract these issues, two self-supervised deep learning methods…

Machine Learning · Computer Science 2025-11-24 Israt Jahan Tulin , Sebastian Starke , Dominic Windisch , André Bieberle , Peter Steinbach

While both the data volume and heterogeneity of the digital music content is huge, it has become increasingly important and convenient to build a recommendation or search system to facilitate surfacing these content to the user or consumer…