Related papers: Toward Deep Drum Source Separation

The Inverse Drum Machine: Source Separation Through Joint Transcription and Analysis-by-Synthesis

We present the Inverse Drum Machine, a novel approach to Drum Source Separation that leverages an analysis-by-synthesis framework combined with deep learning. Unlike recent supervised methods that require isolated stem recordings for…

Sound · Computer Science 2025-10-01 Bernardo Torres, Geoffroy Peeters, Gael Richard

Improving Real-Time Music Accompaniment Separation with MMDenseNet

Music source separation aims to separate polyphonic music into different types of sources. Most existing methods focus on enhancing the quality of separated results by using a larger model structure, rendering them unsuitable for deployment…

Sound · Computer Science 2024-07-02 Chun-Hsiang Wang, Chung-Che Wang, Jun-You Wang, Jyh-Shing Roger Jang, Yen-Hsun Chu

Music Source Separation Based on a Lightweight Deep Learning Framework (DTTNET: DUAL-PATH TFC-TDF UNET)

Music source separation (MSS) aims to extract 'vocals', 'drums', 'bass' and 'other' tracks from a piece of mixed music. While deep learning methods have shown impressive results, there is a trend toward larger models. In our paper, we…

Audio and Speech Processing · Electrical Eng. & Systems 2024-03-20 Junyu Chen, Susmitha Vekkot, Pancham Shukla

SynthSOD: Developing an Heterogeneous Dataset for Orchestra Music Source Separation

Recent advancements in music source separation have significantly progressed, particularly in isolating vocals, drums, and bass elements from mixed tracks. These developments owe much to the creation and use of large-scale, multitrack…

Audio and Speech Processing · Electrical Eng. & Systems 2025-02-18 Jaime Garcia-Martinez, David Diaz-Guerra, Archontis Politis, Tuomas Virtanen, Julio J. Carabias-Orti, Pedro Vera-Candeas

Cutting Music Source Separation Some Slakh: A Dataset to Study the Impact of Training Data Quality and Quantity

Music source separation performance has greatly improved in recent years with the advent of approaches based on deep learning. Such methods typically require large amounts of labelled training data, which in the case of music consist of…

Sound · Computer Science 2019-09-19 Ethan Manilow, Gordon Wichern, Prem Seetharaman, Jonathan Le Roux

ACMID: Automatic Curation of Musical Instrument Dataset for 7-Stem Music Source Separation

Most current music source separation (MSS) methods rely on supervised learning, limited by training data quantity and quality. Though web-crawling can bring abundant data, platform-level track labeling often causes metadata mismatches,…

Sound · Computer Science 2025-10-13 Ji Yu, Yang shuo, Xu Yuetonghui, Liu Mengmei, Ji Qiang, Han Zerui

Blind Source Separation in Polyphonic Music Recordings Using Deep Neural Networks Trained via Policy Gradients

We propose a method for the blind separation of sounds of musical instruments in audio signals. We describe the individual tones via a parametric model, training a dictionary to capture the relative amplitudes of the harmonics. The model…

Audio and Speech Processing · Electrical Eng. & Systems 2021-08-10 Sören Schulze, Johannes Leuschner, Emily J. King

Towards Realistic Synthetic Data for Automatic Drum Transcription

Deep learning models define the state-of-the-art in Automatic Drum Transcription (ADT), yet their performance is contingent upon large-scale, paired audio-MIDI datasets, which are scarce. Existing workarounds that use synthetic data often…

Sound · Computer Science 2026-01-15 Pierfrancesco Melucci, Paolo Merialdo, Taketo Akama

Fast accuracy estimation of deep learning based multi-class musical source separation

Music source separation represents the task of extracting all the instruments from a given song. Recent breakthroughs on this challenge have gravitated around a single dataset, MUSDB, only limited to four instrument classes. Larger datasets…

Sound · Computer Science 2021-12-02 Alexandru Mocanu, Benjamin Ricaud, Milos Cernak

SCNet: Sparse Compression Network for Music Source Separation

Deep learning-based methods have made significant achievements in music source separation. However, obtaining good results while maintaining a low model complexity remains challenging in super wide-band music source separation. Previous…

Audio and Speech Processing · Electrical Eng. & Systems 2024-01-25 Weinan Tong, Jiaxu Zhu, Jun Chen, Shiyin Kang, Tao Jiang, Yang Li, Zhiyong Wu, Helen Meng

MMDenseLSTM: An efficient combination of convolutional and recurrent neural networks for audio source separation

Deep neural networks have become an indispensable technique for audio source separation (ASS). It was recently reported that a variant of CNN architecture called MMDenseNet was successfully employed to solve the ASS problem of estimating…

Sound · Computer Science 2018-05-30 Naoya Takahashi, Nabarun Goswami, Yuki Mitsufuji

Improving Perceptual Quality of Drum Transcription with the Expanded Groove MIDI Dataset

We introduce the Expanded Groove MIDI dataset (E-GMD), an automatic drum transcription (ADT) dataset that contains 444 hours of audio from 43 drum kits, making it an order of magnitude larger than similar datasets, and the first with…

Sound · Computer Science 2020-12-02 Lee Callender, Curtis Hawthorne, Jesse Engel

Music Source Separation in the Waveform Domain

Source separation for music is the task of isolating contributions, or stems, from different instruments recorded individually and arranged together to form a song. Such components include voice, bass, drums and any other…

Sound · Computer Science 2021-04-29 Alexandre Défossez, Nicolas Usunier, Léon Bottou, Francis Bach

Classical Guitar Duet Separation using GuitarDuets -- a Dataset of Real and Synthesized Guitar Recordings

Recent advancements in music source separation (MSS) have focused in the multi-timbral case, with existing architectures tailored for the separation of distinct instruments, overlooking thus the challenge of separating instruments with…

Audio and Speech Processing · Electrical Eng. & Systems 2025-07-03 Marios Glytsos, Christos Garoufis, Athanasia Zlatintsi, Petros Maragos

CatNet: music source separation system with mix-audio augmentation

Music source separation (MSS) is the task of separating a music piece into individual sources, such as vocals and accompaniment. Recently, neural network based methods have been applied to address the MSS problem, and can be categorized…

Sound · Computer Science 2021-02-22 Xuchen Song, Qiuqiang Kong, Xingjian Du, Yuxuan Wang

A Stem-Agnostic Single-Decoder System for Music Source Separation Beyond Four Stems

Despite significant recent progress across multiple subtasks of audio source separation, few music source separation systems support separation beyond the four-stem vocals, drums, bass, and other (VDBO) setup. Of the very few current…

Sound · Computer Science 2024-08-27 Karn N. Watcharasupat, Alexander Lerch

Unsupervised Music Source Separation Using Differentiable Parametric Source Models

Supervised deep learning approaches to underdetermined audio source separation achieve state-of-the-art performance but require a dataset of mixtures along with their corresponding isolated source signals. Such datasets can be extremely…

Sound · Computer Science 2023-02-01 Kilian Schulze-Forster, Gaël Richard, Liam Kelley, Clement S. J. Doire, Roland Badeau

Spectrogram Feature Losses for Music Source Separation

In this paper we study deep learning-based music source separation, and explore using an alternative loss to the standard spectrogram pixel-level L2 loss for model training. Our main contribution is in demonstrating that adding a high-level…

Sound · Computer Science 2019-06-28 Abhimanyu Sahai, Romann Weber, Brian McWilliams

Source Separation & Automatic Transcription for Music

Source separation is the process of isolating individual sounds in an auditory mixture of multiple sounds [1], and has a variety of applications ranging from speech enhancement and lyric transcription [2] to digital audio production for…

Sound · Computer Science 2024-12-10 Bradford Derby, Lucas Dunker, Samarth Galchar, Shashank Jarmale, Akash Setti

Moisesdb: A dataset for source separation beyond 4-stems

In this paper, we introduce the MoisesDB dataset for musical source separation. It consists of 240 tracks from 45 artists, covering twelve musical genres. For each song, we provide its individual audio sources, organized in a two-level…

Sound · Computer Science 2023-08-01 Igor Pereira, Felipe Araújo, Filip Korzeniowski, Richard Vogl