Related papers: Hybrid Spectrogram and Waveform Source Separation

Music Source Separation in the Waveform Domain

Source separation for music is the task of isolating contributions, or stems, from different instruments recorded individually and arranged together to form a song. Such components include voice, bass, drums and any other…

Sound · Computer Science 2021-04-29 Alexandre Défossez , Nicolas Usunier , Léon Bottou , Francis Bach

Hybrid Transformers for Music Source Separation

A natural question arising in Music Source Separation (MSS) is whether long range contextual information is useful, or whether local acoustic features are sufficient. In other fields, attention based Transformers have shown their ability to…

Audio and Speech Processing · Electrical Eng. & Systems 2022-11-17 Simon Rouard , Francisco Massa , Alexandre Défossez

Demucs: Deep Extractor for Music Sources with extra unlabeled data remixed

We study the problem of source separation for music using deep learning with four known sources: drums, bass, vocals and other accompaniments. State-of-the-art approaches predict soft masks over mixture spectrograms while methods working on…

Sound · Computer Science 2019-09-04 Alexandre Défossez , Nicolas Usunier , Léon Bottou , Francis Bach

Real-time Low-latency Music Source Separation using Hybrid Spectrogram-TasNet

There have been significant advances in deep learning for music demixing in recent years. However, there has been little attention given to how these neural networks can be adapted for real-time low-latency applications, which could be…

Audio and Speech Processing · Electrical Eng. & Systems 2024-02-28 Satvik Venkatesh , Arthur Benilov , Philip Coleman , Frederic Roskam

Danna-Sep: Unite to separate them all

Deep learning-based music source separation has gained a lot of interest in the last decades. Most of the existing methods operate with either spectrograms or waveforms. Spectrogram based models learn suitable masks for separating magnitude…

Audio and Speech Processing · Electrical Eng. & Systems 2021-12-10 Chin-Yun Yu , Kin-Wai Cheuk

End-to-end music source separation: is it possible in the waveform domain?

Most of the currently successful source separation techniques use the magnitude spectrogram as input, and are therefore by default omitting part of the signal: the phase. To avoid omitting potentially useful information, we study the…

Sound · Computer Science 2019-07-01 Francesc Lluís , Jordi Pons , Xavier Serra

The Sound Demixing Challenge 2023 $\unicode{x2013}$ Music Demixing Track

This paper summarizes the music demixing (MDX) track of the Sound Demixing Challenge (SDX'23). We provide a summary of the challenge setup and introduce the task of robust music source separation (MSS), i.e., training MSS models in the…

Audio and Speech Processing · Electrical Eng. & Systems 2024-04-22 Giorgio Fabbro , Stefan Uhlich , Chieh-Hsin Lai , Woosung Choi , Marco Martínez-Ramírez , Weihsiang Liao , Igor Gadelha , Geraldo Ramos , Eddie Hsu , Hugo Rodrigues , Fabian-Robert Stöter , Alexandre Défossez , Yi Luo , Jianwei Yu , Dipam Chakraborty , Sharada Mohanty , Roman Solovyev , Alexander Stempkovskiy , Tatiana Habruseva , Nabarun Goswami , Tatsuya Harada , Minseok Kim , Jun Hyung Lee , Yuanliang Dong , Xinran Zhang , Jiafeng Liu , Yuki Mitsufuji

Music Demixing Challenge 2021

Music source separation has been intensively studied in the last decade and tremendous progress with the advent of deep learning could be observed. Evaluation campaigns such as MIREX or SiSEC connected state-of-the-art models and…

Audio and Speech Processing · Electrical Eng. & Systems 2022-05-24 Yuki Mitsufuji , Giorgio Fabbro , Stefan Uhlich , Fabian-Robert Stöter , Alexandre Défossez , Minseok Kim , Woosung Choi , Chin-Yun Yu , Kin-Wai Cheuk

Benchmarks and leaderboards for sound demixing tasks

Music demixing is the task of separating different tracks from the given single audio signal into components, such as drums, bass, and vocals from the rest of the accompaniment. Separation of sources is useful for a range of areas,…

Sound · Computer Science 2024-05-08 Roman Solovyev , Alexander Stempkovskiy , Tatiana Habruseva

Music Source Separation with Band-split RNN

The performance of music source separation (MSS) models has been greatly improved in recent years thanks to the development of novel neural network architectures and training pipelines. However, recent model designs for MSS were mainly…

Audio and Speech Processing · Electrical Eng. & Systems 2022-10-03 Yi Luo , Jianwei Yu

SCNet: Sparse Compression Network for Music Source Separation

Deep learning-based methods have made significant achievements in music source separation. However, obtaining good results while maintaining a low model complexity remains challenging in super wide-band music source separation. Previous…

Audio and Speech Processing · Electrical Eng. & Systems 2024-01-25 Weinan Tong , Jiaxu Zhu , Jun Chen , Shiyin Kang , Tao Jiang , Yang Li , Zhiyong Wu , Helen Meng

Music Enhancement with Deep Filters: A Technical Report for The ICASSP 2024 Cadenza Challenge

In this challenge, we disentangle the deep filters from the original DeepfilterNet and incorporate them into our Spec-UNet-based network to further improve a hybrid Demucs (hdemucs) based remixing pipeline. The motivation behind the use of…

Sound · Computer Science 2024-04-18 Keren Shao , Ke Chen , Shlomo Dubnov

Hybrid Y-Net Architecture for Singing Voice Separation

This research paper presents a novel deep learning-based neural network architecture, named Y-Net, for achieving music source separation. The proposed architecture performs end-to-end hybrid source separation by extracting features from…

Sound · Computer Science 2023-03-07 Rashen Fernando , Pamudu Ranasinghe , Udula Ranasinghe , Janaka Wijayakulasooriya , Pantaleon Perera

Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation

Deep neural network based methods have been successfully applied to music source separation. They typically learn a mapping from a mixture spectrogram to a set of source spectrograms, all with magnitudes only. This approach has several…

Sound · Computer Science 2021-09-14 Qiuqiang Kong , Yin Cao , Haohe Liu , Keunwoo Choi , Yuxuan Wang

Spectrogram Feature Losses for Music Source Separation

In this paper we study deep learning-based music source separation, and explore using an alternative loss to the standard spectrogram pixel-level L2 loss for model training. Our main contribution is in demonstrating that adding a high-level…

Sound · Computer Science 2019-06-28 Abhimanyu Sahai , Romann Weber , Brian McWilliams

Source Separation and Depthwise Separable Convolutions for Computer Audition

Given recent advances in deep music source separation, we propose a feature representation method that combines source separation with a state-of-the-art representation learning technique that is suitably repurposed for computer audition…

Sound · Computer Science 2020-12-08 Gabriel Mersy , Jin Hong Kuan

Music demixing with the sliCQ transform

Music source separation is the task of extracting an estimate of one or more isolated sources or instruments (for example, drums or vocals) from musical audio. The task of music demixing or unmixing considers the case where the musical…

Sound · Computer Science 2021-12-13 Sevag Hanssian

Sound Demixing Challenge 2023 Music Demixing Track Technical Report: TFC-TDF-UNet v3

In this report, we present our award-winning solutions for the Music Demixing Track of Sound Demixing Challenge 2023. First, we propose TFC-TDF-UNet v3, a time-efficient music source separation model that achieves state-of-the-art results…

Sound · Computer Science 2023-07-24 Minseok Kim , Jun Hyung Lee , Soonyoung Jung

Multi-scale Multi-band DenseNets for Audio Source Separation

This paper deals with the problem of audio source separation. To handle the complex and ill-posed nature of the problems of audio source separation, the current state-of-the-art approaches employ deep neural networks to obtain instrumental…

Sound · Computer Science 2017-06-30 Naoya Takahashi , Yuki Mitsufuji

Improving Music Source Separation with Diffusion and Consistency Refinement

In this work, we propose an approach to music source separation that uses a generative diffusion model as a last-stage refinement on top of a deterministic separator, progressively enhancing the separated sources through iterative…

Sound · Computer Science 2026-04-28 Tornike Karchkhadze , Mohammad Rasool Izadi , Shuo Zhang , Shlomo Dubnov