Related papers: Audio query-based music source separation

Unsupervised Music Source Separation Using Differentiable Parametric Source Models

Supervised deep learning approaches to underdetermined audio source separation achieve state-of-the-art performance but require a dataset of mixtures along with their corresponding isolated source signals. Such datasets can be extremely…

Sound · Computer Science 2023-02-01 Kilian Schulze-Forster, Gaël Richard, Liam Kelley, Clement S. J. Doire, Roland Badeau

Self-Supervised Music Source Separation Using Vector-Quantized Source Category Estimates

Music source separation is focused on extracting distinct sonic elements from composite tracks. Historically, many methods have been grounded in supervised learning, necessitating labeled data, which is occasionally constrained in its…

Sound · Computer Science 2023-11-23 Marco Pasini, Stefan Lattner, George Fazekas

A Recurrent Encoder-Decoder Approach with Skip-filtering Connections for Monaural Singing Voice Separation

The objective of deep learning methods based on encoder-decoder architectures for music source separation is to approximate either ideal time-frequency masks or spectral representations of the target music source(s). The spectral…

Sound · Computer Science 2018-04-25 Stylianos Ioannis Mimilakis, Konstantinos Drossos, Tuomas Virtanen, Gerald Schuller

Efficient and Fast Generative-Based Singing Voice Separation using a Latent Diffusion Model

Extracting individual elements from music mixtures is a valuable tool for music production and practice. While neural networks optimized to mask or transform mixture spectrograms into the individual source(s) have been the leading approach,…

Sound · Computer Science 2025-11-26 Genís Plaja-Roglans, Yun-Ning Hung, Xavier Serra, Igor Pereira

Music Source Separation Using Stacked Hourglass Networks

In this paper, we propose a simple yet effective method for multiple music source separation using convolutional neural networks. Stacked hourglass network, which was originally designed for human pose estimation in natural images, is…

Sound · Computer Science 2018-06-25 Sungheon Park, Taehoon Kim, Kyogu Lee, Nojun Kwak

Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation

Deep neural network based methods have been successfully applied to music source separation. They typically learn a mapping from a mixture spectrogram to a set of source spectrograms, all with magnitudes only. This approach has several…

Sound · Computer Science 2021-09-14 Qiuqiang Kong, Yin Cao, Haohe Liu, Keunwoo Choi, Yuxuan Wang

Class-conditional embeddings for music source separation

Isolating individual instruments in a musical mixture has a myriad of potential applications, and seems imminently achievable given the levels of performance reached by recent deep learning methods. While most musical source separation…

Sound · Computer Science 2018-11-08 Prem Seetharaman, Gordon Wichern, Shrikant Venkataramani, Jonathan Le Roux

Demucs: Deep Extractor for Music Sources with extra unlabeled data remixed

We study the problem of source separation for music using deep learning with four known sources: drums, bass, vocals and other accompaniments. State-of-the-art approaches predict soft masks over mixture spectrograms while methods working on…

Sound · Computer Science 2019-09-04 Alexandre Défossez, Nicolas Usunier, Léon Bottou, Francis Bach

Improving Real-Time Music Accompaniment Separation with MMDenseNet

Music source separation aims to separate polyphonic music into different types of sources. Most existing methods focus on enhancing the quality of separated results by using a larger model structure, rendering them unsuitable for deployment…

Sound · Computer Science 2024-07-02 Chun-Hsiang Wang, Chung-Che Wang, Jun-You Wang, Jyh-Shing Roger Jang, Yen-Hsun Chu

Hybrid Y-Net Architecture for Singing Voice Separation

This research paper presents a novel deep learning-based neural network architecture, named Y-Net, for achieving music source separation. The proposed architecture performs end-to-end hybrid source separation by extracting features from…

Sound · Computer Science 2023-03-07 Rashen Fernando, Pamudu Ranasinghe, Udula Ranasinghe, Janaka Wijayakulasooriya, Pantaleon Perera

Content Based Singing Voice Extraction From a Musical Mixture

We present a deep learning based methodology for extracting the singing voice signal from a musical mixture based on the underlying linguistic content. Our model follows an encoder decoder architecture and takes as input the magnitude…

Audio and Speech Processing · Electrical Eng. & Systems 2020-02-18 Pritish Chandna, Merlijn Blaauw, Jordi Bonada, Emilia Gomez

Adversarial Semi-Supervised Audio Source Separation applied to Singing Voice Extraction

The state of the art in music source separation employs neural networks trained in a supervised fashion on multi-track databases to estimate the sources from a given mixture. With only few datasets available, often extensive data…

Machine Learning · Computer Science 2018-04-09 Daniel Stoller, Sebastian Ewert, Simon Dixon

Unsupervised Source Separation By Steering Pretrained Music Models

We showcase an unsupervised method that repurposes deep models trained for music generation and music tagging for audio source separation, without any retraining. An audio generation model is conditioned on an input mixture, producing a…

Sound · Computer Science 2021-10-26 Ethan Manilow, Patrick O'Reilly, Prem Seetharaman, Bryan Pardo

Fast accuracy estimation of deep learning based multi-class musical source separation

Music source separation represents the task of extracting all the instruments from a given song. Recent breakthroughs on this challenge have gravitated around a single dataset, MUSDB, only limited to four instrument classes. Larger datasets…

Sound · Computer Science 2021-12-02 Alexandru Mocanu, Benjamin Ricaud, Milos Cernak

CatNet: music source separation system with mix-audio augmentation

Music source separation (MSS) is the task of separating a music piece into individual sources, such as vocals and accompaniment. Recently, neural network based methods have been applied to address the MSS problem, and can be categorized…

Sound · Computer Science 2021-02-22 Xuchen Song, Qiuqiang Kong, Xingjian Du, Yuxuan Wang

Music Source Separation with Generative Flow

Fully-supervised models for source separation are trained on parallel mixture-source data and are currently state-of-the-art. However, such parallel data is often difficult to obtain, and it is cumbersome to adapt trained models to mixtures…

Audio and Speech Processing · Electrical Eng. & Systems 2022-11-30 Ge Zhu, Jordan Darefsky, Fei Jiang, Anton Selitskiy, Zhiyao Duan

Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data

Deep learning techniques for separating audio into different sound sources face several challenges. Standard architectures require training separate models for different types of audio sources. Although some universal separators employ a…

Sound · Computer Science 2022-02-15 Ke Chen, Xingjian Du, Bilei Zhu, Zejun Ma, Taylor Berg-Kirkpatrick, Shlomo Dubnov

Multi-scale Multi-band DenseNets for Audio Source Separation

This paper deals with the problem of audio source separation. To handle the complex and ill-posed nature of the problems of audio source separation, the current state-of-the-art approaches employ deep neural networks to obtain instrumental…

Sound · Computer Science 2017-06-30 Naoya Takahashi, Yuki Mitsufuji

Audio Source Separation Using a Deep Autoencoder

This paper proposes a novel framework for unsupervised audio source separation using a deep autoencoder. The characteristics of the unknown source signals in the mixed input are automatically learned by properly configured autoencoders implemented…

Sound · Computer Science 2014-12-24 Giljin Jang, Han-Gyu Kim, Yung-Hwan Oh

HTMD-Net: A Hybrid Masking-Denoising Approach to Time-Domain Monaural Singing Voice Separation

The advent of deep learning has led to the prevalence of deep neural network architectures for monaural music source separation, with end-to-end approaches that operate directly on the waveform level increasingly receiving research…

Audio and Speech Processing · Electrical Eng. & Systems 2021-03-09 Christos Garoufis, Athanasia Zlatintsi, Petros Maragos