Related papers: Complete and separate: Conditional separation with…

Optimal Condition Training for Target Source Separation

Recent research has shown remarkable performance in leveraging multiple extraneous conditional and non-mutually exclusive semantic concepts for sound source separation, allowing the flexibility to extract a given target source based on…

Sound · Computer Science 2022-11-14 Efthymios Tzinis , Gordon Wichern , Paris Smaragdis , Jonathan Le Roux

Heterogeneous Target Speech Separation

We introduce a new paradigm for single-channel target source separation where the sources of interest can be distinguished using non-mutually exclusive concepts (e.g., loudness, gender, language, spatial location, etc). Our proposed…

Sound · Computer Science 2022-11-14 Efthymios Tzinis , Gordon Wichern , Aswin Subramanian , Paris Smaragdis , Jonathan Le Roux

Conditioned Source Separation for Music Instrument Performances

In music source separation, the number of sources may vary for each piece and some of the sources may belong to the same family of instruments, thus sharing timbral characteristics and making the sources more correlated. This leads to…

Sound · Computer Science 2021-07-09 Olga Slizovskaia , Gloria Haro , Emilia Gómez

Adversarial Semi-Supervised Audio Source Separation applied to Singing Voice Extraction

The state of the art in music source separation employs neural networks trained in a supervised fashion on multi-track databases to estimate the sources from a given mixture. With only few datasets available, often extensive data…

Machine Learning · Computer Science 2018-04-09 Daniel Stoller , Sebastian Ewert , Simon Dixon

Weakly-supervised Audio Separation via Bi-modal Semantic Similarity

Conditional sound separation in multi-source audio mixtures without having access to single source sound data during training is a long standing challenge. Existing mix-and-separate based methods suffer from significant performance drop…

Sound · Computer Science 2024-04-03 Tanvir Mahmud , Saeed Amizadeh , Kazuhito Koishida , Diana Marculescu

Application of multi-objective optimization to blind source separation

Several problems in signal processing are addressed by expert systems which take into account a set of priors on the sought signals and systems. For instance, blind source separation is often tackled by means of a mono-objective formulation…

Signal Processing · Electrical Eng. & Systems 2020-02-07 Guilherme Dean Pelegrina , Romis Attux , Leonardo Tomazeli Duarte

Topic Over Source: The Key to Effective Data Mixing for Language Models Pre-training

The performance of large language models (LLMs) is significantly affected by the quality and composition of their pre-training data, which is inherently diverse, spanning various languages, sources, and topics. Effectively integrating these…

Computation and Language · Computer Science 2025-08-11 Jiahui Peng , Xinlin Zhuang , Jiantao Qiu , Ren Ma , Jing Yu , He Zhu , Conghui He

Training-Free Multi-Step Audio Source Separation

Audio source separation aims to separate a mixture into target sources. Previous audio source separation systems usually conduct one-step inference, which does not fully explore the separation ability of models. In this work, we reveal that…

Sound · Computer Science 2025-05-27 Yongyi Zang , Jingyi Li , Qiuqiang Kong

Training Noisy Single-Channel Speech Separation With Noisy Oracle Sources: A Large Gap and A Small Step

As the performance of single-channel speech separation systems has improved, there has been a desire to move to more challenging conditions than the clean, near-field speech that initial systems were developed on. When training deep…

Audio and Speech Processing · Electrical Eng. & Systems 2021-02-23 Matthew Maciejewski , Jing Shi , Shinji Watanabe , Sanjeev Khudanpur

Improving Universal Sound Separation Using Sound Classification

Deep learning approaches have recently achieved impressive performance on both audio source separation and sound classification. Most audio source separation approaches focus only on separating sources belonging to a restricted domain of…

Sound · Computer Science 2021-05-14 Efthymios Tzinis , Scott Wisdom , John R. Hershey , Aren Jansen , Daniel P. W. Ellis

Interleaved Multitask Learning for Audio Source Separation with Independent Databases

Deep Neural Network-based source separation methods usually train independent models to optimize for the separation of individual sources. Although this can lead to good performance for well-defined targets, it can also be computationally…

Sound · Computer Science 2019-08-15 Clement S. J. Doire , Olumide Okubadejo

Target-Side Context for Discriminative Models in Statistical Machine Translation

Discriminative translation models utilizing source context have been shown to help statistical machine translation performance. We propose a novel extension of this work using target context information. Surprisingly, we show that this…

Computation and Language · Computer Science 2016-07-06 Aleš Tamchyna , Alexander Fraser , Ondřej Bojar , Marcin Junczys-Dowmunt

One-shot conditional audio filtering of arbitrary sounds

We consider the problem of separating a particular sound source from a single-channel mixture, based on only a short sample of the target source. Using SoundFilter, a wave-to-wave neural network architecture, we can train a model without…

Audio and Speech Processing · Electrical Eng. & Systems 2020-11-05 Beat Gfeller , Dominik Roblek , Marco Tagliasacchi

Content based singing voice source separation via strong conditioning using aligned phonemes

Informed source separation has recently gained renewed interest with the introduction of neural networks and the availability of large multitrack datasets containing both the mixture and the separated sources. These approaches use prior…

Audio and Speech Processing · Electrical Eng. & Systems 2020-08-06 Gabriel Meseguer-Brocal , Geoffroy Peeters

Music Source Separation with Generative Flow

Fully-supervised models for source separation are trained on parallel mixture-source data and are currently state-of-the-art. However, such parallel data is often difficult to obtain, and it is cumbersome to adapt trained models to mixtures…

Audio and Speech Processing · Electrical Eng. & Systems 2022-11-30 Ge Zhu , Jordan Darefsky , Fei Jiang , Anton Selitskiy , Zhiyao Duan

Speaker-Conditional Chain Model for Speech Separation and Extraction

Speech separation has been extensively explored to tackle the cocktail party problem. However, these studies are still far from having enough generalization capabilities for real scenarios. In this work, we raise a common strategy named…

Audio and Speech Processing · Electrical Eng. & Systems 2020-06-26 Jing Shi , Jiaming Xu , Yusuke Fujita , Shinji Watanabe , Bo Xu

Latent Iterative Refinement for Modular Source Separation

Traditional source separation approaches train deep neural network models end-to-end with all the data available at once by minimizing the empirical risk on the whole training set. On the inference side, after training the model, the user…

Sound · Computer Science 2023-10-17 Dimitrios Bralios , Efthymios Tzinis , Gordon Wichern , Paris Smaragdis , Jonathan Le Roux

Amicable examples for informed source separation

This paper deals with the problem of informed source separation (ISS), where the sources are accessible during the so-called \textit{encoding} stage. Previous works computed side-information during the encoding stage and source separation…

Sound · Computer Science 2022-02-21 Naoya Takahashi , Yuki Mitsufuji

Time-Domain Speech Extraction with Spatial Information and Multi Speaker Conditioning Mechanism

In this paper, we present a novel multi-channel speech extraction system to simultaneously extract multiple clean individual sources from a mixture in noisy and reverberant environments. The proposed method is built on an improved…

Audio and Speech Processing · Electrical Eng. & Systems 2021-06-17 Jisi Zhang , Catalin Zorila , Rama Doddipatla , Jon Barker

Improving Source Separation by Explicitly Modeling Dependencies Between Sources

We propose a new method for training a supervised source separation system that aims to learn the interdependent relationships between all combinations of sources in a mixture. Rather than independently estimating each source from a mix, we…

Sound · Computer Science 2022-03-30 Ethan Manilow , Curtis Hawthorne , Cheng-Zhi Anna Huang , Bryan Pardo , Jesse Engel