Related papers: Nonnegative Tensor Factorization for Directional B…

Nonnegative tensor factorization with frequency modulation cues for blind audio source separation

We present Vibrato Nonnegative Tensor Factorization, an algorithm for single-channel unsupervised audio source separation with an application to separating instrumental or vocal sources with nonstationary pitch from music recordings. Our…

Sound · Computer Science 2016-06-02 Elliot Creager , Noah D. Stein , Roland Badeau , Philippe Depalle

Audio Source Separation in Reverberant Environments using $\beta$-divergence based Nonnegative Factorization

In Gaussian model-based multichannel audio source separation, the likelihood of observed mixtures of source signals is parametrized by source spectral variances and by associated spatial covariance matrices. These parameters are estimated…

Sound · Computer Science 2026-04-15 Mahmoud Fakhry , Piergiorgio Svaizer , Maurizio Omologo

On audio enhancement via online non-negative matrix factorization

We propose a method for noise reduction, the task of producing a clean audio signal from a recording corrupted by additive noise. Many common approaches to this problem are based upon applying non-negative matrix factorization to…

Audio and Speech Processing · Electrical Eng. & Systems 2021-10-08 Andrew Sack , Wenzhao Jiang , Michael Perlmutter , Palina Salanevich , Deanna Needell

On Ambisonic Source Separation with Spatially Informed Non-negative Tensor Factorization

This article presents a Non-negative Tensor Factorization based method for sound source separation from Ambisonic microphone signals. The proposed method enables the use of prior knowledge about the Directions-of-Arrival (DOAs) of the…

Audio and Speech Processing · Electrical Eng. & Systems 2025-01-20 Mateusz Guzik , Konrad Kowalczyk

Non-negative matrix factorization-based subband decomposition for acoustic source localization

A novel non-negative matrix factorization (NMF) based subband decomposition in frequency spatial domain for acoustic source localization using a microphone array is introduced. The proposed method decomposes source and noise subband and…

Sound · Computer Science 2016-10-18 Suwon Shon , Seongkyu Mun , David Han , Hanseok Ko

Joint Sound Source Separation and Speaker Recognition

Non-negative Matrix Factorization (NMF) has already been applied to learn speaker characterizations from single or non-simultaneous speech for speaker recognition applications. It is also known for its good performance in (blind) source…

Sound · Computer Science 2016-05-02 Jeroen Zegers , Hugo Van hamme

Blind Audio Source Separation with Minimum-Volume Beta-Divergence NMF

Considering a mixed signal composed of various audio sources and recorded with a single microphone, we consider on this paper the blind audio source separation problem which consists in isolating and extracting each of the sources. To…

Signal Processing · Electrical Eng. & Systems 2020-07-15 Valentin Leplat , Nicolas Gillis , Man Shun Ang

A Neural Network Alternative to Non-Negative Audio Models

We present a neural network that can act as an equivalent to a Non-Negative Matrix Factorization (NMF), and further show how it can be used to perform supervised source separation. Due to the extensibility of this approach we show how we…

Sound · Computer Science 2016-09-13 Paris Smaragdis , Shrikant Venkataramani

A New Non-Negative Matrix Factorization Approach for Blind Source Separation of Cardiovascular and Respiratory Sound Based on the Periodicity of Heart and Lung Function

Auscultation provides a rich diversity of information to diagnose cardiovascular and respiratory diseases. However, sound auscultation is challenging due to noise. In this study, a modified version of the affine non-negative matrix…

Signal Processing · Electrical Eng. & Systems 2026-05-27 Yasaman Torabi , Shahram Shirani , James P. Reilly

End-to-end Non-Negative Autoencoders for Sound Source Separation

Discriminative models for source separation have recently been shown to produce impressive results. However, when operating on sources outside of the training set, these models can not perform as well and are cumbersome to update. Classical…

Sound · Computer Science 2019-11-04 Shrikant Venkataramani , Efthymios Tzinis , Paris Smaragdis

Exploring Efficient Directional and Distance Cues for Regional Speech Separation

In this paper, we introduce a neural network-based method for regional speech separation using a microphone array. This approach leverages novel spatial cues to extract the sound source not only from specified direction but also within…

Sound · Computer Science 2025-08-12 Yiheng Jiang , Haoxu Wang , Yafeng Chen , Gang Qiao , Biao Tian

Prior Distribution Design for Music Bleeding-Sound Reduction Based on Nonnegative Matrix Factorization

When we place microphones close to a sound source near other sources in audio recording, the obtained audio signal includes undesired sound from the other sources, which is often called cross-talk or bleeding sound. For many audio…

Sound · Computer Science 2021-09-02 Yusaku Mizobuchi , Daichi Kitamura , Tomohiko Nakamura , Hiroshi Saruwatari , Yu Takahashi , Kazunobu Kondo

Phase recovery in NMF for audio source separation: an insightful benchmark

Nonnegative Matrix Factorization (NMF) is a powerful tool for decomposing mixtures of audio signals in the Time-Frequency (TF) domain. In applications such as source separation, the phase recovery for each extracted component is a major…

Sound · Computer Science 2016-11-17 Paul Magron , Roland Badeau , Bertrand David

Online algorithms for Nonnegative Matrix Factorization with the Itakura-Saito divergence

Nonnegative matrix factorization (NMF) is now a common tool for audio source separation. When learning NMF on large audio databases, one major drawback is that the complexity in time is O(FKN) when updating the dictionary (where (F;N) is…

Machine Learning · Statistics 2011-06-22 Augustin Lefèvre , Francis Bach , Cédric Févotte

Neural Directional Filtering Using a Compact Microphone Array

Beamforming with desired directivity patterns using compact microphone arrays is essential in many audio applications. Directivity patterns achievable using traditional beamformers depend on the number of microphones and the array aperture.…

Audio and Speech Processing · Electrical Eng. & Systems 2026-03-24 Weilong Huang , Srikanth Raj Chetupalli , Mhd Modar Halimeh , Oliver Thiergart , Emanuël A. P. Habets

Non-negative Matrix Factorization with Linear Constraints for Single-Channel Speech Enhancement

This paper investigates a non-negative matrix factorization (NMF)-based approach to the semi-supervised single-channel speech enhancement problem where only non-stationary additive noise signals are given. The proposed method relies on…

Sound · Computer Science 2013-09-25 Nikolay Lyubimov , Mikhail Kotov

Probabilistic Modelling of Signal Mixtures with Differentiable Dictionaries

We introduce a novel way to incorporate prior information into (semi-) supervised non-negative matrix factorization, which we call differentiable dictionary search. It enables general, highly flexible and principled modelling of mixtures…

Audio and Speech Processing · Electrical Eng. & Systems 2022-11-29 Lukáš Samuel Marták , Rainer Kelz , Gerhard Widmer

Nonnegative Matrix Factorization with Transform Learning

Traditional NMF-based signal decomposition relies on the factorization of spectral data, which is typically computed by means of short-time frequency transform. In this paper we propose to relax the choice of a pre-fixed transform and learn…

Machine Learning · Computer Science 2017-12-18 Dylan Fagot , Cédric Févotte , Herwig Wendt

Fast Multichannel NMF with Block-Diagonal Spatial Covariance Matrices for Efficient Blind Source Separation Using Distributed Microphone Arrays

Distributed microphone arrays composed of multiple subarrays enable blind source separation over a wide spatial area. Directly applying fast multichannel nonnegative matrix factorization (FastMNMF) to all subarrays can exploit observations…

Audio and Speech Processing · Electrical Eng. & Systems 2026-05-20 Hirotaka Nishikori , Nobutaka Ito , Kouei Yamaoka , Norihiro Takamune , Hiroshi Saruwatari

A Complex Matrix Factorization approach to Joint Modeling of Magnitude and Phase for Source Separation

Conventional NMF methods for source separation factorize the matrix of spectral magnitudes. Spectral Phase is not included in the decomposition process of these methods. However, phase of the speech mixture is generally used in…

Sound · Computer Science 2014-11-26 Chaitanya Ahuja , Karan Nathwani , Rajesh M. Hegde