English

Unifying Probabilistic Models for Time-Frequency Analysis

Signal Processing 2019-02-13 v6 Machine Learning Sound Audio and Speech Processing Machine Learning

Abstract

In audio signal processing, probabilistic time-frequency models have many benefits over their non-probabilistic counterparts. They adapt to the incoming signal, quantify uncertainty, and measure correlation between the signal's amplitude and phase information, making time domain resynthesis straightforward. However, these models are still not widely used since they come at a high computational cost, and because they are formulated in such a way that it can be difficult to interpret all the modelling assumptions. By showing their equivalence to Spectral Mixture Gaussian processes, we illuminate the underlying model assumptions and provide a general framework for constructing more complex models that better approximate real-world signals. Our interpretation makes it intuitive to inspect, compare, and alter the models since all prior knowledge is encoded in the Gaussian process kernel functions. We utilise a state space representation to perform efficient inference via Kalman smoothing, and we demonstrate how our interpretation allows for efficient parameter learning in the frequency domain.

Keywords

Cite

@article{arxiv.1811.02489,
  title  = {Unifying Probabilistic Models for Time-Frequency Analysis},
  author = {William J. Wilkinson and Michael Riis Andersen and Joshua D. Reiss and Dan Stowell and Arno Solin},
  journal= {arXiv preprint arXiv:1811.02489},
  year   = {2019}
}

Comments

Accepted to International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019