English

Efficient Training Data Generation for Phase-Based DOA Estimation

Audio and Speech Processing 2022-02-17 v1 Machine Learning Sound

Abstract

Deep learning (DL) based direction of arrival (DOA) estimation is an active research topic and currently represents the state-of-the-art. Usually, DL-based DOA estimators are trained with recorded data or computationally expensive generated data. Both data types require significant storage and excessive time to, respectively, record or generate. We propose a low complexity online data generation method to train DL models with a phase-based feature input. The data generation method models the phases of the microphone signals in the frequency domain by employing a deterministic model for the direct path and a statistical model for the late reverberation of the room transfer function. By an evaluation using data from measured room impulse responses, we demonstrate that a model trained with the proposed training data generation method performs comparably to models trained with data generated based on the source-image method.

Keywords

Cite

@article{arxiv.2011.04456,
  title  = {Efficient Training Data Generation for Phase-Based DOA Estimation},
  author = {Fabian Hübner and Wolfgang Mack and Emanuël A. P. Habets},
  journal= {arXiv preprint arXiv:2011.04456},
  year   = {2022}
}

Comments

Submitted to ICASSP 2021

R2 v1 2026-06-23T20:00:55.937Z