English
Related papers

Related papers: Deep Learning Based Audio-Visual Multi-Speaker DOA…

200 papers

Multi-source localization is an important and challenging technique for multi-talker conversation analysis. This paper proposes a novel supervised learning method using deep neural networks to estimate the direction of arrival (DOA) of all…

Audio and Speech Processing · Electrical Eng. & Systems 2021-11-30 Aswin Shanmugam Subramanian , Chao Weng , Shinji Watanabe , Meng Yu , Dong Yu

Supervised learning based methods for source localization, being data driven, can be adapted to different acoustic conditions via training and have been shown to be robust to adverse acoustic environments. In this paper, a convolutional…

Audio and Speech Processing · Electrical Eng. & Systems 2019-05-22 Soumitro Chakrabarty , Emanuël A. P. Habets

Direction-of-Arrival (DOA) estimation is critical in spatial audio and acoustic signal processing, with wide-ranging applications in real-world. Most existing DOA models are trained on synthetic data by convolving clean speech with room…

Sound · Computer Science 2025-11-12 Haowen Li , Zhengding Luo , Dongyuan Shi , Boxiang Wang , Junwei Ji , Ziyi Yang , Woon-Seng Gan

Deep learning-based direction-of-arrival (DoA) estimation has gained increasing popularity. A popular family of DoA estimation algorithms is beamforming methods, which operate by constructing a spatial filter that is applied to array…

Computational Engineering, Finance, and Science · Computer Science 2025-12-25 Xuyao Deng , Yong Dou , Kele Xu

In multi-speaker environments the direction of arrival (DOA) of a target speaker is key for improving speech clarity and extracting target speaker's voice. However, traditional DOA estimation methods often struggle in the presence of noise,…

Sound · Computer Science 2024-12-30 Zixuan Li , Shulin He , Xueliang Zhang

Most of the prior studies in the spatial \ac{DoA} domain focus on a single modality. However, humans use auditory and visual senses to detect the presence of sound sources. With this motivation, we propose to use neural networks with audio…

Sound · Computer Science 2021-05-14 Xinyuan Qian , Maulik Madhavi , Zexu Pan , Jiadong Wang , Haizhou Li

We propose a novel multi-source direction of arrival (DOA) estimation technique using a convolutional neural network algorithm which learns the modal coherence patterns of an incident soundfield through measured spherical harmonic…

Sound · Computer Science 2020-03-19 A. Fahim , P. N. Samarasinghe , T. D. Abhayapala

Recently, deep representation learning has shown strong performance in multiple audio tasks. However, its use for learning spatial representations from multichannel audio is underexplored. We investigate the use of a pretraining stage based…

As we interact with the world, for example when we communicate with our colleagues in a large open space or meeting room, we continuously analyse the surrounding environment and, in particular, localise and recognise acoustic events. While…

Sound · Computer Science 2019-04-02 Pawel Swietojanski , Ondrej Miksik

For extracting a target speaker voice, direction-of-arrival (DOA) estimation is crucial for binaural hearing aids operating in noisy, multi-speaker environments. Among the solutions developed for this task, a deep learning convolutional…

Audio and Speech Processing · Electrical Eng. & Systems 2026-04-30 Farnaz Jazaeri , Homayoun Kamkar-Parsi , François Grondin , Martin Bouchard

This paper presents a tool for the analysis, and simulation of direction-of-arrival (DOA) estimation in wireless mobile communication systems over the fading channel. It reviews two methods of Direction of arrival (DOA) estimation…

Networking and Internet Architecture · Computer Science 2011-12-12 A. V. Meenakshi , V. Punitham , R. Kayalvizhi , S. Asha

In hearing aid applications, an important objective is to accurately estimate the direction of arrival (DOA) of multiple speakers in noisy and reverberant environments. Recently, we proposed a binaural DOA estimation method, where the DOAs…

Audio and Speech Processing · Electrical Eng. & Systems 2026-02-11 Daniel Fejgin , Simon Doclo

Recently, a method has been proposed to estimate the direction of arrival (DOA) of a single speaker by minimizing the frequency-averaged Hermitian angle between an estimated relative transfer function (RTF) vector and a database of…

Audio and Speech Processing · Electrical Eng. & Systems 2024-10-28 Daniel Fejgin , Simon Doclo

In this paper, we present a deep neural network-based online multi-speaker localisation algorithm. Following the W-disjoint orthogonality principle in the spectral domain, each time-frequency (TF) bin is dominated by a single speaker, and…

Audio and Speech Processing · Electrical Eng. & Systems 2020-08-28 Hodaya Hammer , Shlomo E. Chazan , Jacob Goldberger , Sharon Gannot

The use of audio and visual modality for speaker localization has been well studied in the literature by exploiting their complementary characteristics. However, most previous works employ the setting of static sensors mounted at fixed…

Multimedia · Computer Science 2023-09-29 Jinzheng Zhao , Yong Xu , Xinyuan Qian , Wenwu Wang

The direction-of-arrival (DOA) of sound sources is an essential acoustic parameter used, e.g., for multi-channel speech enhancement or source tracking. Complex acoustic scenarios consisting of sources-of-interest, interfering sources,…

Audio and Speech Processing · Electrical Eng. & Systems 2022-02-17 Wolfgang Mack , Julian Wechsler , Emanuël A. P. Habets

Direction of arrival (DoA) estimation is a common sensing problem in radar, sonar, audio, and wireless communication systems. It has gained renewed importance with the advent of the integrated sensing and communication paradigm. To fully…

We present a MUSIC-based Direction of Arrival (DOA) estimation strategy using small antenna arrays, via employing deep learning for reconstructing the signals of a virtual large antenna array. Not only does the proposed strategy deliver…

Signal Processing · Electrical Eng. & Systems 2021-03-08 Aya Mostafa Ahmed , Udaya Sampath K. P. Miriya Thanthrige , Aly El Gamal , Aydin Sezgin

The problem of direction-of-arrival (DOA) estimation in the presence of nonuniform sensor noise is considered and a novel algorithm is developed. The algorithm consists of three phases. First, the diagonal nonuniform sensor noise covariance…

Signal Processing · Electrical Eng. & Systems 2021-10-01 Majdoddin Esfandiari , Sergiy A. Vorobyov

This paper describes sound event localization and detection (SELD) for spatial audio recordings captured by firstorder ambisonics (FOA) microphones. In this task, one may train a deep neural network (DNN) using FOA data annotated with the…

Sound · Computer Science 2024-10-31 Yoto Fujita , Yoshiaki Bando , Keisuke Imoto , Masaki Onishi , Kazuyoshi Yoshii
‹ Prev 1 2 3 10 Next ›