English
Related papers

Related papers: Data-Efficient Framework for Real-world Multiple S…

200 papers

This paper introduces an ensemble of discriminators that improves the accuracy of a domain adaptation technique for the localization of multiple sound sources. Recently, deep neural networks have led to promising results for this task, yet…

Audio and Speech Processing · Electrical Eng. & Systems 2021-03-17 Guillaume Le Moing , Don Joven Agravante , Tadanobu Inoue , Jayakorn Vongkulbhisal , Asim Munawar , Ryuki Tachibana , Phongtharin Vinayavekhin

In this paper, we propose novel deep learning based algorithms for multiple sound source localization. Specifically, we aim to find the 2D Cartesian coordinates of multiple sound sources in an enclosed environment by using multiple…

Audio and Speech Processing · Electrical Eng. & Systems 2020-12-11 Guillaume Le Moing , Phongtharin Vinayavekhin , Tadanobu Inoue , Jayakorn Vongkulbhisal , Asim Munawar , Ryuki Tachibana , Don Joven Agravante

Multi-channel acoustic signal processing is a well-established and powerful tool to exploit the spatial diversity between a target signal and non-target or noise sources for signal enhancement. However, the textbook solutions for optimal…

Audio and Speech Processing · Electrical Eng. & Systems 2025-01-14 Reinhold Haeb-Umbach , Tomohiro Nakatani , Marc Delcroix , Christoph Boeddeker , Tsubasa Ochiai

We propose to use neural networks for simultaneous detection and localization of multiple sound sources in human-robot interaction. In contrast to conventional signal processing techniques, neural network-based sound source localization…

Sound · Computer Science 2018-09-18 Weipeng He , Petr Motlicek , Jean-Marc Odobez

Improper or erroneous labelling can pose a hindrance to reliable generalization for supervised learning. This can have negative consequences, especially for critical fields such as healthcare. We propose an effective new approach for…

Machine Learning · Computer Science 2021-11-16 Konstantinos Nikolaidis , Thomas Plagemann , Stein Kristiansen , Vera Goebel , Mohan Kankanhalli

Noisy labels are ubiquitous in real-world datasets, especially in the large-scale ones derived from crowdsourcing and web searching. It is challenging to train deep neural networks with noisy datasets since the networks are prone to…

Computer Vision and Pattern Recognition · Computer Science 2024-06-26 Yangdi Lu , Wenbo He

This paper presents a novel approach for indoor acoustic source localization using microphone arrays and based on a Convolutional Neural Network (CNN). The proposed solution is, to the best of our knowledge, the first published work in…

Sound · Computer Science 2019-02-01 Juan Manuel Vera-Diaz , Daniel Pizarro , Javier Macias-Guarasa

Sound event localization aims at estimating the positions of sound sources in the environment with respect to an acoustic receiver (e.g. a microphone array). Recent advances in this domain most prominently focused on utilizing deep…

Automatic music transcription is considered to be one of the hardest problems in music information retrieval, yet recent deep learning approaches have achieved substantial improvements on transcription performance. These approaches commonly…

Sound · Computer Science 2019-06-21 Jong Wook Kim , Juan Pablo Bello

Data lies at the core of modern deep learning. The impressive performance of supervised learning is built upon a base of massive accurately labeled data. However, in some real-world applications, accurate labeling might not be viable;…

Collecting large-scale datasets is crucial for training deep models, annotating the data, however, inevitably yields noisy labels, which poses challenges to deep learning algorithms. Previous efforts tend to mitigate this problem via…

Computer Vision and Pattern Recognition · Computer Science 2023-02-16 Yuanpeng Tu , Boshen Zhang , Yuxi Li , Liang Liu , Jian Li , Jiangning Zhang , Yabiao Wang , Chengjie Wang , Cai Rong Zhao

Deep neural networks have proven to be highly effective when large amounts of data with clean labels are available. However, their performance degrades when training data contains noisy labels, leading to poor generalization on the test…

Computer Vision and Pattern Recognition · Computer Science 2023-08-15 Fahimeh Fooladgar , Minh Nguyen Nhat To , Parvin Mousavi , Purang Abolmaesumi

Detecting sound source objects within visual observation is important for autonomous robots to comprehend surrounding environments. Since sounding objects have a large variety with different appearances in our living environments, labeling…

Sound · Computer Science 2020-07-29 Yoshiki Masuyama , Yoshiaki Bando , Kohei Yatabe , Yoko Sasaki , Masaki Onishi , Yasuhiro Oikawa

Sound source localization (SSL) is a critical technology for determining the position of sound sources in complex environments. However, existing methods face challenges such as high computational costs and precise calibration requirements,…

Sound · Computer Science 2025-05-28 Yiyuan Yang , Shitong Xu , Niki Trigoni , Andrew Markham

The deployment of machine listening algorithms in real-life applications is often impeded by a domain shift caused for instance by different microphone characteristics. In this paper, we propose a novel domain adaptation strategy based on…

Audio and Speech Processing · Electrical Eng. & Systems 2021-10-27 Jakob Abeßer , Meinard Müller

While deep-learning-based speaker localization has shown advantages in challenging acoustic environments, it often yields only direction-of-arrival (DOA) cues rather than precise two-dimensional (2D) coordinates. To address this, we propose…

Audio and Speech Processing · Electrical Eng. & Systems 2024-04-02 Shupei Liu , Linfeng Feng , Yijun Gong , Chengdong Liang , Chen Zhang , Xiao-Lei Zhang , Xuelong Li

Sound sources localization using multichannel signal processing has been a subject of active research for decades. In recent years, the use of deep learning in audio signal processing has allowed to drastically improve performances for…

Audio and Speech Processing · Electrical Eng. & Systems 2021-06-16 Hadrien Pujol , Éric Bavu , Alexandre Garcia

Deep learning techniques have been widely used in autonomous driving systems for the semantic understanding of urban scenes. However, they need a huge amount of labeled data for training, which is difficult and expensive to acquire. A…

Computer Vision and Pattern Recognition · Computer Science 2020-03-03 Umberto Michieli , Matteo Biasetton , Gianluca Agresti , Pietro Zanuttigh

Training a deep network to perform semantic segmentation requires large amounts of labeled data. To alleviate the manual effort of annotating real images, researchers have investigated the use of synthetic data, which can be labeled…

Computer Vision and Pattern Recognition · Computer Science 2018-07-18 Fatemeh Sadat Saleh , Mohammad Sadegh Aliakbarian , Mathieu Salzmann , Lars Petersson , Jose M. Alvarez

The performance of machine learning algorithms is known to be negatively affected by possible mismatches between training (source) and test (target) data distributions. In fact, this problem emerges whenever an acoustic scene classification…

Audio and Speech Processing · Electrical Eng. & Systems 2020-05-04 Alessandro Ilic Mezza , Emanuël A. P. Habets , Meinard Müller , Augusto Sarti
‹ Prev 1 2 3 10 Next ›