Related papers: Phased Microphone Array for Sound Source Localizat…

Towards End-to-End Acoustic Localization using Deep Learning: from Audio Signal to Source Position Coordinates

This paper presents a novel approach for indoor acoustic source localization using microphone arrays and based on a Convolutional Neural Network (CNN). The proposed solution is, to the best of our knowledge, the first published work in…

Sound · Computer Science 2019-02-01 Juan Manuel Vera-Diaz , Daniel Pizarro , Javier Macias-Guarasa

DNN-based mask estimation for distributed speech enhancement in spatially unconstrained microphone arrays

Deep neural network (DNN)-based speech enhancement algorithms in microphone arrays have now proven to be efficient solutions to speech understanding and speech recognition in noisy environments. However, in the context of ad-hoc microphone…

Signal Processing · Electrical Eng. & Systems 2020-11-04 Nicolas Furnon , Romain Serizel , Irina Illina , Slim Essid

Environmental Sound Classification Based on Multi-temporal Resolution Convolutional Neural Network Combining with Multi-level Features

Motivated by the fact that characteristics of different sound classes are highly diverse in different temporal scales and hierarchical levels, a novel deep convolutional neural network (CNN) architecture is proposed for the environmental…

Sound · Computer Science 2018-06-15 Boqing Zhu , Kele Xu , Dezhi Wang , Lilun Zhang , Bo Li , Yuxing Peng

Speaker Recognition from Raw Waveform with SincNet

Deep learning is progressively gaining popularity as a viable alternative to i-vectors for speaker recognition. Promising results have been recently obtained with Convolutional Neural Networks (CNNs) when fed by raw speech samples directly.…

Audio and Speech Processing · Electrical Eng. & Systems 2019-08-12 Mirco Ravanelli , Yoshua Bengio

Explaining Deep Convolutional Neural Networks on Music Classification

Deep convolutional neural networks (CNNs) have been actively adopted in the field of music information retrieval, e.g. genre classification, mood detection, and chord recognition. However, the process of learning and prediction is little…

Machine Learning · Computer Science 2016-07-11 Keunwoo Choi , George Fazekas , Mark Sandler

Graph neural networks for sound source localization on distributed microphone networks

Distributed Microphone Arrays (DMAs) present many challenges with respect to centralized microphone arrays. An important requirement of applications on these arrays is handling a variable number of input channels. We consider the use of…

Sound · Computer Science 2023-06-29 Eric Grinstein , Mike Brookes , Patrick A. Naylor

Deep learning for source camera identification on mobile devices

In the present paper, we propose a source camera identification method for mobile devices based on deep learning. Recently, convolutional neural networks (CNNs) have shown a remarkable performance on several tasks such as image recognition,…

Computer Vision and Pattern Recognition · Computer Science 2018-01-25 David Freire-Obregón , Fabio Narducci , Silvio Barra , Modesto Castrillón-Santana

End-to-end Binaural Sound Localisation from the Raw Waveform

A novel end-to-end binaural sound localisation approach is proposed which estimates the azimuth of a sound source directly from the waveform. Instead of employing hand-crafted features commonly employed for binaural sound localisation, such…

Sound · Computer Science 2019-04-04 Paolo Vecchiotti , Ning Ma , Stefano Squartini , Guy J. Brown

Adaptive high-precision sound source localization at low frequencies based on convolutional neural network

Sound source localization (SSL) technology plays a crucial role in various application areas such as fault diagnosis, speech separation, and vibration noise reduction. Although beamforming algorithms are widely used in SSL, their resolution…

Sound · Computer Science 2024-10-01 Wenbo Ma , Yan Lu , Yijun Liu

A Survey of the Recent Architectures of Deep Convolutional Neural Networks

Deep Convolutional Neural Network (CNN) is a special type of Neural Networks, which has shown exemplary performance on several competitions related to Computer Vision and Image Processing. Some of the exciting application areas of CNN…

Computer Vision and Pattern Recognition · Computer Science 2020-05-12 Asifullah Khan , Anabia Sohail , Umme Zahoora , Aqsa Saeed Qureshi

Sound Source Localization in a Multipath Environment Using Convolutional Neural Networks

The propagation of sound in a shallow water environment is characterized by boundary reflections from the sea surface and sea floor. These reflections result in multiple (indirect) sound propagation paths, which can degrade the performance…

Sound · Computer Science 2017-10-31 Eric L. Ferguson , Stefan B. Williams , Craig T. Jin

Sound texture synthesis using convolutional neural networks

The following article introduces a new parametric synthesis algorithm for sound textures inspired by existing methods used for visual textures. Using a 2D Convolutional Neural Network (CNN), a sound signal is modified until the temporal…

Sound · Computer Science 2019-05-10 Hugo Caracalla , Axel Roebel

Approximation analysis of CNNs from a feature extraction view

Deep learning based on deep neural networks has been very successful in many practical applications, but it lacks enough theoretical understanding due to the network architectures and structures. In this paper we establish some analysis for…

Machine Learning · Computer Science 2024-01-03 Jianfei Li , Han Feng , Ding-Xuan Zhou

Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection

Sound events often occur in unstructured environments where they exhibit wide variations in their frequency content and temporal structure. Convolutional neural networks (CNN) are able to extract higher level features that are invariant to…

Machine Learning · Computer Science 2017-05-31 Emre Çakır , Giambattista Parascandolo , Toni Heittola , Heikki Huttunen , Tuomas Virtanen

Deep Residual Network for Sound Source Localization in the Time Domain

This study presents a system for sound source localization in time domain using a deep residual neural network. Data from the linear 8 channel microphone array with 3 cm spacing is used by the network for direction estimation. We propose to…

Sound · Computer Science 2018-08-21 Dmitry Suvorov , Ge Dong , Roman Zhukov

Microphone Array Signal Processing and Deep Learning for Speech Enhancement

Multi-channel acoustic signal processing is a well-established and powerful tool to exploit the spatial diversity between a target signal and non-target or noise sources for signal enhancement. However, the textbook solutions for optimal…

Audio and Speech Processing · Electrical Eng. & Systems 2025-01-14 Reinhold Haeb-Umbach , Tomohiro Nakatani , Marc Delcroix , Christoph Boeddeker , Tsubasa Ochiai

Inference-Adaptive Neural Steering for Real-Time Area-Based Sound Source Separation

We propose a novel Neural Steering technique that adapts the target area of a spatial-aware multi-microphone sound source separation algorithm during inference without the necessity of retraining the deep neural network (DNN). To achieve…

Audio and Speech Processing · Electrical Eng. & Systems 2024-10-23 Martin Strauss , Wolfgang Mack , María Luis Valero , Okan Köpüklü

Study and Observation of the Variations of Accuracies for Handwritten Digits Recognition with Various Hidden Layers and Epochs using Convolutional Neural Network

Nowadays, deep learning can be employed to a wide ranges of fields including medicine, engineering, etc. In deep learning, Convolutional Neural Network (CNN) is extensively used in the pattern and sequence recognition, video analysis,…

Computer Vision and Pattern Recognition · Computer Science 2019-02-06 Rezoana Bente Arif , Md. Abu Bakr Siddique , Mohammad Mahmudur Rahman Khan , Mahjabin Rahman Oishe

Speech and Speaker Recognition from Raw Waveform with SincNet

Deep neural networks can learn complex and abstract representations, that are progressively obtained by combining simpler ones. A recent trend in speech and speaker recognition consists in discovering these representations starting from raw…

Audio and Speech Processing · Electrical Eng. & Systems 2019-02-26 Mirco Ravanelli , Yoshua Bengio

Musical instrument sound classification with deep convolutional neural network using feature fusion approach

A new musical instrument classification method using convolutional neural networks (CNNs) is presented in this paper. Unlike the traditional methods, we investigated a scheme for classifying musical instruments using the learned features…

Sound · Computer Science 2015-12-24 Taejin Park , Taejin Lee