Related papers: Microphone Array Based Surveillance Audio Classifi…

Sound Event Recognition in a Smart City Surveillance Context

Due to the growing demand for improving surveillance capabilities in smart cities, systems need to be developed to provide better monitoring capabilities to competent authorities, agencies responsible for strategic resource management, and…

Sound · Computer Science 2020-02-04 Tito Spadini, Dimitri Leandro de Oliveira Silva, Ricardo Suyama

Sound Event Localization and Detection Using CRNN on Pairs of Microphones

This paper proposes sound event localization and detection methods from multichannel recording. The proposed system is based on two Convolutional Recurrent Neural Networks (CRNNs) to perform sound event detection (SED) and time difference…

Audio and Speech Processing · Electrical Eng. & Systems 2019-10-23 Francois Grondin, James Glass, Iwona Sobieraj, Mark D. Plumbley

Sound event localization and classification using WASN in Outdoor Environment

Deep learning-based sound event localization and classification is an emerging research area within wireless acoustic sensor networks. However, current methods for sound event localization and classification typically rely on a single…

Sound · Computer Science 2026-01-27 Dongzhe Zhang, Jianfeng Chen, Jisheng Bai, Mou Wang, Dongyuan Shi, Qixiang Niu, Alberto Bernardini

Asynchronous Microphone Array Calibration using Hybrid TDOA Information

Asynchronous microphone array calibration is a prerequisite for many audition robot applications. A popular solution to the above calibration problem is the batch form of Simultaneous Localisation and Mapping (SLAM), using the time…

Audio and Speech Processing · Electrical Eng. & Systems 2024-10-02 Chengjie Zhang, Jiang Wang, He Kong

Acoustic Scene Analysis using Analog Spiking Neural Network

Sensor nodes in a wireless sensor network (WSN) for security surveillance applications should preferably be small, energy-efficient, and inexpensive with in-sensor computational abilities. An appropriate data processing scheme in the sensor…

Neural and Evolutionary Computing · Computer Science 2022-05-04 Anand Kumar Mukhopadhyay, Naligala Moses Prabhakar, Divya Lakshmi Duggisetty, Indrajit Chakrabarti, Mrigank Sharad

Anti-spoofing Methods for Automatic Speaker Verification System

Growing interest in automatic speaker verification (ASV) systems has led to significant quality improvement of spoofing attacks on them. Many research works confirm that despite the low equal error rate (EER) ASV systems are still…

Sound · Computer Science 2017-05-25 Galina Lavrentyeva, Sergey Novoselov, Konstantin Simonchik

An Ensemble SVM-based Approach for Voice Activity Detection

Voice activity detection (VAD), used as the front end of speech enhancement, speech and speaker recognition algorithms, determines the overall accuracy and efficiency of the algorithms. Therefore, a VAD with low complexity and high accuracy…

Sound · Computer Science 2019-02-06 Jayanta Dey, Md Sanzid Bin Hossain, Mohammad Ariful Haque

Deep Learning Features for Robust Detection of Acoustic Events in Sleep-Disordered Breathing

Sleep-disordered breathing (SDB) is a serious and prevalent condition, and acoustic analysis via consumer devices (e.g. smartphones) offers a low-cost solution to screening for it. We present a novel approach for the acoustic identification…

Audio and Speech Processing · Electrical Eng. & Systems 2019-04-08 Hector E. Romero, Ning Ma, Guy J. Brown, Amy V. Beeston, Madina Hasan

A Comparison of deep learning methods for environmental sound

Environmental sound detection is a challenging application of machine learning because of the noisy nature of the signal, and the small amount of (labeled) data that is typically available. This work thus presents a comparison of several…

Sound · Computer Science 2017-03-22 Juncheng Li, Wei Dai, Florian Metze, Shuhui Qu, Samarjit Das

Audio-based Anomaly Detection in Industrial Machines Using Deep One-Class Support Vector Data Description

The frequent breakdowns and malfunctions of industrial equipment have driven increasing interest in utilizing cost-effective and easy-to-deploy sensors, such as microphones, for effective condition monitoring of machinery. Microphones offer…

Sound · Computer Science 2025-05-28 Sertac Kilickaya, Mete Ahishali, Cansu Celebioglu, Fahad Sohrab, Levent Eren, Turker Ince, Murat Askar, Moncef Gabbouj

Selective Attention System (SAS): Device-Addressed Speech Detection for Real-Time On-Device Voice AI

We study device-addressed speech detection under pre-ASR edge deployment constraints, where systems must decide whether to forward audio before transcription under strict latency and compute limits. We show that, in multi-speaker…

Sound · Computer Science 2026-04-10 David Joohun Kim, Daniyal Anjum, Bonny Banerjee, Omar Abbasi

Conditioned Time-Dilated Convolutions for Sound Event Detection

Sound event detection (SED) is the task of identifying sound events along with their onset and offset times. A recent SED method based on convolutional neural networks proposed the use of depthwise separable (DWS) and time-dilated…

Sound · Computer Science 2020-07-13 Konstantinos Drossos, Stylianos I. Mimilakis, Tuomas Virtanen

Toward Noise-Aware Audio Deepfake Detection: Survey, SNR-Benchmarks, and Practical Recipes

Deepfake audio detection has progressed rapidly with strong pre-trained encoders (e.g., WavLM, Wav2Vec2, MMS). However, performance in realistic capture conditions - background noise (domestic/office/transport), room reverberation, and…

Sound · Computer Science 2025-12-17 Udayon Sen, Alka Luqman, Anupam Chattopadhyay

Improving the Efficiency of DAMAS for Sound Source Localization via Wavelet Compression Computational Grid

Phased microphone arrays are widely used in acoustic source localization applications. Deconvolution approaches such as DAMAS successfully overcome the spatial resolution limit of the conventional delay-and-sum (DAS) beamforming…

Sound · Computer Science 2017-02-14 Wei Ma, Xun Liu

A Lite Microphone Array Beamforming Scheme with Maximum Signal-to-Noise Ratio Filter

Since space-domain information can be utilized, microphone array beamforming is often used to enhance speech quality by suppressing directional disturbance. However, with the increasing number of microphones, the complexity would…

Sound · Computer Science 2020-05-20 Lu Ma, Xin Zhao, Pei Zhao, Tengrong Su

Prediction of speech intelligibility with DNN-based performance measures

This paper presents a speech intelligibility model based on automatic speech recognition (ASR), combining phoneme probabilities from deep neural networks (DNN) and a performance measure that estimates the word error rate from these…

Sound · Computer Science 2022-03-18 Angel Mario Castro Martinez, Constantin Spille, Jana Roßbach, Birger Kollmeier, Bernd T. Meyer

Noise adaptive beamforming for linear array photoacoustic imaging

Delay-and-sum (DAS) algorithms are widely used for beamforming in linear array photoacoustic imaging systems and are characterized by fast execution. However, these algorithms suffer from various drawbacks like low resolution, low contrast,…

Medical Physics · Physics 2021-07-29 Souradip Paul, Subhamoy Mandal, Mayanglambam Suheshkumar Singh

A Multi-grained based Attention Network for Semi-supervised Sound Event Detection

Sound event detection (SED) is an interesting but challenging task due to the scarcity of data and diverse sound events in real life. This paper presents a multi-grained based attention network (MGA-Net) for semi-supervised sound event…

Sound · Computer Science 2022-11-01 Ying Hu, Xiujuan Zhu, Yunlong Li, Hao Huang, Liang He

Direction of Arrival Estimation of Wide-band Signals with Planar Microphone Arrays

An approach to the estimation of the Direction of Arrival (DOA) of wide-band signals with a planar microphone array is presented. Our algorithm estimates an unambiguous DOA using a single planar array in which the microphones are placed…

Sound · Computer Science 2018-11-19 Rudolf Byker, Thomas Niesler

The Reasonable Effectiveness of Speaker Embeddings for Violence Detection

In this paper, we focus on audio violence detection (AVD). AVD is necessary for several reasons, especially in the context of maintaining safety, preventing harm, and ensuring security in various environments. This calls for accurate AVD…

Audio and Speech Processing · Electrical Eng. & Systems 2024-06-12 Sarthak Jain, Orchid Chetia Phukan, Arun Balaji Buduru, Rajesh Sharma