English
Related papers

Related papers: Polyphonic audio event detection: multi-label or m…

200 papers

Polyphonic Sound Event Detection (SED) in real-world recordings is a challenging task because of the dynamic polyphony level, intensity, and duration of sound events. Current polyphonic SED systems fail to model the temporal structure of…

Audio and Speech Processing · Electrical Eng. & Systems 2019-08-02 Arjun Pankajakshan , Helen L. Bear , Emmanouil Benetos

We propose a multi-label multi-task framework based on a convolutional recurrent neural network to unify detection of isolated and overlapping audio events. The framework leverages the power of convolutional recurrent neural network…

Machine Learning · Computer Science 2019-02-20 Huy Phan , Oliver Y. Chén , Philipp Koch , Lam Pham , Ian McLoughlin , Alfred Mertins , Maarten De Vos

Current audio classification models have small class vocabularies relative to the large number of sound event classes of interest in the real world. Thus, they provide a limited view of the world that may miss important yet unexpected or…

Sound · Computer Science 2023-10-24 Sripathi Sridhar , Mark Cartwright

This report proposes a polyphonic sound event detection (SED) method for the DCASE 2020 Challenge Task 4. The proposed SED method is based on semi-supervised learning to deal with the different combination of training datasets such as…

Audio and Speech Processing · Electrical Eng. & Systems 2020-07-03 Nam Kyun Kim , Hong Kook Kim

Artificial sound event detection (SED) has the aim to mimic the human ability to perceive and understand what is happening in the surroundings. Nowadays, Deep Learning offers valuable techniques for this goal such as Convolutional Neural…

Audio and Speech Processing · Electrical Eng. & Systems 2019-06-26 Fabio Vesperini , Leonardo Gabrielli , Emanuele Principi , Stefano Squartini

Polyphonic sound event localization and detection is not only detecting what sound events are happening but localizing corresponding sound sources. This series of tasks was first introduced in DCASE 2019 Task 3. In 2020, the sound event…

Audio and Speech Processing · Electrical Eng. & Systems 2020-10-02 Yin Cao , Turab Iqbal , Qiuqiang Kong , Yue Zhong , Wenwu Wang , Mark D. Plumbley

Polyphonic sound event localization and detection (SELD), which jointly performs sound event detection (SED) and direction-of-arrival (DoA) estimation, detects the type and occurrence time of sound events as well as their corresponding DoA…

Sound · Computer Science 2021-02-12 Yin Cao , Turab Iqbal , Qiuqiang Kong , Fengyan An , Wenwu Wang , Mark D. Plumbley

Considering that acoustic scenes and sound events are closely related to each other, in some previous papers, a joint analysis of acoustic scenes and sound events utilizing multitask learning (MTL)-based neural networks was proposed. In…

Sound · Computer Science 2022-07-12 Shunsuke Tsubaki , Keisuke Imoto , Nobutaka Ono

Sound event detection is an important facet of audio tagging that aims to identify sounds of interest and define both the sound category and time boundaries for each sound event in a continuous recording. With advances in deep neural…

Sound · Computer Science 2024-12-31 Sangwook Park , David K. Han , Mounya Elhilali

The challenges of polyphonic sound event detection (PSED) stem from the detection of multiple overlapping events in a time series. Recent efforts exploit Deep Neural Networks (DNNs) on Time-Frequency Representations (TFRs) of audio clips as…

Sound · Computer Science 2021-11-29 Wangkai Jin , Junyu Liu , Jianfeng Ren , Xiangjun Peng

Polyphonic sound event detection (polyphonic SED) is an interesting but challenging task due to the concurrence of multiple sound events. Recently, SED methods based on convolutional neural networks (CNN) and recurrent neural networks (RNN)…

Audio and Speech Processing · Electrical Eng. & Systems 2018-07-24 Yaming Liu , Jian Tang , Yan Song , Lirong Dai

Sound event detection (SED) entails identifying the type of sound and estimating its temporal boundaries from acoustic signals. These events are uniquely characterized by their spatio-temporal features, which are determined by the way they…

Audio and Speech Processing · Electrical Eng. & Systems 2023-05-19 Tanmay Khandelwal , Rohan Kumar Das

Sound event detection (SED) and acoustic scene classification (ASC) are important research topics in environmental sound analysis. Many research groups have addressed SED and ASC using neural-network-based methods, such as the convolutional…

Sound · Computer Science 2021-02-24 Noriyuki Tonami , Keisuke Imoto , Ryosuke Yamanishi , Yoichi Yamashita

Sound event detection is a challenging task, especially for scenes with multiple simultaneous events. While event classification methods tend to be fairly accurate, event localization presents additional challenges, especially when large…

Audio and Speech Processing · Electrical Eng. & Systems 2018-11-12 Sandeep Kothinti , Keisuke Imoto , Debmalya Chakrabarty , Gregory Sell , Shinji Watanabe , Mounya Elhilali

In this paper, we propose a stacked convolutional and recurrent neural network (CRNN) with a 3D convolutional neural network (CNN) in the first layer for the multichannel sound event detection (SED) task. The 3D CNN enables the network to…

Sound · Computer Science 2018-01-30 Sharath Adavanne , Archontis Politis , Tuomas Virtanen

One hour before sunrise, one can experience the dawn chorus where birds from different species sing together. In this scenario, high levels of polyphony, as in the number of overlapping sound sources, are prone to happen resulting in a…

Sound · Computer Science 2022-07-14 Alberto García Arroba Parrilla , Dan Stowell

This paper considers a semi-supervised learning framework for weakly labeled polyphonic sound event detection problems for the DCASE 2019 challenge's task4 by combining both the tri-training and adversarial learning. The goal of the task4…

Sound · Computer Science 2019-10-16 Hyoungwoo Park , Sungrack Yun , Jungyun Eum , Janghoon Cho , Kyuwoong Hwang

This paper presents our work of training acoustic event detection (AED) models using unlabeled dataset. Recent acoustic event detectors are based on large-scale neural networks, which are typically trained with huge amounts of labeled data.…

Audio and Speech Processing · Electrical Eng. & Systems 2019-05-01 Bowen Shi , Ming Sun , Chieh-Chi Kao , Viktor Rozgic , Spyros Matsoukas , Chao Wang

Sound event detection (SED) and acoustic scene classification (ASC) are major tasks in environmental sound analysis. Considering that sound events and scenes are closely related to each other, some works have addressed joint analyses of…

This paper proposes an effective modelling of sound event spectra with a hidden data-size-imbalance, for improved Acoustic Event Detection (AED). The proposed method models each event as an aggregated representation of a few latent factors,…

Audio and Speech Processing · Electrical Eng. & Systems 2019-04-08 Chaitanya Narisetty , Tatsuya Komatsu , Reishi Kondo
‹ Prev 1 2 3 10 Next ›