Related papers: Polyphonic audio event detection: multi-label or m…

Polyphonic Sound Event and Sound Activity Detection: A Multi-task approach

Polyphonic Sound Event Detection (SED) in real-world recordings is a challenging task because of the dynamic polyphony level, intensity, and duration of sound events. Current polyphonic SED systems fail to model the temporal structure of…

Audio and Speech Processing · Electrical Eng. & Systems 2019-08-02 Arjun Pankajakshan , Helen L. Bear , Emmanouil Benetos

Unifying Isolated and Overlapping Audio Event Detection with Multi-Label Multi-Task Convolutional Recurrent Neural Networks

We propose a multi-label multi-task framework based on a convolutional recurrent neural network to unify detection of isolated and overlapping audio events. The framework leverages the power of convolutional recurrent neural network…

Machine Learning · Computer Science 2019-02-20 Huy Phan , Oliver Y. Chén , Philipp Koch , Lam Pham , Ian McLoughlin , Alfred Mertins , Maarten De Vos

Multi-label Open-set Audio Classification

Current audio classification models have small class vocabularies relative to the large number of sound event classes of interest in the real world. Thus, they provide a limited view of the world that may miss important yet unexpected or…

Sound · Computer Science 2023-10-24 Sripathi Sridhar , Mark Cartwright

Polyphonic sound event detection based on convolutional recurrent neural networks with semi-supervised loss function for DCASE challenge 2020 task 4

This report proposes a polyphonic sound event detection (SED) method for the DCASE 2020 Challenge Task 4. The proposed SED method is based on semi-supervised learning to deal with the different combination of training datasets such as…

Audio and Speech Processing · Electrical Eng. & Systems 2020-07-03 Nam Kyun Kim , Hong Kook Kim

Polyphonic Sound Event Detection by using Capsule Neural Networks

Artificial sound event detection (SED) has the aim to mimic the human ability to perceive and understand what is happening in the surroundings. Nowadays, Deep Learning offers valuable techniques for this goal such as Convolutional Neural…

Audio and Speech Processing · Electrical Eng. & Systems 2019-06-26 Fabio Vesperini , Leonardo Gabrielli , Emanuele Principi , Stefano Squartini

Event-Independent Network for Polyphonic Sound Event Localization and Detection

Polyphonic sound event localization and detection is not only detecting what sound events are happening but localizing corresponding sound sources. This series of tasks was first introduced in DCASE 2019 Task 3. In 2020, the sound event…

Audio and Speech Processing · Electrical Eng. & Systems 2020-10-02 Yin Cao , Turab Iqbal , Qiuqiang Kong , Yue Zhong , Wenwu Wang , Mark D. Plumbley

An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection

Polyphonic sound event localization and detection (SELD), which jointly performs sound event detection (SED) and direction-of-arrival (DoA) estimation, detects the type and occurrence time of sound events as well as their corresponding DoA…

Sound · Computer Science 2021-02-12 Yin Cao , Turab Iqbal , Qiuqiang Kong , Fengyan An , Wenwu Wang , Mark D. Plumbley

Joint Analysis of Acoustic Scenes and Sound Events with Weakly labeled Data

Considering that acoustic scenes and sound events are closely related to each other, in some previous papers, a joint analysis of acoustic scenes and sound events utilizing multitask learning (MTL)-based neural networks was proposed. In…

Sound · Computer Science 2022-07-12 Shunsuke Tsubaki , Keisuke Imoto , Nobutaka Ono

Cross-Referencing Self-Training Network for Sound Event Detection in Audio Mixtures

Sound event detection is an important facet of audio tagging that aims to identify sounds of interest and define both the sound category and time boundaries for each sound event in a continuous recording. With advances in deep neural…

Sound · Computer Science 2024-12-31 Sangwook Park , David K. Han , Mounya Elhilali

Polyphonic Sound Event Detection Using Capsule Neural Network on Multi-Type-Multi-Scale Time-Frequency Representation

The challenges of polyphonic sound event detection (PSED) stem from the detection of multiple overlapping events in a time series. Recent efforts exploit Deep Neural Networks (DNNs) on Time-Frequency Representations (TFRs) of audio clips as…

Sound · Computer Science 2021-11-29 Wangkai Jin , Junyu Liu , Jianfeng Ren , Xiangjun Peng

A Capsule based Approach for Polyphonic Sound Event Detection

Polyphonic sound event detection (polyphonic SED) is an interesting but challenging task due to the concurrence of multiple sound events. Recently, SED methods based on convolutional neural networks (CNN) and recurrent neural networks (RNN)…

Audio and Speech Processing · Electrical Eng. & Systems 2018-07-24 Yaming Liu , Jian Tang , Yan Song , Lirong Dai

A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds

Sound event detection (SED) entails identifying the type of sound and estimating its temporal boundaries from acoustic signals. These events are uniquely characterized by their spatio-temporal features, which are determined by the way they…

Audio and Speech Processing · Electrical Eng. & Systems 2023-05-19 Tanmay Khandelwal , Rohan Kumar Das

Joint Analysis of Sound Events and Acoustic Scenes Using Multitask Learning

Sound event detection (SED) and acoustic scene classification (ASC) are important research topics in environmental sound analysis. Many research groups have addressed SED and ASC using neural-network-based methods, such as the convolutional…

Sound · Computer Science 2021-02-24 Noriyuki Tonami , Keisuke Imoto , Ryosuke Yamanishi , Yoichi Yamashita

Joint Acoustic and Class Inference for Weakly Supervised Sound Event Detection

Sound event detection is a challenging task, especially for scenes with multiple simultaneous events. While event classification methods tend to be fairly accurate, event localization presents additional challenges, especially when large…

Audio and Speech Processing · Electrical Eng. & Systems 2018-11-12 Sandeep Kothinti , Keisuke Imoto , Debmalya Chakrabarty , Gregory Sell , Shinji Watanabe , Mounya Elhilali

Multichannel Sound Event Detection Using 3D Convolutional Neural Networks for Learning Inter-channel Features

In this paper, we propose a stacked convolutional and recurrent neural network (CRNN) with a 3D convolutional neural network (CNN) in the first layer for the multichannel sound event detection (SED) task. The 3D CNN enables the network to…

Sound · Computer Science 2018-01-30 Sharath Adavanne , Archontis Politis , Tuomas Virtanen

Polyphonic sound event detection for highly dense birdsong scenes

One hour before sunrise, one can experience the dawn chorus where birds from different species sing together. In this scenario, high levels of polyphony, as in the number of overlapping sound sources, are prone to happen resulting in a…

Sound · Computer Science 2022-07-14 Alberto García Arroba Parrilla , Dan Stowell

Weakly Labeled Sound Event Detection Using Tri-training and Adversarial Learning

This paper considers a semi-supervised learning framework for weakly labeled polyphonic sound event detection problems for the DCASE 2019 challenge's task4 by combining both the tri-training and adversarial learning. The goal of the task4…

Sound · Computer Science 2019-10-16 Hyoungwoo Park , Sungrack Yun , Jungyun Eum , Janghoon Cho , Kyuwoong Hwang

Semi-supervised Acoustic Event Detection based on tri-training

This paper presents our work of training acoustic event detection (AED) models using unlabeled dataset. Recent acoustic event detectors are based on large-scale neural networks, which are typically trained with huge amounts of labeled data.…

Audio and Speech Processing · Electrical Eng. & Systems 2019-05-01 Bowen Shi , Ming Sun , Chieh-Chi Kao , Viktor Rozgic , Spyros Matsoukas , Chao Wang

Sound Event Detection by Multitask Learning of Sound Events and Scenes with Soft Scene Labels

Sound event detection (SED) and acoustic scene classification (ASC) are major tasks in environmental sound analysis. Considering that sound events and scenes are closely related to each other, some works have addressed joint analyses of…

Sound · Computer Science 2020-02-17 Keisuke Imoto , Noriyuki Tonami , Yuma Koizumi , Masahiro Yasuda , Ryosuke Yamanishi , Yoichi Yamashita

Modelling of Sound Events with Hidden Imbalances Based on Clustering and Separate Sub-Dictionary Learning

This paper proposes an effective modelling of sound event spectra with a hidden data-size-imbalance, for improved Acoustic Event Detection (AED). The proposed method models each event as an aggregated representation of a few latent factors,…

Audio and Speech Processing · Electrical Eng. & Systems 2019-04-08 Chaitanya Narisetty , Tatsuya Komatsu , Reishi Kondo