Related papers: Guided Learning Convolution System for DCASE 2019 …

Sound event detection using weakly-labeled semi-supervised data with GCRNNS, VAT and Self-Adaptive Label Refinement

In this paper, we present a gated convolutional recurrent neural network based approach to solve task 4, large-scale weakly labelled semi-supervised sound event detection in domestic environments, of the DCASE 2018 challenge. Gated linear…

Sound · Computer Science 2018-10-17 Robert Harb , Franz Pernkopf

Polyphonic sound event detection based on convolutional recurrent neural networks with semi-supervised loss function for DCASE challenge 2020 task 4

This report proposes a polyphonic sound event detection (SED) method for the DCASE 2020 Challenge Task 4. The proposed SED method is based on semi-supervised learning to deal with the different combination of training datasets such as…

Audio and Speech Processing · Electrical Eng. & Systems 2020-07-03 Nam Kyun Kim , Hong Kook Kim

Guided multi-branch learning systems for sound event detection with sound separation

In this paper, we describe in detail our systems for DCASE 2020 Task 4. The systems are based on the 1st-place system of DCASE 2019 Task 4, which adopts weakly-supervised framework with an attention-based embedding-level pooling module and…

Sound · Computer Science 2020-11-03 Yuxin Huang , Liwei Lin , Shuo Ma , Xiangdong Wang , Hong Liu , Yueliang Qian , Min Liu , Kazushige Ouch

Semi-Supervised NMF-CNN For Sound Event Detection

In this paper, a combinative approach using Nonnegative Matrix Factorization (NMF) and Convolutional Neural Network (CNN) is proposed for audio clip Sound Event Detection (SED). The main idea begins with the use of NMF to approximate strong…

Audio and Speech Processing · Electrical Eng. & Systems 2020-09-22 Chan Teck Kai , Chin Cheng Siong , Li Ye

SAM-GCNN: A Gated Convolutional Neural Network with Segment-Level Attention Mechanism for Home Activity Monitoring

In this paper, we propose a method for home activity monitoring. We demonstrate our model on dataset of Detection and Classification of Acoustic Scenes and Events (DCASE) 2018 Challenge Task 5. This task aims to classify multi-channel…

Sound · Computer Science 2018-11-15 Yu-Han Shen , Ke-Xin He , Wei-Qiang Zhang

A Hybrid System of Sound Event Detection Transformer and Frame-wise Model for DCASE 2022 Task 4

In this paper, we describe in detail our system for DCASE 2022 Task4. The system combines two considerably different models: an end-to-end Sound Event Detection Transformer (SEDT) and a frame-wise model, Metric Learning and Focal Loss CNN…

Sound · Computer Science 2022-10-19 Yiming Li , Zhifang Guo , Zhirong Ye , Xiangdong Wang , Hong Liu , Yueliang Qian , Rui Tao , Long Yan , Kazushige Ouchi

Sound Event Detection with Depthwise Separable and Dilated Convolutions

State-of-the-art sound event detection (SED) methods usually employ a series of convolutional neural networks (CNNs) to extract useful features from the input audio signal, and then recurrent neural networks (RNNs) to model longer temporal…

Sound · Computer Science 2020-02-04 Konstantinos Drossos , Stylianos I. Mimilakis , Shayan Gharib , Yanxiong Li , Tuomas Virtanen

A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds

Sound event detection (SED) entails identifying the type of sound and estimating its temporal boundaries from acoustic signals. These events are uniquely characterized by their spatio-temporal features, which are determined by the way they…

Audio and Speech Processing · Electrical Eng. & Systems 2023-05-19 Tanmay Khandelwal , Rohan Kumar Das

Surrey-cvssp system for DCASE2017 challenge task4

In this technique report, we present a bunch of methods for the task 4 of Detection and Classification of Acoustic Scenes and Events 2017 (DCASE2017) challenge. This task evaluates systems for the large-scale detection of sound events using…

Sound · Computer Science 2017-11-28 Yong Xu , Qiuqiang Kong , Wenwu Wang , Mark D. Plumbley

Semi-supervsied Learning-based Sound Event Detection using Freuqency Dynamic Convolution with Large Kernel Attention for DCASE Challenge 2023 Task 4

This report proposes a frequency dynamic convolution (FDY) with a large kernel attention (LKA)-convolutional recurrent neural network (CRNN) with a pre-trained bidirectional encoder representation from audio transformers (BEATs)…

Audio and Speech Processing · Electrical Eng. & Systems 2023-06-13 Ji Won Kim , Sang Won Son , Yoonah Song , Hong Kook Kim , Il Hoon Song , Jeong Eun Lim

Multichannel Sound Event Detection Using 3D Convolutional Neural Networks for Learning Inter-channel Features

In this paper, we propose a stacked convolutional and recurrent neural network (CRNN) with a 3D convolutional neural network (CNN) in the first layer for the multichannel sound event detection (SED) task. The 3D CNN enables the network to…

Sound · Computer Science 2018-01-30 Sharath Adavanne , Archontis Politis , Tuomas Virtanen

Sound event detection based on auxiliary decoder and maximum probability aggregation for DCASE Challenge 2024 Task 4

In this report, we propose three novel methods for developing a sound event detection (SED) model for the DCASE 2024 Challenge Task 4. First, we propose an auxiliary decoder attached to the final convolutional block to improve feature…

Audio and Speech Processing · Electrical Eng. & Systems 2024-06-25 Sang Won Son , Jongyeon Park , Hong Kook Kim , Sulaiman Vesal , Jeong Eun Lim

Forward-Backward Convolutional Recurrent Neural Networks and Tag-Conditioned Convolutional Neural Networks for Weakly Labeled Semi-supervised Sound Event Detection

In this paper we present our system for the detection and classification of acoustic scenes and events (DCASE) 2020 Challenge Task 4: Sound event detection and separation in domestic environments. We introduce two new models: the…

Audio and Speech Processing · Electrical Eng. & Systems 2021-03-12 Janek Ebbers , Reinhold Haeb-Umbach

Non-Negative Matrix Factorization-Convolutional Neural Network (NMF-CNN) For Sound Event Detection

The main scientific question of this year DCASE challenge, Task 4 - Sound Event Detection in Domestic Environments, is to investigate the types of data (strongly labeled synthetic data, weakly labeled data, unlabeled in domain data)…

Sound · Computer Science 2020-01-23 Teck Kai Chan , Cheng Siong Chin , Ye Li

DNN and CNN with Weighted and Multi-task Loss Functions for Audio Event Detection

This report presents our audio event detection system submitted for Task 2, "Detection of rare sound events", of DCASE 2017 challenge. The proposed system is based on convolutional neural networks (CNNs) and deep neural networks (DNNs)…

Sound · Computer Science 2017-10-19 Huy Phan , Martin Krawczyk-Becker , Timo Gerkmann , Alfred Mertins

Sound Event Detection of Weakly Labelled Data with CNN-Transformer and Automatic Threshold Optimization

Sound event detection (SED) is a task to detect sound events in an audio recording. One challenge of the SED task is that many datasets such as the Detection and Classification of Acoustic Scenes and Events (DCASE) datasets are weakly…

Sound · Computer Science 2020-08-25 Qiuqiang Kong , Yong Xu , Wenwu Wang , Mark D. Plumbley

Optimizing Temporal Resolution Of Convolutional Recurrent Neural Networks For Sound Event Detection

In this technical report, the systems we submitted for subtask 4 of the DCASE 2021 challenge, regarding sound event detection, are described in detail. These models are closely related to the baseline provided for this problem, as they are…

Audio and Speech Processing · Electrical Eng. & Systems 2022-10-20 Wim Boes , Hugo Van hamme

HODGEPODGE: Sound event detection based on ensemble of semi-supervised learning methods

In this paper, we present a method called HODGEPODGE\footnotemark[1] for large-scale detection of sound events using weakly labeled, synthetic, and unlabeled data proposed in the Detection and Classification of Acoustic Scenes and Events…

Sound · Computer Science 2019-07-18 Ziqiang Shi , Liu Liu , Huibin Lin , Rujie Liu , Anyan Shi

Guided learning for weakly-labeled semi-supervised sound event detection

We propose a simple but efficient method termed Guided Learning for weakly-labeled semi-supervised sound event detection (SED). There are two sub-targets implied in weakly-labeled SED: audio tagging and boundary detection. Instead of…

Machine Learning · Computer Science 2020-02-05 Liwei Lin , Xiangdong Wang , Hong Liu , Yueliang Qian

Self-training with noisy student model and semi-supervised loss function for dcase 2021 challenge task 4

This report proposes a polyphonic sound event detection (SED) method for the DCASE 2021 Challenge Task 4. The proposed SED model consists of two stages: a mean-teacher model for providing target labels regarding weakly labeled or unlabeled…

Sound · Computer Science 2021-07-07 Nam Kyun Kim , Hong Kook Kim