Related papers: Audio-Visual Scene Classification Using A Transfer…

200 papers

In this paper, we propose two techniques, namely joint modeling and data augmentation, to improve system performance for audio-visual scene classification (AVSC). We employ pre-trained networks trained only on image data sets to extract…

Recent efforts have been made on acoustic scene classification in the audio signal processing community. In contrast, few studies have been conducted on acoustic scene clustering, which is a newly emerging problem. Acoustic scene clustering…

Audio and Speech Processing · Electrical Eng. & Systems 2023-06-12 Yanxiong Li, Mingle Liu, Wucheng Wang, Yuhan Zhang, Qianhua He

In the past, Acoustic Scene Classification systems have been based on hand-crafted audio features that are input to a classifier. Nowadays, the common trend is to adopt data-driven techniques, e.g., deep learning, where audio…

Sound · Computer Science 2018-06-29 Eduardo Fonseca, Rong Gong, Xavier Serra

In Acoustic Scene Classification (ASC), two major approaches have been followed. While one utilizes engineered features such as mel-frequency cepstral coefficients (MFCCs), the other uses learned features that are the outcome of an…

Sound · Computer Science 2017-11-15 Hamid Eghbal-zadeh, Bernhard Lehner, Matthias Dorfer, Gerhard Widmer

Acoustic scene classification (ASC) has been approached in recent years using deep learning techniques such as convolutional neural networks or recurrent neural networks. Many state-of-the-art solutions are based on image classification…

The use of multiple, semantically correlated sources can provide complementary information that may not be evident when working with individual modalities on their own. In this context, multi-modal models can help produce…

In this paper, we present a deep learning framework applied to Acoustic Scene Classification (ASC), the task of classifying scene contexts from environmental input sounds. An ASC system generally comprises two main steps, referred to as…

Sound · Computer Science 2020-05-27 Dat Ngo, Hao Hoang, Anh Nguyen, Tien Ly, Lam Pham

Acoustic scene classification systems using deep neural networks classify given recordings into pre-defined classes. In this study, we propose a novel scheme for acoustic scene classification which adopts an audio tagging system inspired by…

Audio and Speech Processing · Electrical Eng. & Systems 2020-04-21 Jee-weon Jung, Hye-jin Shim, Ju-ho Kim, Seung-bin Kim, Ha-Jin Yu

This article proposes an encoder-decoder network model for Acoustic Scene Classification (ASC), the task of identifying the scene of an audio recording from its acoustic signature. We make use of multiple low-level spectrogram features at…

Sound · Computer Science 2020-02-12 Lam Pham, Huy Phan, Truc Nguyen, Ramaswamy Palaniappan, Alfred Mertins, Ian McLoughlin

Acoustic scene classification (ASC) and acoustic event detection (AED) are different but related tasks. Acoustic events can provide useful information for recognizing acoustic scenes. However, most of the datasets are provided without…

Sound · Computer Science 2020-10-27 Ruixiong Zhang, Wei Zou, Xiangang Li

In the acoustic scene classification (ASC) task, an acoustic scene consists of diverse sounds and is inferred by identifying combinations of distinct attributes among them. This study aims to extract and cluster these attributes effectively…

Sound · Computer Science 2022-07-01 Won-Gook Choi, Joon-Hyuk Chang, Jae-Mo Yang, Han-Gil Moon

In this paper, we present a low-complexity deep learning framework for acoustic scene classification (ASC). The proposed framework can be separated into three main steps: front-end spectrogram extraction, back-end classification, and late…

Sound · Computer Science 2021-06-17 Lam Pham, Hieu Tang, Anahid Jalali, Alexander Schindler, Ross King

We present a compact, quantization-ready acoustic scene classification (ASC) framework that couples an efficient student network with a learned teacher ensemble and knowledge distillation. The student backbone uses stacked…

In this work, we introduce an efficient approach for audio scene classification using deep recurrent neural networks. An audio scene is first transformed into a sequence of high-level label tree embedding feature vectors. The vector…

Sound · Computer Science 2017-06-06 Huy Phan, Philipp Koch, Fabrice Katzberg, Marco Maass, Radoslaw Mazur, Alfred Mertins

This paper presents an alternative to the commonly used time-frequency representation for acoustic scene classification (ASC). A raw audio signal is represented using a pre-trained convolutional neural network (CNN) using…

Audio and Speech Processing · Electrical Eng. & Systems 2022-04-04 Arshdeep Singh

Recently, convolutional neural networks (CNNs) have achieved state-of-the-art performance in the acoustic scene classification (ASC) task. The audio data is often transformed into two-dimensional spectrogram representations, which are then…

Sound · Computer Science 2020-07-09 Helin Wang, Yuexian Zou, Dading Chong

In this paper, we present deep learning frameworks for audio-visual scene classification (SC) and indicate how individual visual and audio features as well as their combination affect SC performance. Our extensive experiments, which are…

Sound · Computer Science 2021-06-17 Lam Pham, Alexander Schindler, Mina Schütz, Jasmin Lampert, Sven Schlarb, Ross King

The goal of the acoustic scene classification (ASC) task is to classify recordings into one of the predefined acoustic scene classes. However, in real-world scenarios, ASC systems often encounter challenges such as recording device…

Acoustic scene classification (ASC) is a machine listening problem whose objective is to classify or tag an audio clip with a predefined label describing a scene location (e.g., park, airport, etc.). Many state-of-the-art…

Sound · Computer Science 2020-06-29 Javier Naranjo-Alcazar, Sergi Perez-Castanos, Pedro Zuccarello, Maximo Cobos

Audio-Visual Segmentation (AVS) aims to achieve pixel-level localization of sound sources in videos, while Audio-Visual Semantic Segmentation (AVSS), as an extension of AVS, further pursues semantic understanding of audio-visual scenes.…

Computer Vision and Pattern Recognition · Computer Science 2024-09-13 Juncheng Ma, Peiwen Sun, Yaoting Wang, Di Hu