Related papers: Audio-Visual Scene Classification Using A Transfer…

A study on joint modeling and data augmentation of multi-modalities for audio-visual scene classification

In this paper, we propose two techniques, namely joint modeling and data augmentation, to improve system performances for audio-visual scene classification (AVSC). We employ pre-trained networks trained only on image data sets to extract…

Multimedia · Computer Science 2022-09-02 Qing Wang, Jun Du, Siyuan Zheng, Yunqing Li, Yajian Wang, Yuzhong Wu, Hu Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Yannan Wang, Chin-Hui Lee

Acoustic Scene Clustering Using Joint Optimization of Deep Embedding Learning and Clustering Iteration

Recent efforts have been made on acoustic scene classification in the audio signal processing community. In contrast, few studies have been conducted on acoustic scene clustering, which is a newly emerging problem. Acoustic scene clustering…

Audio and Speech Processing · Electrical Eng. & Systems 2023-06-12 Yanxiong Li, Mingle Liu, Wucheng Wang, Yuhan Zhang, Qianhua He

A Simple Fusion of Deep and Shallow Learning for Acoustic Scene Classification

In the past, Acoustic Scene Classification systems have been based on hand crafting audio features that are input to a classifier. Nowadays, the common trend is to adopt data driven techniques, e.g., deep learning, where audio…

Sound · Computer Science 2018-06-29 Eduardo Fonseca, Rong Gong, Xavier Serra

A Hybrid Approach with Multi-channel I-Vectors and Convolutional Neural Networks for Acoustic Scene Classification

In Acoustic Scene Classification (ASC) two major approaches have been followed. While one utilizes engineered features such as mel-frequency-cepstral-coefficients (MFCCs), the other uses learned features that are the outcome of an…

Sound · Computer Science 2017-11-15 Hamid Eghbal-zadeh, Bernhard Lehner, Matthias Dorfer, Gerhard Widmer

CNN depth analysis with different channel inputs for Acoustic Scene Classification

Acoustic scene classification (ASC) has been approached in the last years using deep learning techniques such as convolutional neural networks or recurrent neural networks. Many state-of-the-art solutions are based on image classification…

Sound · Computer Science 2021-08-16 Sergi Perez-Castanos, Javier Naranjo-Alcazar, Pedro Zuccarello, Maximo Cobos, Frances J. Ferri

Squeeze-Excitation Convolutional Recurrent Neural Networks for Audio-Visual Scene Classification

The use of multiple and semantically correlated sources can provide complementary information to each other that may not be evident when working with individual modalities on their own. In this context, multi-modal models can help producing…

Multimedia · Computer Science 2021-07-29 Javier Naranjo-Alcazar, Sergi Perez-Castanos, Aaron Lopez-Garcia, Pedro Zuccarello, Maximo Cobos, Francesc J. Ferri

Sound Context Classification Basing on Join Learning Model and Multi-Spectrogram Features

In this paper, we present a deep learning framework applied for Acoustic Scene Classification (ASC), the task of classifying scene contexts from environmental input sounds. An ASC system generally comprises two main steps, referred to as…

Sound · Computer Science 2020-05-27 Dat Ngo, Hao Hoang, Anh Nguyen, Tien Ly, Lam Pham

Acoustic Scene Classification using Audio Tagging

Acoustic scene classification systems using deep neural networks classify given recordings into pre-defined classes. In this study, we propose a novel scheme for acoustic scene classification which adopts an audio tagging system inspired by…

Audio and Speech Processing · Electrical Eng. & Systems 2020-04-21 Jee-weon Jung, Hye-jin Shim, Ju-ho Kim, Seung-bin Kim, Ha-Jin Yu

Robust Acoustic Scene Classification using a Multi-Spectrogram Encoder-Decoder Framework

This article proposes an encoder-decoder network model for Acoustic Scene Classification (ASC), the task of identifying the scene of an audio recording from its acoustic signature. We make use of multiple low-level spectrogram features at…

Sound · Computer Science 2020-02-12 Lam Pham, Huy Phan, Truc Nguyen, Ramaswamy Palaniappan, Alfred Mertins, Ian McLoughlin

Cross-task pre-training for on-device acoustic scene classification

Acoustic scene classification (ASC) and acoustic event detection (AED) are different but related tasks. Acoustic events can provide useful information for recognizing acoustic scenes. However, most of the datasets are provided without…

Sound · Computer Science 2020-10-27 Ruixiong Zhang, Wei Zou, Xiangang Li

Instance-level loss based multiple-instance learning framework for acoustic scene classification

In the acoustic scene classification (ASC) task, an acoustic scene consists of diverse sounds and is inferred by identifying combinations of distinct attributes among them. This study aims to extract and cluster these attributes effectively…

Sound · Computer Science 2022-07-01 Won-Gook Choi, Joon-Hyuk Chang, Jae-Mo Yang, Han-Gil Moon

A Low-Compexity Deep Learning Framework For Acoustic Scene Classification

In this paper, we present a low-complexity deep learning framework for acoustic scene classification (ASC). The proposed framework can be separated into three main steps: front-end spectrogram extraction, back-end classification, and late…

Sound · Computer Science 2021-06-17 Lam Pham, Hieu Tang, Anahid Jalali, Alexander Schindler, Ross King

Ensemble-Guided Distillation for Compact and Robust Acoustic Scene Classification on Edge Devices

We present a compact, quantization-ready acoustic scene classification (ASC) framework that couples an efficient student network with a learned teacher ensemble and knowledge distillation. The student backbone uses stacked…

Sound · Computer Science 2025-12-17 Hossein Sharify, Behnam Raoufi, Mahdy Ramezani, Khosrow Hajsadeghi, Saeed Bagheri Shouraki

Audio Scene Classification with Deep Recurrent Neural Networks

We introduce in this work an efficient approach for audio scene classification using deep recurrent neural networks. An audio scene is firstly transformed into a sequence of high-level label tree embedding feature vectors. The vector…

Sound · Computer Science 2017-06-06 Huy Phan, Philipp Koch, Fabrice Katzberg, Marco Maass, Radoslaw Mazur, Alfred Mertins

1-D CNN based Acoustic Scene Classification via Reducing Layer-wise Dimensionality

This paper presents an alternate representation framework to commonly used time-frequency representation for acoustic scene classification (ASC). A raw audio signal is represented using a pre-trained convolutional neural network (CNN) using…

Audio and Speech Processing · Electrical Eng. & Systems 2022-04-04 Arshdeep Singh

Acoustic Scene Classification with Spectrogram Processing Strategies

Recently, convolutional neural networks (CNN) have achieved the state-of-the-art performance in acoustic scene classification (ASC) task. The audio data is often transformed into two-dimensional spectrogram representations, which are then…

Sound · Computer Science 2020-07-09 Helin Wang, Yuexian Zou, Dading Chong

Deep Learning Frameworks Applied For Audio-Visual Scene Classification

In this paper, we present deep learning frameworks for audio-visual scene classification (SC) and indicate how individual visual and audio features as well as their combination affect SC performance. Our extensive experiments, which are…

Sound · Computer Science 2021-06-17 Lam Pham, Alexander Schindler, Mina Schütz, Jasmin Lampert, Sven Schlarb, Ross King

Data-Efficient Low-Complexity Acoustic Scene Classification via Distilling and Progressive Pruning

The goal of the acoustic scene classification (ASC) task is to classify recordings into one of the predefined acoustic scene classes. However, in real-world scenarios, ASC systems often encounter challenges such as recording device…

Sound · Computer Science 2025-05-08 Bing Han, Wen Huang, Zhengyang Chen, Anbai Jiang, Pingyi Fan, Cheng Lu, Zhiqiang Lv, Jia Liu, Wei-Qiang Zhang, Yanmin Qian

Acoustic Scene Classification with Squeeze-Excitation Residual Networks

Acoustic scene classification (ASC) is a problem related to the field of machine listening whose objective is to classify/tag an audio clip with a predefined label describing a scene location (e.g., park, airport, etc.). Many state-of-the-art…

Sound · Computer Science 2020-06-29 Javier Naranjo-Alcazar, Sergi Perez-Castanos, Pedro Zuccarello, Maximo Cobos

Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation

Audio-Visual Segmentation (AVS) aims to achieve pixel-level localization of sound sources in videos, while Audio-Visual Semantic Segmentation (AVSS), as an extension of AVS, further pursues semantic understanding of audio-visual scenes.…

Computer Vision and Pattern Recognition · Computer Science 2024-09-13 Juncheng Ma, Peiwen Sun, Yaoting Wang, Di Hu