Related papers: An Audio-Based Deep Learning Framework For BBC Tel…

Convolutional Neural Network Achieves Human-level Accuracy in Music Genre Classification

Music genre classification is one example of content-based analysis of music signals. Traditionally, human-engineered features were used to automatize this task and 61% accuracy has been achieved in the 10-genre classification. However,…

Sound · Computer Science 2024-10-16 Mingwen Dong

An Ensemble of Deep Learning Frameworks Applied For Predicting Respiratory Anomalies

In this paper, we evaluate various deep learning frameworks for detecting respiratory anomalies from input audio recordings. To this end, we firstly transform audio respiratory cycles collected from patients into spectrograms where both…

Sound · Computer Science 2022-01-11 Lam Pham , Dat Ngo , Truong Hoang , Alexander Schindler , Ian McLoughlin

SpectNet : End-to-End Audio Signal Classification Using Learnable Spectrograms

Pattern recognition from audio signals is an active research topic encompassing audio tagging, acoustic scene classification, music classification, and other areas. Spectrogram and mel-frequency cepstral coefficients (MFCC) are among the…

Audio and Speech Processing · Electrical Eng. & Systems 2022-11-18 Md. Istiaq Ansari , Taufiq Hasan

Deep Learning Framework Applied for Predicting Anomaly of Respiratory Sounds

This paper proposes a robust deep learning framework used for classifying anomaly of respiratory cycles. Initially, our framework starts with front-end feature extraction step. This step aims to transform the respiratory input sound into a…

Machine Learning · Computer Science 2020-12-29 Dat Ngo , Lam Pham , Anh Nguyen , Ben Phan , Khoa Tran , Truong Nguyen

CNN-MoE based framework for classification of respiratory anomalies and lung disease detection

This paper presents and explores a robust deep learning framework for auscultation analysis. This aims to classify anomalies in respiratory cycles and detect disease, from respiratory sound recordings. The framework begins with front-end…

Audio and Speech Processing · Electrical Eng. & Systems 2020-06-04 Lam Pham , Huy Phan , Ramaswamy Palaniappan , Alfred Mertins , Ian McLoughlin

Robust Deep Learning Framework For Predicting Respiratory Anomalies and Diseases

This paper presents a robust deep learning framework developed to detect respiratory diseases from recordings of respiratory sounds. The complete detection process firstly involves front end feature extraction where recordings are…

Sound · Computer Science 2020-02-11 Lam Pham , Ian McLoughlin , Huy Phan , Minh Tran , Truc Nguyen , Ramaswamy Palaniappan

Using Deep learning methods for generation of a personalized list of shuffled songs

The shuffle mode, where songs are played in a randomized order that is decided upon for all tracks at once, is widely found and known to exist in music player systems. There are only few music enthusiasts who use this mode since it either…

Information Retrieval · Computer Science 2019-09-09 Rushin Gindra , Srushti Kotak , Asmita Natekar , Grishma Sharma

Background-tracking Acoustic Features for Genre Identification of Broadcast Shows

This paper presents a novel method for extracting acoustic features that characterise the background environment in audio recordings. These features are based on the output of an alignment that fits multiple parallel background--based…

Sound · Computer Science 2016-11-17 Oscar Saz , Mortaza Doulaty , Thomas Hain

Automatic Genre and Show Identification of Broadcast Media

Huge amounts of digital videos are being produced and broadcast every day, leading to giant media archives. Effective techniques are needed to make such data accessible further. Automatic meta-data labelling of broadcast media is an…

Multimedia · Computer Science 2016-06-13 Mortaza Doulaty , Oscar Saz , Raymond W. M. Ng , Thomas Hain

Deep clustering: Discriminative embeddings for segmentation and separation

We address the problem of acoustic source separation in a deep learning framework we call "deep clustering." Rather than directly estimating signals or masking functions, we train a deep network to produce spectrogram embeddings that are…

Neural and Evolutionary Computing · Computer Science 2015-08-19 John R. Hershey , Zhuo Chen , Jonathan Le Roux , Shinji Watanabe

Artificially Synthesising Data for Audio Classification and Segmentation to Improve Speech and Music Detection in Radio Broadcast

Segmenting audio into homogeneous sections such as music and speech helps us understand the content of audio. It is useful as a pre-processing step to index, store, and modify audio recordings, radio broadcasts and TV programmes. Deep…

Audio and Speech Processing · Electrical Eng. & Systems 2021-02-22 Satvik Venkatesh , David Moffat , Alexis Kirke , Gözel Shakeri , Stephen Brewster , Jörg Fachner , Helen Odell-Miller , Alex Street , Nicolas Farina , Sube Banerjee , Eduardo Reck Miranda

Explaining Deep Convolutional Neural Networks on Music Classification

Deep convolutional neural networks (CNNs) have been actively adopted in the field of music information retrieval, e.g. genre classification, mood detection, and chord recognition. However, the process of learning and prediction is little…

Machine Learning · Computer Science 2016-07-11 Keunwoo Choi , George Fazekas , Mark Sandler

Musical instrument sound classification with deep convolutional neural network using feature fusion approach

A new musical instrument classification method using convolutional neural networks (CNNs) is presented in this paper. Unlike the traditional methods, we investigated a scheme for classifying musical instruments using the learned features…

Sound · Computer Science 2015-12-24 Taejin Park , Taejin Lee

Spectrum Sensing Based on Deep Learning Classification for Cognitive Radios

Spectrum sensing is a key technology for cognitive radios. We present spectrum sensing as a classification problem and propose a sensing method based on deep learning classification. We normalize the received signal power to overcome the…

Signal Processing · Electrical Eng. & Systems 2019-09-16 Shilian Zheng , Shichuan Chen , Peihan Qi , Huaji Zhou , Xiaoniu Yang

Deepfake Audio Detection Using Spectrogram-based Feature and Ensemble of Deep Learning Models

In this paper, we propose a deep learning based system for the task of deepfake audio detection. In particular, the draw input audio is first transformed into various spectrograms using three transformation methods of Short-time Fourier…

Sound · Computer Science 2024-07-03 Lam Pham , Phat Lam , Truong Nguyen , Huyen Nguyen , Alexander Schindler

Deep CNN Framework for Audio Event Recognition using Weakly Labeled Web Data

The development of audio event recognition systems require labeled training data, which are generally hard to obtain. One promising source of recordings of audio events is the large amount of multimedia data on the web. In particular, if…

Sound · Computer Science 2022-10-04 Anurag Kumar , Bhiksha Raj

Music Genre Classification: Training an AI model

Music genre classification is an area that utilizes machine learning models and techniques for the processing of audio signals, in which applications range from content recommendation systems to music recommendation systems. In this…

Sound · Computer Science 2024-05-27 Keoikantse Mogonediwa

Deep Learning Frameworks Applied For Audio-Visual Scene Classification

In this paper, we present deep learning frameworks for audio-visual scene classification (SC) and indicate how individual visual and audio features as well as their combination affect SC performance. Our extensive experiments, which are…

Sound · Computer Science 2021-06-17 Lam Pham , Alexander Schindler , Mina Schütz , Jasmin Lampert , Sven Schlarb , Ross King

Audio Classification of Low Feature Spectrograms Utilizing Convolutional Neural Networks

Modern day audio signal classification techniques lack the ability to classify low feature audio signals in the form of spectrographic temporal frequency data representations. Additionally, currently utilized techniques rely on full diverse…

Sound · Computer Science 2024-10-30 Noel Elias

Bottom-up Broadcast Neural Network For Music Genre Classification

Music genre recognition based on visual representation has been successfully explored over the last years. Recently, there has been increasing interest in attempting convolutional neural networks (CNNs) to achieve the task. However, most of…

Sound · Computer Science 2019-01-28 Caifeng Liu , Lin Feng , Guochao Liu , Huibing Wang , Shenglan Liu