Related papers: SubSpectral Normalization for Neural Audio Data Pr…

SubSpectralNet - Using Sub-Spectrogram based Convolutional Neural Networks for Acoustic Scene Classification

Acoustic Scene Classification (ASC) is one of the core research problems in the field of Computational Sound Scene Analysis. In this work, we present SubSpectralNet, a novel model which captures discriminative features by incorporating…

Sound · Computer Science 2019-02-26 Sai Samarth R Phaye , Emmanouil Benetos , Ye Wang

Audio Super Resolution using Neural Networks

We introduce a new audio processing technique that increases the sampling rate of signals such as speech or music using deep convolutional neural networks. Our model is trained on pairs of low and high-quality audio examples; at test-time,…

Sound · Computer Science 2017-08-03 Volodymyr Kuleshov , S. Zayd Enam , Stefano Ermon

Audio Classification of Low Feature Spectrograms Utilizing Convolutional Neural Networks

Modern day audio signal classification techniques lack the ability to classify low feature audio signals in the form of spectrographic temporal frequency data representations. Additionally, currently utilized techniques rely on full diverse…

Sound · Computer Science 2024-10-30 Noel Elias

SpectNet : End-to-End Audio Signal Classification Using Learnable Spectrograms

Pattern recognition from audio signals is an active research topic encompassing audio tagging, acoustic scene classification, music classification, and other areas. Spectrogram and mel-frequency cepstral coefficients (MFCC) are among the…

Audio and Speech Processing · Electrical Eng. & Systems 2022-11-18 Md. Istiaq Ansari , Taufiq Hasan

Spectral Wavelet Dropout: Regularization in the Wavelet Domain

Regularization techniques help prevent overfitting and therefore improve the ability of convolutional neural networks (CNNs) to generalize. One reason for overfitting is the complex co-adaptations among different parts of the network, which…

Computer Vision and Pattern Recognition · Computer Science 2024-09-30 Rinor Cakaj , Jens Mehnert , Bin Yang

A Structurally Regularized Convolutional Neural Network for Image Classification using Wavelet-based SubBand Decomposition

We propose a convolutional neural network (CNN) architecture for image classification based on subband decomposition of the image using wavelets. The proposed architecture decomposes the input image spectra into multiple critically sampled…

Computer Vision and Pattern Recognition · Computer Science 2021-03-03 Pavel Sinha , Ioannis Psaromiligkos , Zeljko Zilic

Spectral and Rhythm Features for Audio Classification with Deep Convolutional Neural Networks

Convolutional neural networks (CNNs) are widely used in computer vision. They can be used not only for conventional digital image material to recognize patterns, but also for feature extraction from digital imagery representing spectral and…

Sound · Computer Science 2025-09-16 Friedrich Wolf-Monheim

Segmenting Hyperspectral Images Using Spectral-Spatial Convolutional Neural Networks With Training-Time Data Augmentation

Hyperspectral imaging provides detailed information about the scanned objects, as it captures their spectral characteristics within a large number of wavelength bands. Classification of such data has become an active research topic due to…

Computer Vision and Pattern Recognition · Computer Science 2019-07-30 Jakub Nalepa , Lukasz Tulczyjew , Michal Myller , Michal Kawulok

Multi-Subspace Neural Network for Image Recognition

In image classification task, feature extraction is always a big issue. Intra-class variability increases the difficulty in designing the extractors. Furthermore, hand-crafted feature extractor cannot simply adapt new situation. Recently,…

Computer Vision and Pattern Recognition · Computer Science 2020-06-18 Chieh-Ning Fang , Chin-Teng Lin

Depthwise Separable Convolutions Allow for Fast and Memory-Efficient Spectral Normalization

An increasing number of models require the control of the spectral norm of convolutional layers of a neural network. While there is an abundance of methods for estimating and enforcing upper bounds on those during training, they are…

Machine Learning · Computer Science 2021-02-15 Christina Runkel , Christian Etmann , Michael Möller , Carola-Bibiane Schönlieb

Domain Generalization with Relaxed Instance Frequency-wise Normalization for Multi-device Acoustic Scene Classification

While using two-dimensional convolutional neural networks (2D-CNNs) in image processing, it is possible to manipulate domain information using channel statistics, and instance normalization has been a promising way to get domain-invariant…

Sound · Computer Science 2022-06-28 Byeonggeun Kim , Seunghan Yang , Jangho Kim , Hyunsin Park , Juntae Lee , Simyung Chang

Learning in the Frequency Domain

Deep neural networks have achieved remarkable success in computer vision tasks. Existing neural networks mainly operate in the spatial domain with fixed input sizes. For practical applications, images are usually large and have to be…

Computer Vision and Pattern Recognition · Computer Science 2020-04-02 Kai Xu , Minghai Qin , Fei Sun , Yuhao Wang , Yen-Kuang Chen , Fengbo Ren

Sub-band Convolutional Neural Networks for Small-footprint Spoken Term Classification

This paper proposes a Sub-band Convolutional Neural Network for spoken term classification. Convolutional neural networks (CNNs) have proven to be very effective in acoustic applications such as spoken term classification, keyword spotting,…

Audio and Speech Processing · Electrical Eng. & Systems 2019-07-03 Chieh-Chi Kao , Ming Sun , Yixin Gao , Shiv Vitaladevuni , Chao Wang

Sound Event Detection with Depthwise Separable and Dilated Convolutions

State-of-the-art sound event detection (SED) methods usually employ a series of convolutional neural networks (CNNs) to extract useful features from the input audio signal, and then recurrent neural networks (RNNs) to model longer temporal…

Sound · Computer Science 2020-02-04 Konstantinos Drossos , Stylianos I. Mimilakis , Shayan Gharib , Yanxiong Li , Tuomas Virtanen

Spectral Batch Normalization: Normalization in the Frequency Domain

Regularization is a set of techniques that are used to improve the generalization ability of deep neural networks. In this paper, we introduce spectral batch normalization (SBN), a novel effective method to improve generalization by…

Computer Vision and Pattern Recognition · Computer Science 2023-06-30 Rinor Cakaj , Jens Mehnert , Bin Yang

Learning Convolutional Neural Networks in the Frequency Domain

Convolutional neural network (CNN) has achieved impressive success in computer vision during the past few decades. The image convolution operation helps CNNs to get good performance on image-related tasks. However, the image convolution has…

Computer Vision and Pattern Recognition · Computer Science 2022-07-21 Hengyue Pan , Yixin Chen , Xin Niu , Wenbo Zhou , Dongsheng Li

Learning to synthesize: splitting and recombining low and high spatial frequencies for image recovery

Deep Neural Network (DNN)-based image reconstruction, despite many successes, often exhibits uneven fidelity between high and low spatial frequency bands. In this paper we propose the Learning Synthesis by DNN (LS-DNN) approach where two…

Computer Vision and Pattern Recognition · Computer Science 2018-11-21 Mo Deng , Shuai Li , George Barbastathis

Spectral Response Function Guided Deep Optimization-driven Network for Spectral Super-resolution

Hyperspectral images are crucial for many research works. Spectral super-resolution (SSR) is a method used to obtain high spatial resolution (HR) hyperspectral images from HR multispectral images. Traditional SSR methods include…

Image and Video Processing · Electrical Eng. & Systems 2020-12-09 Jiang He , Jie Li , Qiangqiang Yuan , Huanfeng Shen , Liangpei Zhang

Switchable Normalization for Learning-to-Normalize Deep Representation

We address a learning-to-normalize problem by proposing Switchable Normalization (SN), which learns to select different normalizers for different normalization layers of a deep neural network. SN employs three distinct scopes to compute…

Computer Vision and Pattern Recognition · Computer Science 2019-07-25 Ping Luo , Ruimao Zhang , Jiamin Ren , Zhanglin Peng , Jingyu Li

Spectral-Spatial Global Graph Reasoning for Hyperspectral Image Classification

Convolutional neural networks have been widely applied to hyperspectral image classification. However, traditional convolutions can not effectively extract features for objects with irregular distributions. Recent methods attempt to address…

Computer Vision and Pattern Recognition · Computer Science 2023-04-07 Di Wang , Bo Du , Liangpei Zhang