English
Related papers

Related papers: SubSpectral Normalization for Neural Audio Data Pr…

200 papers

Acoustic Scene Classification (ASC) is one of the core research problems in the field of Computational Sound Scene Analysis. In this work, we present SubSpectralNet, a novel model which captures discriminative features by incorporating…

Sound · Computer Science 2019-02-26 Sai Samarth R Phaye , Emmanouil Benetos , Ye Wang

We introduce a new audio processing technique that increases the sampling rate of signals such as speech or music using deep convolutional neural networks. Our model is trained on pairs of low and high-quality audio examples; at test-time,…

Sound · Computer Science 2017-08-03 Volodymyr Kuleshov , S. Zayd Enam , Stefano Ermon

Modern day audio signal classification techniques lack the ability to classify low feature audio signals in the form of spectrographic temporal frequency data representations. Additionally, currently utilized techniques rely on full diverse…

Sound · Computer Science 2024-10-30 Noel Elias

Pattern recognition from audio signals is an active research topic encompassing audio tagging, acoustic scene classification, music classification, and other areas. Spectrogram and mel-frequency cepstral coefficients (MFCC) are among the…

Audio and Speech Processing · Electrical Eng. & Systems 2022-11-18 Md. Istiaq Ansari , Taufiq Hasan

Regularization techniques help prevent overfitting and therefore improve the ability of convolutional neural networks (CNNs) to generalize. One reason for overfitting is the complex co-adaptations among different parts of the network, which…

Computer Vision and Pattern Recognition · Computer Science 2024-09-30 Rinor Cakaj , Jens Mehnert , Bin Yang

We propose a convolutional neural network (CNN) architecture for image classification based on subband decomposition of the image using wavelets. The proposed architecture decomposes the input image spectra into multiple critically sampled…

Computer Vision and Pattern Recognition · Computer Science 2021-03-03 Pavel Sinha , Ioannis Psaromiligkos , Zeljko Zilic

Convolutional neural networks (CNNs) are widely used in computer vision. They can be used not only for conventional digital image material to recognize patterns, but also for feature extraction from digital imagery representing spectral and…

Sound · Computer Science 2025-09-16 Friedrich Wolf-Monheim

Hyperspectral imaging provides detailed information about the scanned objects, as it captures their spectral characteristics within a large number of wavelength bands. Classification of such data has become an active research topic due to…

Computer Vision and Pattern Recognition · Computer Science 2019-07-30 Jakub Nalepa , Lukasz Tulczyjew , Michal Myller , Michal Kawulok

In image classification task, feature extraction is always a big issue. Intra-class variability increases the difficulty in designing the extractors. Furthermore, hand-crafted feature extractor cannot simply adapt new situation. Recently,…

Computer Vision and Pattern Recognition · Computer Science 2020-06-18 Chieh-Ning Fang , Chin-Teng Lin

An increasing number of models require the control of the spectral norm of convolutional layers of a neural network. While there is an abundance of methods for estimating and enforcing upper bounds on those during training, they are…

Machine Learning · Computer Science 2021-02-15 Christina Runkel , Christian Etmann , Michael Möller , Carola-Bibiane Schönlieb

While using two-dimensional convolutional neural networks (2D-CNNs) in image processing, it is possible to manipulate domain information using channel statistics, and instance normalization has been a promising way to get domain-invariant…

Sound · Computer Science 2022-06-28 Byeonggeun Kim , Seunghan Yang , Jangho Kim , Hyunsin Park , Juntae Lee , Simyung Chang

Deep neural networks have achieved remarkable success in computer vision tasks. Existing neural networks mainly operate in the spatial domain with fixed input sizes. For practical applications, images are usually large and have to be…

Computer Vision and Pattern Recognition · Computer Science 2020-04-02 Kai Xu , Minghai Qin , Fei Sun , Yuhao Wang , Yen-Kuang Chen , Fengbo Ren

This paper proposes a Sub-band Convolutional Neural Network for spoken term classification. Convolutional neural networks (CNNs) have proven to be very effective in acoustic applications such as spoken term classification, keyword spotting,…

Audio and Speech Processing · Electrical Eng. & Systems 2019-07-03 Chieh-Chi Kao , Ming Sun , Yixin Gao , Shiv Vitaladevuni , Chao Wang

State-of-the-art sound event detection (SED) methods usually employ a series of convolutional neural networks (CNNs) to extract useful features from the input audio signal, and then recurrent neural networks (RNNs) to model longer temporal…

Regularization is a set of techniques that are used to improve the generalization ability of deep neural networks. In this paper, we introduce spectral batch normalization (SBN), a novel effective method to improve generalization by…

Computer Vision and Pattern Recognition · Computer Science 2023-06-30 Rinor Cakaj , Jens Mehnert , Bin Yang

Convolutional neural network (CNN) has achieved impressive success in computer vision during the past few decades. The image convolution operation helps CNNs to get good performance on image-related tasks. However, the image convolution has…

Computer Vision and Pattern Recognition · Computer Science 2022-07-21 Hengyue Pan , Yixin Chen , Xin Niu , Wenbo Zhou , Dongsheng Li

Deep Neural Network (DNN)-based image reconstruction, despite many successes, often exhibits uneven fidelity between high and low spatial frequency bands. In this paper we propose the Learning Synthesis by DNN (LS-DNN) approach where two…

Computer Vision and Pattern Recognition · Computer Science 2018-11-21 Mo Deng , Shuai Li , George Barbastathis

Hyperspectral images are crucial for many research works. Spectral super-resolution (SSR) is a method used to obtain high spatial resolution (HR) hyperspectral images from HR multispectral images. Traditional SSR methods include…

Image and Video Processing · Electrical Eng. & Systems 2020-12-09 Jiang He , Jie Li , Qiangqiang Yuan , Huanfeng Shen , Liangpei Zhang

We address a learning-to-normalize problem by proposing Switchable Normalization (SN), which learns to select different normalizers for different normalization layers of a deep neural network. SN employs three distinct scopes to compute…

Computer Vision and Pattern Recognition · Computer Science 2019-07-25 Ping Luo , Ruimao Zhang , Jiamin Ren , Zhanglin Peng , Jingyu Li

Convolutional neural networks have been widely applied to hyperspectral image classification. However, traditional convolutions can not effectively extract features for objects with irregular distributions. Recent methods attempt to address…

Computer Vision and Pattern Recognition · Computer Science 2023-04-07 Di Wang , Bo Du , Liangpei Zhang
‹ Prev 1 2 3 10 Next ›