Related papers: Audio Classification from Time-Frequency Texture

Audio Classification of Low Feature Spectrograms Utilizing Convolutional Neural Networks

Modern day audio signal classification techniques lack the ability to classify low feature audio signals in the form of spectrographic temporal frequency data representations. Additionally, currently utilized techniques rely on full diverse…

Sound · Computer Science 2024-10-30 Noel Elias

Audio Texture Synthesis with Scattering Moments

We introduce an audio texture synthesis algorithm based on scattering moments. A scattering transform is computed by iteratively decomposing a signal with complex wavelet filter banks and computing their amplitude envelop. Scattering…

Applications · Statistics 2013-11-05 Joan Bruna , Stéphane Mallat

Time-Frequency Audio Features for Speech-Music Classification

Distinct striation patterns are observed in the spectrograms of speech and music. This motivated us to propose three novel time-frequency features for speech-music classification. These features are extracted in two stages. First, a preset…

Audio and Speech Processing · Electrical Eng. & Systems 2018-11-06 Mrinmoy Bhattacharjee , S. R. M. Prasanna , Prithwijit Guha

Sound texture synthesis using RI spectrograms

This article introduces a new parametric synthesis method for sound textures based on existing works in visual and sound texture synthesis. Starting from a base sound signal, an optimization process is performed until the cross-correlations…

Sound · Computer Science 2019-10-22 Hugo Caracalla , Axel Roebel

Learning Temporal Resolution in Spectrogram for Audio Classification

The audio spectrogram is a time-frequency representation that has been widely used for audio classification. One of the key attributes of the audio spectrogram is the temporal resolution, which depends on the hop size used in the Short-Time…

Sound · Computer Science 2024-01-15 Haohe Liu , Xubo Liu , Qiuqiang Kong , Wenwu Wang , Mark D. Plumbley

A Lightweight Music Texture Transfer System

Deep learning researches on the transformation problems for image and text have raised great attention. However, present methods for music feature transfer using neural networks are far from practical application. In this paper, we initiate…

Sound · Computer Science 2021-08-05 Xutan Peng , Chen Li , Zhi Cai , Faqiang Shi , Yidan Liu , Jianxin Li

MTCRNN: A multi-scale RNN for directed audio texture synthesis

Audio textures are a subset of environmental sounds, often defined as having stable statistical characteristics within an adequately large window of time but may be unstructured locally. They include common everyday sounds such as from…

Sound · Computer Science 2020-11-26 M. Huzaifah , L. Wyse

Learning Visual Styles from Audio-Visual Associations

From the patter of rain to the crunch of snow, the sounds we hear often convey the visual textures that appear within a scene. In this paper, we present a method for learning visual styles from unlabeled audio-visual data. Our model learns…

Computer Vision and Pattern Recognition · Computer Science 2022-05-11 Tingle Li , Yichen Liu , Andrew Owens , Hang Zhao

What You Hear Is What You See: Audio Quality Metrics From Image Quality Metrics

In this study, we investigate the feasibility of utilizing state-of-the-art image perceptual metrics for evaluating audio signals by representing them as spectrograms. The encouraging outcome of the proposed approach is based on the…

Sound · Computer Science 2023-08-31 Tashi Namgyal , Alexander Hepburn , Raul Santos-Rodriguez , Valero Laparra , Jesus Malo

Audio Texture Manipulation by Exemplar-Based Analogy

Audio texture manipulation involves modifying the perceptual characteristics of a sound to achieve specific transformations, such as adding, removing, or replacing auditory elements. In this paper, we propose an exemplar-based analogy model…

Sound · Computer Science 2025-01-22 Kan Jen Cheng , Tingle Li , Gopala Anumanchipalli

Applying Visual Domain Style Transfer and Texture Synthesis Techniques to Audio - Insights and Challenges

Style transfer is a technique for combining two images based on the activations and feature statistics in a deep learning neural network architecture. This paper studies the analogous task in the audio domain and takes a critical look at…

Sound · Computer Science 2020-08-10 M. Huzaifah , L. Wyse

A Compact and Discriminative Feature Based on Auditory Summary Statistics for Acoustic Scene Classification

One of the biggest challenges of acoustic scene classification (ASC) is to find proper features to better represent and characterize environmental sounds. Environmental sounds generally involve more sound sources while exhibiting less…

Sound · Computer Science 2019-04-11 Hongwei Song , Jiqing Han , Shiwen Deng

On Time-frequency Scattering and Computer Music

Time-frequency scattering is a mathematical transformation of sound waves. Its core purpose is to mimick the way the human auditory system extracts information from its environment. In the context of improving the artificial intelligence of…

Sound · Computer Science 2019-05-22 Vincent Lostanlen

Audio style transfer

'Style transfer' among images has recently emerged as a very active research topic, fuelled by the power of convolution neural networks (CNNs), and has become fast a very popular technology in social media. This paper investigates the…

Sound · Computer Science 2019-04-29 Eric Grinstein , Ngoc Duong , Alexey Ozerov , Patrick Pérez

A Novel Approach to Texture classification using statistical feature

Texture is an important spatial feature which plays a vital role in content based image retrieval. The enormous growth of the internet and the wide use of digital data have increased the need for both efficient image database creation and…

Computer Vision and Pattern Recognition · Computer Science 2011-11-11 B. Vijayalakshmi , V. Subbiah Bharathi

Sound texture synthesis using convolutional neural networks

The following article introduces a new parametric synthesis algorithm for sound textures inspired by existing methods used for visual textures. Using a 2D Convolutional Neural Network (CNN), a sound signal is modified until the temporal…

Sound · Computer Science 2019-05-10 Hugo Caracalla , Axel Roebel

Histogram of gradients of Time-Frequency Representations for Audio scene detection

This paper addresses the problem of audio scenes classification and contributes to the state of the art by proposing a novel feature. We build this feature by considering histogram of gradients (HOG) of time-frequency representation of an…

Sound · Computer Science 2015-08-21 Alain Rakotomamonjy , Gilles Gasso

Texture Selection for Automatic Music Genre Classification

Music Genre Classification is the problem of associating genre-related labels to digitized music tracks. It has applications in the organization of commercial and personal music collections. Often, music tracks are described as a set of…

Sound · Computer Science 2020-03-12 Juliano H. Foleiss , Tiago F. Tavares

Joint Time-Frequency Scattering for Audio Classification

We introduce the joint time-frequency scattering transform, a time shift invariant descriptor of time-frequency structure for audio classification. It is obtained by applying a two-dimensional wavelet transform in time and log-frequency to…

Sound · Computer Science 2018-08-06 Joakim Andén , Vincent Lostanlen , Stéphane Mallat

Automatic Instrument Recognition in Polyphonic Music Using Convolutional Neural Networks

Traditional methods to tackle many music information retrieval tasks typically follow a two-step architecture: feature engineering followed by a simple learning algorithm. In these "shallow" architectures, feature engineering and learning…

Sound · Computer Science 2015-11-18 Peter Li , Jiyuan Qian , Tian Wang