English
Related papers

Related papers: A Practical Guide to Spectrogram Analysis for Audi…

200 papers

Spectrograms visualize the frequency components of a given signal which may be an audio signal or even a time-series signal. Audio signals have higher sampling rate and high variability of frequency with time. Spectrograms can capture such…

Signal Processing · Electrical Eng. & Systems 2021-09-06 Sidharth Srivatsav Sribhashyam , Md Sirajus Salekin , Dmitry Goldgof , Ghada Zamzmi , Mark Last , Yu Sun

Audio fingerprinting is a technique used to identify and match audio recordings based on their unique characteristics. It involves creating a condensed representation of an audio signal that can be used to quickly compare and match against…

Sound · Computer Science 2023-05-03 Aarón López-García

Spectrogram-based representations have grown to dominate the feature space for deep learning audio analysis systems, and are often adopted for speech analysis also. Initially, the primary motivator for spectrogram-based representations was…

Audio and Speech Processing · Electrical Eng. & Systems 2026-03-17 Ian McLoughlin , Lam Pham , Yan Song , Xiaoxiao Miao , Huy Phan , Pengfei Cai , Qing Gu , Jiang Nan , Haoyu Song , Donny Soh

The spectrogram is a classical DSP tool used to view signals in both time and frequency. Unfortunately, the Heisenberg Uncertainty Principal limits our ability to use them for detecting and measuring narrowband signal modulation in wideband…

Information Theory · Computer Science 2014-01-22 Ray Maleh , Frank A. Boyle

Recent advancements in deep learning have significantly impacted the field of speech signal processing, particularly in the analysis and manipulation of complex spectrograms. This survey provides a comprehensive overview of the…

Audio and Speech Processing · Electrical Eng. & Systems 2025-10-06 Yuying Xie , Zheng-Hua Tan

A finite-energy signal is represented by a square-integrable, complex-valued function $t\mapsto s(t)$ of a real variable $t$, interpreted as time. Similarly, a noisy signal is represented by a random process. Time-frequency analysis, a…

Signal Processing · Electrical Eng. & Systems 2025-04-16 Barbara Pascal , Rémi Bardenet

One of the decisions that arise when designing a neural network for any application is how the data should be represented in order to be presented to, and possibly generated by, a neural network. For audio, the choice is less obvious than…

Sound · Computer Science 2017-06-30 L. Wyse

Adsorption processes play a fundamental role in molecular transport through nanofluidic systems, but their signatures in measured signals are often hard to distinguish from other processes like diffusion. In this paper, we derive an…

Soft Condensed Matter · Physics 2025-11-07 Anna Drummond Young , Alice L. Thorneywork , Sophie Marbach

While log-amplitude mel-spectrogram has widely been used as the feature representation for processing speech based on deep learning, the effectiveness of another aspect of speech spectrum, i.e., phase information, was shown recently for…

Sound · Computer Science 2022-05-02 Shunsuke Hidaka , Kohei Wakamiya , Tokihiko Kaburagi

A number of signal processing and statistical methods can be used in analyzing either pieces of text or DNA sequences. These techniques can be used in a number of ways, such as determining authorship of documents, finding genes in DNA, and…

Data Analysis, Statistics and Probability · Physics 2007-05-23 Matthew J. Berryman , Andrew Allison , Pedro Carpena , Derek Abbott

This paper studies a spectrum estimation method for the case that the samples are obtained at a rate lower than the Nyquist rate. The method is referred to as the correlogram for undersampled data. The algorithm partitions the spectrum into…

Information Theory · Computer Science 2018-02-07 Mahdi Shaghaghi , Sergiy A. Vorobyov

Spectroscopy has played the key role in revealing, and thereby understanding, the structure of atoms and molecules. A central drive in this field is the pursuit of higher precision and accuracy so that ever more subtle effects might be…

The spatial information of sound plays a crucial role in various situations, ranging from daily activities to advanced engineering technologies. To fully utilize its potential, numerous research studies on spatial audio signal processing…

Audio and Speech Processing · Electrical Eng. & Systems 2025-03-14 Natsuki Ueno , Shoichi Koyama

Acoustic recognition has emerged as a prominent task in deep learning research, frequently utilizing spectral feature extraction techniques such as the spectrogram from the Short-Time Fourier Transform and the scalogram from the Wavelet…

Audio and Speech Processing · Electrical Eng. & Systems 2025-12-01 Dang Thoai Phan

We propose a method using a long short-term memory (LSTM) network to estimate the noise power spectral density (PSD) of single-channel audio signals represented in the short time Fourier transform (STFT) domain. An LSTM network common to…

Signal Processing · Electrical Eng. & Systems 2020-11-11 Xiaofei Li , Simon Leglaive , Laurent Girin , Radu Horaud

Given the recent surge in developments of deep learning, this article provides a review of the state-of-the-art deep learning techniques for audio signal processing. Speech, music, and environmental sound processing are considered…

Sound · Computer Science 2019-05-28 Hendrik Purwins , Bo Li , Tuomas Virtanen , Jan Schlüter , Shuo-yiin Chang , Tara Sainath

This paper investigates the problem of estimating the spectral power parameters of random analog sources using numerical measurements acquired with minimum digitization complexity. Therefore, spectral analysis has to be performed with…

Signal Processing · Electrical Eng. & Systems 2019-10-29 Manuel S. Stein

Several methods have recently been proposed to analyze speech and automatically infer the personality of the speaker. These methods often rely on prosodic and other hand crafted speech processing features extracted with off-the-shelf…

Computer Vision and Pattern Recognition · Computer Science 2022-05-10 Marc-André Carbonneau , Eric Granger , Yazid Attabi , Ghyslain Gagnon

Pattern recognition from audio signals is an active research topic encompassing audio tagging, acoustic scene classification, music classification, and other areas. Spectrogram and mel-frequency cepstral coefficients (MFCC) are among the…

Audio and Speech Processing · Electrical Eng. & Systems 2022-11-18 Md. Istiaq Ansari , Taufiq Hasan

This article addresses the measurement of the power spectrum of red noise processes at the lowest frequencies, where the minimum acquisition time is so long that it is impossible to average on a sequence of data record. Therefore, averaging…

Data Analysis, Statistics and Probability · Physics 2022-06-30 Antoine Baudiquez , Éric Lantz , Enrico Rubiola , François Vernotte
‹ Prev 1 2 3 10 Next ›