Related papers: A Practical Guide to Spectrogram Analysis for Audi…

Pattern Recognition in Vital Signs Using Spectrograms

Spectrograms visualize the frequency components of a given signal, which may be an audio signal or even a time-series signal. Audio signals have a higher sampling rate and high variability of frequency with time. Spectrograms can capture such…

Signal Processing · Electrical Eng. & Systems 2021-09-06 Sidharth Srivatsav Sribhashyam, Md Sirajus Salekin, Dmitry Goldgof, Ghada Zamzmi, Mark Last, Yu Sun

SpectroMap: Peak detection algorithm for audio fingerprinting

Audio fingerprinting is a technique used to identify and match audio recordings based on their unique characteristics. It involves creating a condensed representation of an audio signal that can be used to quickly compare and match against…

Sound · Computer Science 2023-05-03 Aarón López-García

Spectrogram features for audio and speech analysis

Spectrogram-based representations have grown to dominate the feature space for deep learning audio analysis systems, and are often adopted for speech analysis also. Initially, the primary motivator for spectrogram-based representations was…

Audio and Speech Processing · Electrical Eng. & Systems 2026-03-17 Ian McLoughlin, Lam Pham, Yan Song, Xiaoxiao Miao, Huy Phan, Pengfei Cai, Qing Gu, Jiang Nan, Haoyu Song, Donny Soh

Exploiting Spectral Leakage for Spectrogram Frequency Super-resolution

The spectrogram is a classical DSP tool used to view signals in both time and frequency. Unfortunately, the Heisenberg Uncertainty Principle limits our ability to use them for detecting and measuring narrowband signal modulation in wideband…

Information Theory · Computer Science 2014-01-22 Ray Maleh, Frank A. Boyle

A Survey of Deep Learning for Complex Speech Spectrograms

Recent advancements in deep learning have significantly impacted the field of speech signal processing, particularly in the analysis and manipulation of complex spectrograms. This survey provides a comprehensive overview of the…

Audio and Speech Processing · Electrical Eng. & Systems 2025-10-06 Yuying Xie, Zheng-Hua Tan

Point Processes and spatial statistics in time-frequency analysis

A finite-energy signal is represented by a square-integrable, complex-valued function $t\mapsto s(t)$ of a real variable $t$, interpreted as time. Similarly, a noisy signal is represented by a random process. Time-frequency analysis, a…

Signal Processing · Electrical Eng. & Systems 2025-04-16 Barbara Pascal, Rémi Bardenet

Audio Spectrogram Representations for Processing with Convolutional Neural Networks

One of the decisions that arise when designing a neural network for any application is how the data should be represented in order to be presented to, and possibly generated by, a neural network. For audio, the choice is less obvious than…

Sound · Computer Science 2017-06-30 L. Wyse

Decoding Noise in Nanofluidic Systems: Adsorption versus Diffusion Signatures in Power Spectra

Adsorption processes play a fundamental role in molecular transport through nanofluidic systems, but their signatures in measured signals are often hard to distinguish from other processes like diffusion. In this paper, we derive an…

Soft Condensed Matter · Physics 2025-11-07 Anna Drummond Young, Alice L. Thorneywork, Sophie Marbach

An Investigation of the Effectiveness of Phase for Audio Classification

While log-amplitude mel-spectrogram has widely been used as the feature representation for processing speech based on deep learning, the effectiveness of another aspect of speech spectrum, i.e., phase information, was shown recently for…

Sound · Computer Science 2022-05-02 Shunsuke Hidaka, Kohei Wakamiya, Tokihiko Kaburagi

Signal processing and statistical methods in analysis of text and DNA

A number of signal processing and statistical methods can be used in analyzing either pieces of text or DNA sequences. These techniques can be used in a number of ways, such as determining authorship of documents, finding genes in DNA, and…

Data Analysis, Statistics and Probability · Physics 2007-05-23 Matthew J. Berryman, Andrew Allison, Pedro Carpena, Derek Abbott

Finite-Length and Asymptotic Analysis of Correlogram for Undersampled Data

This paper studies a spectrum estimation method for the case that the samples are obtained at a rate lower than the Nyquist rate. The method is referred to as the correlogram for undersampled data. The algorithm partitions the spectrum into…

Information Theory · Computer Science 2018-02-07 Mahdi Shaghaghi, Sergiy A. Vorobyov

Spectral Lineshape Measurements with Shot-Noise Limited Accuracy

Spectroscopy has played the key role in revealing, and thereby understanding, the structure of atoms and molecules. A central drive in this field is the pursuit of higher precision and accuracy so that ever more subtle effects might be…

Optics · Physics 2013-05-30 Gar-Wing Truong, James D. Anstie, Eric F. May, Thomas M. Stace, Andre N. Luiten

Sound Field Estimation: Theories and Applications

The spatial information of sound plays a crucial role in various situations, ranging from daily activities to advanced engineering technologies. To fully utilize its potential, numerous research studies on spatial audio signal processing…

Audio and Speech Processing · Electrical Eng. & Systems 2025-03-14 Natsuki Ueno, Shoichi Koyama

Comparison Performance of Spectrogram and Scalogram as Input of Acoustic Recognition Task

Acoustic recognition has emerged as a prominent task in deep learning research, frequently utilizing spectral feature extraction techniques such as the spectrogram from the Short-Time Fourier Transform and the scalogram from the Wavelet…

Audio and Speech Processing · Electrical Eng. & Systems 2025-12-01 Dang Thoai Phan

Audio-noise Power Spectral Density Estimation Using Long Short-term Memory

We propose a method using a long short-term memory (LSTM) network to estimate the noise power spectral density (PSD) of single-channel audio signals represented in the short time Fourier transform (STFT) domain. An LSTM network common to…

Signal Processing · Electrical Eng. & Systems 2020-11-11 Xiaofei Li, Simon Leglaive, Laurent Girin, Radu Horaud

Deep Learning for Audio Signal Processing

Given the recent surge in developments of deep learning, this article provides a review of the state-of-the-art deep learning techniques for audio signal processing. Speech, music, and environmental sound processing are considered…

Sound · Computer Science 2019-05-28 Hendrik Purwins, Bo Li, Tuomas Virtanen, Jan Schlüter, Shuo-yiin Chang, Tara Sainath

Spectral Power Parameter Estimation of Random Sources with Binary Sampled Signals

This paper investigates the problem of estimating the spectral power parameters of random analog sources using numerical measurements acquired with minimum digitization complexity. Therefore, spectral analysis has to be performed with…

Signal Processing · Electrical Eng. & Systems 2019-10-29 Manuel S. Stein

Feature Learning from Spectrograms for Assessment of Personality Traits

Several methods have recently been proposed to analyze speech and automatically infer the personality of the speaker. These methods often rely on prosodic and other hand-crafted speech processing features extracted with off-the-shelf…

Computer Vision and Pattern Recognition · Computer Science 2022-05-10 Marc-André Carbonneau, Eric Granger, Yazid Attabi, Ghyslain Gagnon

SpectNet : End-to-End Audio Signal Classification Using Learnable Spectrograms

Pattern recognition from audio signals is an active research topic encompassing audio tagging, acoustic scene classification, music classification, and other areas. Spectrogram and mel-frequency cepstral coefficients (MFCC) are among the…

Audio and Speech Processing · Electrical Eng. & Systems 2022-11-18 Md. Istiaq Ansari, Taufiq Hasan

The Statistics of the Cross-Spectrum and the Spectrum Average: Generalization to Multiple Instruments

This article addresses the measurement of the power spectrum of red noise processes at the lowest frequencies, where the minimum acquisition time is so long that it is impossible to average over a sequence of data records. Therefore, averaging…

Data Analysis, Statistics and Probability · Physics 2022-06-30 Antoine Baudiquez, Éric Lantz, Enrico Rubiola, François Vernotte