Related papers: An auditory cortex model for sound processing

A bio-inspired geometric model for sound reconstruction

The reconstruction mechanisms built by the human auditory system during sound reconstruction are still a matter of debate. The purpose of this study is to propose a mathematical model of sound reconstruction based on the functional…

Audio and Speech Processing · Electrical Eng. & Systems 2020-10-20 Ugo Boscain , Dario Prandi , Ludovic Sacchelli , Giuseppina Turco

Reverse the auditory processing pathway: Coarse-to-fine audio reconstruction from fMRI

Drawing inspiration from the hierarchical processing of the human auditory system, which transforms sound from low-level acoustic features to high-level semantic understanding, we introduce a novel coarse-to-fine audio reconstruction…

Sound · Computer Science 2024-05-30 Che Liu , Changde Du , Xiaoyu Chen , Huiguang He

Flexible framework for audio reconstruction

The paper presents a unified, flexible framework for the tasks of audio inpainting, declipping, and dequantization. The concept is further extended to cover analogous degradation models in a transformed domain, e.g. quantization of the…

Audio and Speech Processing · Electrical Eng. & Systems 2021-01-05 Ondřej Mokrý , Pavel Rajmic , Pavel Záviška

Treatise on Hearing: The Temporal Auditory Imaging Theory Inspired by Optics and Communication

A new theory of mammalian hearing is presented, which accounts for the auditory image in the midbrain (inferior colliculus) of objects in the acoustical environment of the listener. It is shown that the ear is a temporal imaging system that…

Neurons and Cognition · Quantitative Biology 2024-06-04 Adam Weisser

Learning audio representations via phase prediction

We learn audio representations by solving a novel self-supervised learning task, which consists of predicting the phase of the short-time Fourier transform from its magnitude. A convolutional encoder is used to map the magnitude spectrum of…

Audio and Speech Processing · Electrical Eng. & Systems 2019-10-29 Félix de Chaumont Quitry , Marco Tagliasacchi , Dominik Roblek

Sound reconstruction from human brain activity via a generative model with brain-like auditory features

The successful reconstruction of perceptual experiences from human brain activity has provided insights into the neural representations of sensory experiences. However, reconstructing arbitrary sounds has been avoided due to the complexity…

Sound · Computer Science 2023-06-21 Jong-Yun Park , Mitsuaki Tsukamoto , Misato Tanaka , Yukiyasu Kamitani

A Reconstruction Algorithm for Photoacoustic Imaging based on the Nonuniform FFT

Fourier reconstruction algorithms significantly outperform conventional back-projection algorithms in terms of computation time. In photoacoustic imaging, these methods require interpolation in the Fourier space domain, which creates…

Numerical Analysis · Mathematics 2016-11-17 M. Haltmeier , O. Scherzer , G. Zangerl

A convolutional plane wave model for sound field reconstruction

Spatial sound field interpolation relies on suitable models to both conform to available measurements and predict the sound field in the domain of interest. A suitable model can be difficult to determine when the spatial domain of interest…

Audio and Speech Processing · Electrical Eng. & Systems 2022-11-30 Manuel Hahmann , Efren Fernandez-Grande

Complex Image Generation SwinTransformer Network for Audio Denoising

Achieving high-performance audio denoising is still a challenging task in real-world applications. Existing time-frequency methods often ignore the quality of generated frequency domain images. This paper converts the audio denoising…

Sound · Computer Science 2023-10-26 Youshan Zhang , Jialu Li

Image Reconstruction via Variational Network for Real-Time Hand-Held Sound-Speed Imaging

Speed-of-sound is a biomechanical property for quantitative tissue differentiation, with great potential as a new ultrasound-based image modality. A conventional ultrasound array transducer can be used together with an acoustic mirror, or…

Computer Vision and Pattern Recognition · Computer Science 2018-07-20 Valery Vishnevskiy , Sergio J Sanabria , Orcun Goksel

Learning to Denoise Historical Music

We propose an audio-to-audio neural network model that learns to denoise old music recordings. Our model internally converts its input into a time-frequency representation by means of a short-time Fourier transform (STFT), and processes the…

Audio and Speech Processing · Electrical Eng. & Systems 2022-06-17 Yunpeng Li , Beat Gfeller , Marco Tagliasacchi , Dominik Roblek

Sequential image recovery from noisy and under-sampled Fourier data

A new algorithm is developed to jointly recover a temporal sequence of images from noisy and under-sampled Fourier data. Specifically, we consider the case where each data set is missing vital information that prevents its (individual)…

Numerical Analysis · Mathematics 2022-05-13 Yao Xiao , Jan Glaubitz , Anne Gelb , Guohui Song

Audio Decoding by Inverse Problem Solving

We consider audio decoding as an inverse problem and solve it through diffusion posterior sampling. Explicit conditioning functions are developed for input signal measurements provided by an example of a transform domain perceptual audio…

Audio and Speech Processing · Electrical Eng. & Systems 2024-09-13 Pedro J. Villasana T. , Lars Villemoes , Janusz Klejsa , Per Hedelin

How to train your ears: Auditory-model emulation for large-dynamic-range inputs and mild-to-severe hearing losses

Advanced auditory models are useful in designing signal-processing algorithms for hearing-loss compensation or speech enhancement. Such auditory models provide rich and detailed descriptions of the auditory pathway, and might allow for…

Audio and Speech Processing · Electrical Eng. & Systems 2024-03-18 Peter Leer , Jesper Jensen , Zheng-Hua Tan , Jan Østergaard , Lars Bramsløw

Photoacoustic image reconstruction via deep learning

Applying standard algorithms to sparse data problems in photoacoustic tomography (PAT) yields low-quality images containing severe under-sampling artifacts. To some extent, these artifacts can be reduced by iterative image reconstruction…

Numerical Analysis · Mathematics 2024-12-20 Stephan Antholzer , Johannes Schwab , Robert Nuster , Markus Haltmeier

Phase retrieval for wavelet transforms

We describe a new algorithm to solve a particular phase retrieval problem, that has wide applications in audio processing: the reconstruction of a function from its scalogram, that is from the modulus of its wavelet transform. It is a…

Optimization and Control · Mathematics 2017-04-11 Irène Waldspurger

Time Domain Neural Audio Style Transfer

A recently published method for audio style transfer has shown how to extend the process of image style transfer to audio. This method synthesizes audio "content" and "style" independently using the magnitudes of a short time Fourier…

Sound · Computer Science 2017-12-01 Parag K. Mital

Complex Image-Generative Diffusion Transformer for Audio Denoising

The audio denoising technique has captured widespread attention in the deep neural network field. Recently, the audio denoising problem has been converted into an image generation task, and deep learning-based approaches have been applied…

Sound · Computer Science 2024-06-14 Junhui Li , Pu Wang , Jialu Li , Youshan Zhang

Reconstructing seen images from human brain activity via guided stochastic search

Visual reconstruction algorithms are an interpretive tool that map brain activity to pixels. Past reconstruction algorithms employed brute-force search through a massive library to select candidate images that, when passed through an…

Neurons and Cognition · Quantitative Biology 2023-05-03 Reese Kneeland , Jordyn Ojeda , Ghislain St-Yves , Thomas Naselaris

Regularized autoregressive modeling and its application to audio signal reconstruction

Autoregressive (AR) modeling is invaluable in signal processing, in particular in speech and audio fields. Attempts in the literature can be found that regularize or constrain either the time-domain signal values or the AR coefficients,…

Audio and Speech Processing · Electrical Eng. & Systems 2026-02-06 Ondřej Mokrý , Pavel Rajmic