Related papers: Decoding EEG Speech Perception with Transformers a…

Constrained Variational Autoencoder for improving EEG based Speech Recognition Systems

In this paper we introduce a recurrent neural network (RNN) based variational autoencoder (VAE) model with a new constrained loss function that can generate more meaningful electroencephalography (EEG) features from raw EEG features to…

Audio and Speech Processing · Electrical Eng. & Systems 2020-06-05 Gautam Krishna , Co Tran , Mason Carnahan , Ahmed Tewfik

Decoding Envelope and Frequency-Following EEG Responses to Continuous Speech Using Deep Neural Networks

The electroencephalogram (EEG) offers a non-invasive means by which a listener's auditory system may be monitored during continuous speech perception. Reliable auditory-EEG decoders could facilitate the objective diagnosis of hearing…

Audio and Speech Processing · Electrical Eng. & Systems 2023-12-18 Mike Thornton , Danilo Mandic , Tobias Reichenbach

Towards Linguistic Neural Representation Learning and Sentence Retrieval from Electroencephalogram Recordings

Decoding linguistic information from non-invasive brain signals using EEG has gained increasing research attention due to its vast applicational potential. Recently, a number of works have adopted a generative-based framework to decode…

Computation and Language · Computer Science 2024-08-12 Jinzhao Zhou , Yiqun Duan , Ziyi Zhao , Yu-Cheng Chang , Yu-Kai Wang , Thomas Do , Chin-Teng Lin

Scaling Law in Neural Data: Non-Invasive Speech Decoding with 175 Hours of EEG Data

Brain-computer interfaces (BCIs) hold great potential for aiding individuals with speech impairments. Utilizing electroencephalography (EEG) to decode speech is particularly promising due to its non-invasive nature. However, recordings are…

Neurons and Cognition · Quantitative Biology 2024-07-11 Motoshige Sato , Kenichi Tomeoka , Ilya Horiguchi , Kai Arulkumaran , Ryota Kanai , Shuntaro Sasai

A Silent Speech Decoding System from EEG and EMG with Heterogenous Electrode Configurations

Silent speech decoding, which performs unvocalized human speech recognition from electroencephalography/electromyography (EEG/EMG), increases accessibility for speech-impaired humans. However, data collection is difficult and performed…

Quantitative Methods · Quantitative Biology 2025-06-18 Masakazu Inoue , Motoshige Sato , Kenichi Tomeoka , Nathania Nah , Eri Hatakeyama , Kai Arulkumaran , Ilya Horiguchi , Shuntaro Sasai

Audio-visual Speech Enhancement Using Conditional Variational Auto-Encoders

Variational auto-encoders (VAEs) are deep generative latent variable models that can be used for learning the distribution of complex data. VAEs have been successfully used to learn a probabilistic prior over speech signals, which is then…

Sound · Computer Science 2020-12-18 Mostafa Sadeghi , Simon Leglaive , Xavier Alameda-PIneda , Laurent Girin , Radu Horaud

A Benchmark of Dynamical Variational Autoencoders applied to Speech Spectrogram Modeling

The Variational Autoencoder (VAE) is a powerful deep generative model that is now extensively used to represent high-dimensional complex data via a low-dimensional latent space learned in an unsupervised manner. In the original VAE model,…

Sound · Computer Science 2021-06-15 Xiaoyu Bie , Laurent Girin , Simon Leglaive , Thomas Hueber , Xavier Alameda-Pineda

Eeg2vec: Self-Supervised Electroencephalographic Representation Learning

Recently, many efforts have been made to explore how the brain processes speech using electroencephalographic (EEG) signals, where deep learning-based approaches were shown to be applicable in this field. In order to decode speech signals…

Audio and Speech Processing · Electrical Eng. & Systems 2023-05-24 Qiushi Zhu , Xiaoying Zhao , Jie Zhang , Yu Gu , Chao Weng , Yuchen Hu

Improving Speech Decoding from ECoG with Self-Supervised Pretraining

Recent work on intracranial brain-machine interfaces has demonstrated that spoken speech can be decoded with high accuracy, essentially by treating the problem as an instance of supervised learning and training deep neural networks to map…

Neurons and Cognition · Quantitative Biology 2024-05-30 Brian A. Yuan , Joseph G. Makin

Enhancing Listened Speech Decoding from EEG via Parallel Phoneme Sequence Prediction

Brain-computer interfaces (BCI) offer numerous human-centered application possibilities, particularly affecting people with neurological disorders. Text or speech decoding from brain activities is a relevant domain that could augment the…

Audio and Speech Processing · Electrical Eng. & Systems 2025-01-10 Jihwan Lee , Tiantian Feng , Aditya Kommineni , Sudarsana Reddy Kadiri , Shrikanth Narayanan

Variational decomposition autoencoding improves disentanglement of latent representations

Understanding the structure of complex, nonstationary, high-dimensional time-evolving signals is a central challenge in scientific data analysis. In many domains, such as speech and biomedical signal processing, the ability to learn…

Machine Learning · Computer Science 2026-01-13 Ioannis Ziogas , Aamna Al Shehhi , Ahsan H. Khandoker , Leontios J. Hadjileontiadis

A Survey on Bridging EEG Signals and Generative AI: From Image and Text to Beyond

Decoding neural activity into human-interpretable representations is a key research direction in brain-computer interfaces (BCIs) and computational neuroscience. Recent progress in machine learning and generative AI has driven growing…

Artificial Intelligence · Computer Science 2025-12-02 Shreya Shukla , Jose Torres , Akshaj Murhekar , Christina Liu , Abhijit Mishra , Jacek Gwizdka , Shounak Roychowdhury

Relating EEG recordings to speech using envelope tracking and the speech-FFR

During speech perception, a listener's electroencephalogram (EEG) reflects acoustic-level processing as well as higher-level cognitive factors such as speech comprehension and attention. However, decoding speech from EEG recordings is…

Audio and Speech Processing · Electrical Eng. & Systems 2023-03-14 Mike Thornton , Danilo Mandic , Tobias Reichenbach

Bridging Brain Signals and Language: A Deep Learning Approach to EEG-to-Text Decoding

Brain activity translation into human language delivers the capability to revolutionize machine-human interaction while providing communication support to people with speech disability. Electronic decoding reaches a certain level of…

Signal Processing · Electrical Eng. & Systems 2025-02-26 Mostafa El Gedawy , Omnia Nabil , Omar Mamdouh , Mahmoud Nady , Nour Alhuda Adel , Ahmed Fares

A Statistically Principled and Computationally Efficient Approach to Speech Enhancement using Variational Autoencoders

Recent studies have explored the use of deep generative models of speech spectra based of variational autoencoders (VAEs), combined with unsupervised noise models, to perform speech enhancement. These studies developed iterative algorithms…

Sound · Computer Science 2019-05-15 Manuel Pariente , Antoine Deleforge , Emmanuel Vincent

Learning Subject-Invariant Representations from Speech-Evoked EEG Using Variational Autoencoders

The electroencephalogram (EEG) is a powerful method to understand how the brain processes speech. Linear models have recently been replaced for this purpose with deep neural networks and yield promising results. In related EEG…

Audio and Speech Processing · Electrical Eng. & Systems 2022-07-25 Lies Bollens , Tom Francart , Hugo Van Hamme

Decoding non-invasive brain activity with novel deep-learning approaches

This thesis delves into the world of non-invasive electrophysiological brain signals like electroencephalography (EEG) and magnetoencephalography (MEG), focusing on modelling and decoding such data. The research aims to investigate what…

Signal Processing · Electrical Eng. & Systems 2025-10-30 Richard Csaky

Unsupervised Speech Enhancement using Dynamical Variational Auto-Encoders

Dynamical variational autoencoders (DVAEs) are a class of deep generative models with latent variables, dedicated to model time series of high-dimensional data. DVAEs can be considered as extensions of the variational autoencoder (VAE) that…

Sound · Computer Science 2022-10-04 Xiaoyu Bie , Simon Leglaive , Xavier Alameda-Pineda , Laurent Girin

Variational Autoencoder for Speech Enhancement with a Noise-Aware Encoder

Recently, a generative variational autoencoder (VAE) has been proposed for speech enhancement to model speech statistics. However, this approach only uses clean speech in the training phase, making the estimation particularly sensitive to…

Audio and Speech Processing · Electrical Eng. & Systems 2021-05-18 Huajian Fang , Guillaume Carbajal , Stefan Wermter , Timo Gerkmann

EEG-to-Voice Decoding of Spoken and Imagined speech Using Non-Invasive EEG

Restoring speech communication from neural signals is a central goal of brain-computer interface research, yet EEG-based speech reconstruction remains challenging due to limited spatial resolution, susceptibility to noise, and the absence…

Signal Processing · Electrical Eng. & Systems 2025-12-30 Hanbeot Park , Yunjeong Cho , Hunhee Kim