English
Related papers

Related papers: Robust Multi-channel Speech Recognition using Freq…

200 papers

Conventional far-field automatic speech recognition (ASR) systems typically employ microphone array techniques for speech enhancement in order to improve robustness against noise or reverberation. However, such speech enhancement techniques…

Audio and Speech Processing · Electrical Eng. & Systems 2021-12-23 Minhua Wu , Kenichi Kumatani , Shiva Sundaram , Nikko Strom , Bjorn Hoffmeister

The use of spatial information with multiple microphones can improve far-field automatic speech recognition (ASR) accuracy. However, conventional microphone array techniques degrade speech enhancement performance when there is an array…

Audio and Speech Processing · Electrical Eng. & Systems 2021-12-23 Kenichi Kumatani , Minhua Wu , Shiva Sundaram , Nikko Strom , Bjorn Hoffmeister

To achieve robust far-field automatic speech recognition (ASR), existing techniques typically employ an acoustic front end (AFE) cascaded with a neural transducer (NT) ASR model. The AFE output, however, could be unreliable, as the…

The machine recognition of speech spoken at a distance from the microphones, known as far-field automatic speech recognition (ASR), has received a significant increase of attention in science and industry, which caused or was caused by an…

Audio and Speech Processing · Electrical Eng. & Systems 2020-09-22 Reinhold Haeb-Umbach , Jahn Heymann , Lukas Drude , Shinji Watanabe , Marc Delcroix , Tomohiro Nakatani

This paper proposes a flexible multichannel speech enhancement system with the main goal of improving robustness of automatic speech recognition (ASR) in noisy conditions. The proposed system combines a flexible neural mask estimator…

Audio and Speech Processing · Electrical Eng. & Systems 2024-06-10 Ante Jukić , Jagadeesh Balam , Boris Ginsburg

In this work, we investigated the teacher-student training paradigm to train a fully learnable multi-channel acoustic model for far-field automatic speech recognition (ASR). Using a large offline teacher model trained on beamformed audio,…

Sound · Computer Science 2020-05-05 Sanna Wager , Aparna Khare , Minhua Wu , Kenichi Kumatani , Shiva Sundaram

Joint optimization of multi-channel front-end and automatic speech recognition (ASR) has attracted much interest. While promising results have been reported for various tasks, past studies on its meeting transcription application were…

Audio and Speech Processing · Electrical Eng. & Systems 2020-11-30 Xiaofei Wang , Naoyuki Kanda , Yashesh Gaur , Zhuo Chen , Zhong Meng , Takuya Yoshioka

Far-field speech recognition is a challenging task that conventionally uses signal processing beamforming to attack noise and interference problem. But the performance has been found usually limited due to heavy reliance on environmental…

Audio and Speech Processing · Electrical Eng. & Systems 2024-01-08 Dongdi Zhao , Jianbo Ma , Lu Lu , Jinke Li , Xuan Ji , Lei Zhu , Fuming Fang , Ming Liu , Feijun Jiang

This paper describes multichannel speech enhancement for improving automatic speech recognition (ASR) in noisy environments. Recently, the minimum variance distortionless response (MVDR) beamforming has widely been used because it works…

Automatic speech recognition (ASR) in multichannel, multi-speaker scenarios remains challenging due to ambient noise, reverberation and overlapping speakers. In this paper, we propose a beamforming approach that processes specific angular…

Sound · Computer Science 2025-09-15 Can Cui , Paul Magron , Mostafa Sadeghi , Emmanuel Vincent

This paper describes noisy speech recognition for an augmented reality headset that helps verbal communication within real multiparty conversational environments. A major approach that has actively been studied in simulated environments is…

Audio and Speech Processing · Electrical Eng. & Systems 2022-07-18 Yicheng Du , Aditya Arie Nugraha , Kouhei Sekiguchi , Yoshiaki Bando , Mathieu Fontaine , Kazuyoshi Yoshii

Beamforming has been extensively investigated for multi-channel audio processing tasks. Recently, learning-based beamforming methods, sometimes called \textit{neural beamformers}, have achieved significant improvements in both signal…

Audio and Speech Processing · Electrical Eng. & Systems 2019-10-02 Yi Luo , Enea Ceolini , Cong Han , Shih-Chii Liu , Nima Mesgarani

Automatic Speech Recognition (ASR) has shown remarkable progress, yet it still faces challenges in real-world distant scenarios across various array topologies each with multiple recording devices. The focal point of the CHiME-7 Distant ASR…

Sound · Computer Science 2023-12-18 Bingshen Mu , Pengcheng Guo , Dake Guo , Pan Zhou , Wei Chen , Lei Xie

Neural speech separation has made remarkable progress and its integration with automatic speech recognition (ASR) is an important direction towards realizing multi-speaker ASR. This work provides an insightful investigation of speech…

It has been shown that the intelligibility of noisy speech can be improved by speech enhancement algorithms. However, speech enhancement has not been established as an effective frontend for robust automatic speech recognition (ASR) in…

Audio and Speech Processing · Electrical Eng. & Systems 2023-06-22 Yufeng Yang , Ashutosh Pandey , DeLiang Wang

Speech representation and modelling in high-dimensional spaces of acoustic waveforms, or a linear transformation thereof, is investigated with the aim of improving the robustness of automatic speech recognition to additive noise. The…

Computation and Language · Computer Science 2015-03-31 Matthew Ager , Zoran Cvetkovic , Peter Sollich

A stream attention framework has been applied to the posterior probabilities of the deep neural network (DNN) to improve the far-field automatic speech recognition (ASR) performance in the multi-microphone configuration. The stream…

Sound · Computer Science 2017-12-01 Xiaofei Wang , Yonghong Yan , Hynek Hermansky

The front-end module in multi-channel automatic speech recognition (ASR) systems mainly use microphone array techniques to produce enhanced signals in noisy conditions with reverberation and echos. Recently, neural network (NN) based…

Sound · Computer Science 2020-11-19 Yuxiang Kong , Jian Wu , Quandong Wang , Peng Gao , Weiji Zhuang , Yujun Wang , Lei Xie

In multi-channel speech enhancement and robust automatic speech recognition (ASR), beamforming can typically improve the signal-to-noise ratio (SNR) of the target speaker and produce reliable enhancement with little distortion to target…

Audio and Speech Processing · Electrical Eng. & Systems 2025-07-22 Zhong-Qiu Wang , Ruizhe Pang

Automatic speech recognition in multi-channel reverberant conditions is a challenging task. The conventional way of suppressing the reverberation artifacts involves a beamforming based enhancement of the multi-channel speech signal, which…

Audio and Speech Processing · Electrical Eng. & Systems 2020-01-28 Anurenjan Purushothaman , Anirudh Sreeram , Sriram Ganapathy
‹ Prev 1 2 3 10 Next ›