Related papers: ASR Error Correction with Constrained Decoding on …

FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition

Error correction techniques have been used to refine the output sentences from automatic speech recognition (ASR) models and achieve a lower word error rate (WER) than original ASR outputs. Previous works usually use a sequence-to-sequence…

Computation and Language · Computer Science 2022-11-30 Yichong Leng, Xu Tan, Linchen Zhu, Jin Xu, Renqian Luo, Linquan Liu, Tao Qin, Xiang-Yang Li, Ed Lin, Tie-Yan Liu

SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition

Error correction in automatic speech recognition (ASR) aims to correct those incorrect words in sentences generated by ASR models. Since recent ASR models usually have low word error rate (WER), to avoid affecting originally correct tokens,…

Computation and Language · Computer Science 2023-12-21 Yichong Leng, Xu Tan, Wenjie Liu, Kaitao Song, Rui Wang, Xiang-Yang Li, Tao Qin, Edward Lin, Tie-Yan Liu

Error Correction by Paying Attention to Both Acoustic and Confidence References for Automatic Speech Recognition

Accurately finding the wrong words in the automatic speech recognition (ASR) hypothesis and recovering them in a well-founded way is the goal of speech error correction. In this paper, we propose a non-autoregressive speech error correction method.…

Computation and Language · Computer Science 2024-07-19 Yuchun Shu, Bo Hu, Yifeng He, Hao Shi, Longbiao Wang, Jianwu Dang

ED-CEC: Improving Rare Word Recognition Using ASR Postprocessing Based on Error Detection and Context-Aware Error Correction

Automatic speech recognition (ASR) systems often encounter difficulties in accurately recognizing rare words, leading to errors that can have a negative impact on downstream tasks such as keyword spotting, intent detection, and text…

Artificial Intelligence · Computer Science 2023-10-10 Jiajun He, Zekun Yang, Tomoki Toda

Correction of Automatic Speech Recognition with Transformer Sequence-to-sequence Model

In this work, we introduce a simple yet efficient post-processing model for automatic speech recognition (ASR). Our model has a Transformer-based encoder-decoder architecture which "translates" ASR model output into grammatically and…

Computation and Language · Computer Science 2019-10-24 Oleksii Hrinchuk, Mariya Popova, Boris Ginsburg

FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition

Error correction is widely used in automatic speech recognition (ASR) to post-process the generated sentence, and can further reduce the word error rate (WER). Although multiple candidates are generated by an ASR system through beam search,…

Computation and Language · Computer Science 2022-11-30 Yichong Leng, Xu Tan, Rui Wang, Linchen Zhu, Jin Xu, Wenjie Liu, Linquan Liu, Tao Qin, Xiang-Yang Li, Edward Lin, Tie-Yan Liu

Tag and correct: high precision post-editing approach to correction of speech recognition errors

This paper presents a new approach to the problem of correcting speech recognition errors by means of post-editing. It consists of using a neural sequence tagger that learns how to correct an ASR (Automatic Speech Recognition) hypothesis…

Computation and Language · Computer Science 2024-06-13 Tomasz Ziętkiewicz

Post-Editing Error Correction Algorithm for Speech Recognition using Bing Spelling Suggestion

ASR, short for Automatic Speech Recognition, is the process of converting spoken speech into text that can be manipulated by a computer. Although ASR has several applications, it is still erroneous and imprecise, especially if used in a…

Computation and Language · Computer Science 2012-03-26 Youssef Bassil, Mohammad Alwani

Hybrid phonetic-neural model for correction in speech recognition systems

Automatic speech recognition (ASR) is a relevant area in multiple settings because it provides a natural communication mechanism between applications and users. ASRs often fail in environments that use language specific to particular…

Audio and Speech Processing · Electrical Eng. & Systems 2021-02-16 Rafael Viana-Cámara, Mario Campos-Soberanis, Diego Campos-Sobrino

UCorrect: An Unsupervised Framework for Automatic Speech Recognition Error Correction

Error correction techniques have been used to refine the output sentences from automatic speech recognition (ASR) models and achieve a lower word error rate (WER). Previous works usually adopt end-to-end models and have a strong dependency on…

Computation and Language · Computer Science 2024-01-12 Jiaxin Guo, Minghan Wang, Xiaosong Qiao, Daimeng Wei, Hengchao Shang, Zongyao Li, Zhengzhe Yu, Yinglu Li, Chang Su, Min Zhang, Shimin Tao, Hao Yang

Error Correction in ASR using Sequence-to-Sequence Models

Post-editing in Automatic Speech Recognition (ASR) entails automatically correcting common and systematic errors produced by the ASR system. The outputs of an ASR system are largely prone to phonetic and spelling errors. In this paper, we…

Computation and Language · Computer Science 2022-08-24 Samrat Dutta, Shreyansh Jain, Ayush Maheshwari, Souvik Pal, Ganesh Ramakrishnan, Preethi Jyothi

Revisiting ASR Error Correction with Specialized Models

Language models play a central role in automatic speech recognition (ASR), yet most methods rely on text-only models unaware of ASR error patterns. Recently, large language models (LLMs) have been applied to ASR correction, but introduce…

Machine Learning · Computer Science 2026-03-18 Zijin Gu, Tatiana Likhomanenko, He Bai, Erik McDermott, Ronan Collobert, Navdeep Jaitly

Cross-Modal ASR Post-Processing System for Error Correction and Utterance Rejection

Although modern automatic speech recognition (ASR) systems can achieve high performance, they may produce errors that weaken readers' experience and do harm to downstream tasks. To improve the accuracy and reliability of ASR hypotheses, we…

Audio and Speech Processing · Electrical Eng. & Systems 2022-01-11 Jing Du, Shiliang Pu, Qinbo Dong, Chao Jin, Xin Qi, Dian Gu, Ru Wu, Hongwei Zhou

Evaluating ASR Confidence Scores for Automated Error Detection in User-Assisted Correction Interfaces

Despite advances in Automatic Speech Recognition (ASR), transcription errors persist and require manual correction. Confidence scores, which indicate the certainty of ASR results, could assist users in identifying and correcting errors.…

Human-Computer Interaction · Computer Science 2025-03-20 Korbinian Kuhn, Verena Kersken, Gottfried Zimmermann

Transformer-based Online Speech Recognition with Decoder-end Adaptive Computation Steps

Transformer-based end-to-end (E2E) automatic speech recognition (ASR) systems have recently gained wide popularity, and are shown to outperform E2E models based on recurrent structures on a number of ASR tasks. However, like other E2E…

Audio and Speech Processing · Electrical Eng. & Systems 2020-11-30 Mohan Li, Catalin Zorila, Rama Doddipatla

Automatic Quality Estimation for ASR System Combination

Recognizer Output Voting Error Reduction (ROVER) has been widely used for system combination in automatic speech recognition (ASR). In order to select the most appropriate words to insert at each position in the output transcriptions, some…

Computation and Language · Computer Science 2017-06-23 Shahab Jalalvand, Matteo Negri, Daniele Falavigna, Marco Matassoni, Marco Turchi

Transformer-based ASR Incorporating Time-reduction Layer and Fine-tuning with Self-Knowledge Distillation

End-to-end automatic speech recognition (ASR), unlike conventional ASR, does not have modules to learn the semantic representation from the speech encoder. Moreover, the higher frame rate of the speech representation prevents the model from learning the…

Artificial Intelligence · Computer Science 2021-03-19 Md Akmal Haidar, Chao Xing, Mehdi Rezagholizadeh

Run-and-back stitch search: novel block synchronous decoding for streaming encoder-decoder ASR

Streaming-style inference for an encoder-decoder automatic speech recognition (ASR) system is important for reducing latency, which is essential for interactive use cases. To this end, we propose a novel blockwise synchronous decoding…

Audio and Speech Processing · Electrical Eng. & Systems 2022-01-26 Emiru Tsunoo, Chaitanya Narisetty, Michael Hentschel, Yosuke Kashiwagi, Shinji Watanabe

Investigating the Sensitivity of Automatic Speech Recognition Systems to Phonetic Variation in L2 Englishes

Automatic Speech Recognition (ASR) systems exhibit the best performance on speech that is similar to that on which they were trained. As such, underrepresented varieties including regional dialects, minority speakers, and low-resource…

Computation and Language · Computer Science 2023-05-15 Emma O'Neill, Julie Carson-Berndsen

ASR Error Detection via Audio-Transcript entailment

Despite improved performances of the latest Automatic Speech Recognition (ASR) systems, transcription errors are still unavoidable. These errors can have a considerable impact in critical domains such as healthcare, when used to help with…

Computation and Language · Computer Science 2022-07-25 Nimshi Venkat Meripo, Sandeep Konam