An Asynchronous WFST-Based Decoder For Automatic Speech Recognition

Hang Lv; Zhehuai Chen; Hainan Xu; Daniel Povey; Lei Xie; Sanjeev Khudanpur

Sound 2021-03-17 v1 Audio and Speech Processing

Authors: Hang Lv, Zhehuai Chen, Hainan Xu, Daniel Povey, Lei Xie, Sanjeev Khudanpur

Abstract

We introduce an asynchronous dynamic decoder, which adopts an efficient A* algorithm to incorporate big language models into one-pass decoding for large-vocabulary continuous speech recognition. Standard one-pass decoding with an on-the-fly composition decoder can incur significant computational overhead. In contrast, the asynchronous dynamic decoder has a novel design with two fronts, one performing "exploration" and the other "backfill". Computation alternates between the two fronts during decoding, which yields more effective pruning than standard one-pass decoding with an on-the-fly composition decoder. Experiments show that the proposed decoder is notably faster than standard one-pass decoding with an on-the-fly composition decoder, and that the speedup becomes more pronounced as data complexity increases.
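For readers unfamiliar with the A* algorithm named in the abstract, the following is a minimal generic sketch of A* best-first search over a small weighted graph. It is not the paper's decoder and does not model the two fronts. The function name `a_star`, the toy graph, and the heuristic values are all hypothetical and chosen only for illustration.

```python
import heapq


def a_star(graph, heuristic, start, goal):
    """Return (cost, path) for the cheapest start-to-goal path, or None.

    graph maps a state to a list of (next_state, arc_cost) pairs.
    heuristic maps a state to an estimate of the remaining cost; it must
    never overestimate for the result to be optimal.
    """
    # Queue entries are (cost_so_far + heuristic, cost_so_far, state, path).
    queue = [(heuristic[start], 0.0, start, [start])]
    best_cost = {start: 0.0}
    while queue:
        _, cost, state, path = heapq.heappop(queue)
        if state == goal:
            return cost, path
        if cost > best_cost.get(state, float("inf")):
            continue  # a cheaper route to this state was already expanded
        for next_state, arc_cost in graph.get(state, []):
            new_cost = cost + arc_cost
            if new_cost < best_cost.get(next_state, float("inf")):
                best_cost[next_state] = new_cost
                priority = new_cost + heuristic[next_state]
                heapq.heappush(
                    queue, (priority, new_cost, next_state, path + [next_state])
                )
    return None


# Hypothetical toy graph: states are labels, arc costs are illustrative only.
TOY_GRAPH = {
    "s": [("a", 1.0), ("b", 4.0)],
    "a": [("b", 2.0), ("g", 6.0)],
    "b": [("g", 2.0)],
}
# Hypothetical heuristic that never overestimates the remaining cost.
TOY_HEURISTIC = {"s": 4.0, "a": 3.0, "b": 2.0, "g": 0.0}

result = a_star(TOY_GRAPH, TOY_HEURISTIC, "s", "g")
```

The heuristic steers the search toward promising states first, so fewer states are expanded than in an uninformed search. How the paper applies this idea to language model scores is described in the paper itself, not here.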

Keywords

speech recognition; encoder-decoder architecture; audio classification

Cite

@article{arxiv.2103.09063,
  title  = {An Asynchronous WFST-Based Decoder For Automatic Speech Recognition},
  author = {Hang Lv and Zhehuai Chen and Hainan Xu and Daniel Povey and Lei Xie and Sanjeev Khudanpur},
  journal= {arXiv preprint arXiv:2103.09063},
  year   = {2021}
}

Comments

5 pages, 5 figures, ICASSP
