Related papers: Algorithms for Speech Recognition and Language Pro…

Weighted Automata in Text and Speech Processing

Finite-state automata are a very effective tool in natural language processing. However, in a variety of applications and especially in speech precessing, it is necessary to consider more general machines in which arcs are assigned weights…

Computation and Language · Computer Science 2007-05-23 Mehryar Mohri , Fernando Pereira , Michael Riley

Speech Recognition by Composition of Weighted Finite Automata

We present a general framework based on weighted finite automata and weighted finite-state transducers for describing and implementing speech recognizers. The framework allows us to represent uniformly the information sources and data…

cmp-lg · Computer Science 2008-02-03 Fernando C. N. Pereira , Michael D. Riley

A review of on-device fully neural end-to-end automatic speech recognition algorithms

In this paper, we review various end-to-end automatic speech recognition algorithms and their optimization techniques for on-device applications. Conventional speech recognition systems comprise a large number of discrete components such as…

Machine Learning · Computer Science 2021-08-30 Chanwoo Kim , Dhananjaya Gowda , Dongsoo Lee , Jiyeon Kim , Ankur Kumar , Sungsoo Kim , Abhinav Garg , Changwoo Han

Use of Weighted Finite State Transducers in Part of Speech Tagging

This paper addresses issues in part of speech disambiguation using finite-state transducers and presents two main contributions to the field. One of them is the use of finite-state machines for part of speech tagging. Linguistic and…

cmp-lg · Computer Science 2007-05-23 Evelyne Tzoukermann , Dragomir R. Radev

An Efficient Compiler for Weighted Rewrite Rules

Context-dependent rewrite rules are used in many areas of natural language and speech processing. Work in computational phonology has demonstrated that, given certain conditions, such rewrite rules can be represented as finite-state…

cmp-lg · Computer Science 2008-02-03 Mehryar Mohri , Richard Sproat

Optimizing Speech Recognition For The Edge

While most deployed speech recognition systems today still run on servers, we are in the midst of a transition towards deployments on edge devices. This leap to the edge is powered by the progression from traditional speech recognition…

Computation and Language · Computer Science 2020-02-10 Yuan Shangguan , Jian Li , Qiao Liang , Raziel Alvarez , Ian McGraw

Deep Learning Based Natural Language Processing for End to End Speech Translation

Deep Learning methods employ multiple processing layers to learn hierarchial representations of data. They have already been deployed in a humongous number of applications and have produced state-of-the-art results. Recently with the growth…

Computation and Language · Computer Science 2018-08-15 Sarvesh Patil

Finstreder: Simple and fast Spoken Language Understanding with Finite State Transducers using modern Speech-to-Text models

In Spoken Language Understanding (SLU) the task is to extract important information from audio commands, like the intent of what a user wants the system to do and special entities like locations or numbers. This paper presents a simple…

Computation and Language · Computer Science 2022-06-30 Daniel Bermuth , Alexander Poeppel , Wolfgang Reif

Speech Recognition Transformers: Topological-lingualism Perspective

Transformers have evolved with great success in various artificial intelligence tasks. Thanks to our recent prevalence of self-attention mechanisms, which capture long-term dependency, phenomenal outcomes in speech processing and…

Computation and Language · Computer Science 2024-08-28 Shruti Singh , Muskaan Singh , Virender Kadyan

Phoneme Based Neural Transducer for Large Vocabulary Speech Recognition

To join the advantages of classical and end-to-end approaches for speech recognition, we present a simple, novel and competitive approach for phoneme-based neural transducer modeling. Different alignment label topologies are compared and…

Computation and Language · Computer Science 2021-04-21 Wei Zhou , Simon Berger , Ralf Schlüter , Hermann Ney

Transformer-based language modeling and decoding for conversational speech recognition

We propose a way to use a transformer-based language model in conversational speech recognition. Specifically, we focus on decoding efficiently in a weighted finite-state transducer framework. We showcase an approach to lattice re-scoring…

Computation and Language · Computer Science 2020-01-07 Kareem Nassar

Head Automata for Speech Translation

This paper presents statistical language and translation models based on collections of small finite state machines we call ``head automata''. The models are intended to capture the lexical sensitivity of N-gram models and direct…

cmp-lg · Computer Science 2008-02-03 Hiyan Alshawi

Speech Recognition by Machine, A Review

This paper presents a brief survey on Automatic Speech Recognition and discusses the major themes and advances made in the past 60 years of research, so as to provide a technological perspective and an appreciation of the fundamental…

Computation and Language · Computer Science 2010-01-14 M. A. Anusuya , S. K. Katti

Finite-State Approximation of Phrase-Structure Grammars

Phrase-structure grammars are effective models for important syntactic and semantic aspects of natural languages, but can be computationally too demanding for use as language models in real-time speech recognition. Therefore, finite-state…

cmp-lg · Computer Science 2008-02-03 Fernando C. N. Pereira , Rebecca N. Wright

Approximating Context-Free Grammars with a Finite-State Calculus

Although adequate models of human language for syntactic analysis and semantic interpretation are of at least context-free complexity, for applications such as speech processing in which speed is important finite-state models are often…

cmp-lg · Computer Science 2007-05-23 Edmund Grimley-Evans

Transfer Learning for Speech and Language Processing

Transfer learning is a vital technique that generalizes models trained for one setting or task to other settings or tasks. For example in speech recognition, an acoustic model trained for one language can be used to recognize speech in…

Computation and Language · Computer Science 2015-11-20 Dong Wang , Thomas Fang Zheng

Finite State Transducers Approximating Hidden Markov Models

This paper describes the conversion of a Hidden Markov Model into a sequential transducer that closely approximates the behavior of the stochastic model. This transformation is especially advantageous for part-of-speech tagging because the…

cmp-lg · Computer Science 2008-02-03 Andre Kempe

Deep Neural Networks for Automatic Speech Processing: A Survey from Large Corpora to Limited Data

Most state-of-the-art speech systems are using Deep Neural Networks (DNNs). Those systems require a large amount of data to be learned. Hence, learning state-of-the-art frameworks on under-resourced speech languages/problems is a difficult…

Audio and Speech Processing · Electrical Eng. & Systems 2020-03-10 Vincent Roger , Jérôme Farinas , Julien Pinquier

Transformers in Speech Processing: A Survey

The remarkable success of transformers in the field of natural language processing has sparked the interest of the speech-processing community, leading to an exploration of their potential for modeling long-range dependencies within speech…

Computation and Language · Computer Science 2025-06-05 Siddique Latif , Aun Zaidi , Heriberto Cuayahuitl , Fahad Shamshad , Moazzam Shoukat , Muhammad Usama , Junaid Qadir

Real-time low-resource phoneme recognition on edge devices

While speech recognition has seen a surge in interest and research over the last decade, most machine learning models for speech recognition either require large training datasets or lots of storage and memory. Combined with the prominence…

Computation and Language · Computer Science 2021-03-26 Yonatan Alon