English
Related papers

Related papers: Bayesian Transformer Language Models for Speech Re…

200 papers

State-of-the-art neural network language models (NNLMs) represented by long short term memory recurrent neural networks (LSTM-RNNs) and Transformers are becoming highly complex. They are prone to overfitting and poor generalization when…

Computation and Language · Computer Science 2022-08-30 Boyang Xue , Shoukang Hu , Junhao Xu , Mengzhe Geng , Xunying Liu , Helen Meng

Discriminative training techniques define state-of-the-art performance for automatic speech recognition systems. However, they are inherently prone to overfitting, leading to poor generalization performance when using limited training data.…

Audio and Speech Processing · Electrical Eng. & Systems 2021-05-11 Shoukang Hu , Xurong Xie , Shansong Liu , Jianwei Yu , Zi Ye , Mengzhe Geng , Xunying Liu , Helen Meng

Recent developments using End-to-End Deep Learning models have been shown to have near or better performance than state of the art Recurrent Neural Networks (RNNs) on Automatic Speech Recognition tasks. These models tend to be lighter…

Audio and Speech Processing · Electrical Eng. & Systems 2023-01-27 Will Rieger

A key task for speech recognition systems is to reduce the mismatch between training and evaluation data that is often attributable to speaker differences. Speaker adaptation techniques play a vital role to reduce the mismatch. Model-based…

Sound · Computer Science 2024-06-17 Xurong Xie , Xunying Liu , Tan Lee , Lan Wang

Trans-dimensional random field language models (TRF LMs) where sentences are modeled as a collection of random fields, have shown close performance with LSTM LMs in speech recognition and are computationally more efficient in inference.…

Computation and Language · Computer Science 2017-10-31 Bin Wang , Zhijian Ou

Although n-gram language models (LMs) have been outperformed by the state-of-the-art neural LMs, they are still widely used in speech recognition due to its high efficiency in inference. In this paper, we demonstrate that n-gram LM can be…

Computation and Language · Computer Science 2019-12-03 Yiren Wang , Hongzhao Huang , Zhe Liu , Yutong Pang , Yongqiang Wang , ChengXiang Zhai , Fuchun Peng

In this work, we aim to establish a Bayesian adaptive learning framework by focusing on estimating latent variables in deep neural network (DNN) models. Latent variables indeed encode both transferable distributional information and…

Audio and Speech Processing · Electrical Eng. & Systems 2024-01-26 Hu Hu , Sabato Marco Siniscalchi , Chin-Hui Lee

In recent years, Large Language Models (LLMs) have garnered significant attention from the research community due to their exceptional performance and generalization capabilities. In this paper, we introduce a novel method for…

Audio and Speech Processing · Electrical Eng. & Systems 2023-09-21 Egor Lakomkin , Chunyang Wu , Yassir Fathullah , Ozlem Kalinli , Michael L. Seltzer , Christian Fuegen

Language models (LMs) have been commonly adopted to boost the performance of automatic speech recognition (ASR) particularly in domain adaptation tasks. Conventional way of LM training treats all the words in corpora equally, resulting in…

Computation and Language · Computer Science 2023-10-18 Yingyi Ma , Zhe Liu , Ozlem Kalinli

Recently, large language models (LLMs) have demonstrated powerful capabilities in performing various tasks and thus are applied by recent studies to time series forecasting (TSF) tasks, which predict future values with the given historical…

Computation and Language · Computer Science 2025-07-15 Chen Su , Yuanhe Tian , Qinyu Liu , Jun Zhang , Yan Song

Automatic speech recognition and spoken dialogue systems have made great advances through the use of deep machine learning methods. This is partly due to greater computing power but also through the large amount of data available in common…

Computation and Language · Computer Science 2020-06-04 Boris Mocialov , Graham Turner , Helen Hastie

Understanding how Transformer-based Language Models (LMs) learn and recall information is a key goal of the deep learning community. Recent interpretability methods project weights and hidden states obtained from the forward pass to the…

Computation and Language · Computer Science 2024-02-21 Shahar Katz , Yonatan Belinkov , Mor Geva , Lior Wolf

In this paper, we summarize the application of transformer and its streamable variant, Emformer based acoustic model for large scale speech recognition applications. We compare the transformer based acoustic models with their LSTM…

Computation and Language · Computer Science 2020-11-02 Yongqiang Wang , Yangyang Shi , Frank Zhang , Chunyang Wu , Julian Chan , Ching-Feng Yeh , Alex Xiao

Memory-based meta-learning is a technique for approximating Bayes-optimal predictors. Under fairly general conditions, minimizing sequential prediction error, measured by the log loss, leads to implicit meta-learning. The goal of this work…

We propose a variational Bayesian (VB) approach to learning distributions of latent variables in deep neural network (DNN) models for cross-domain knowledge transfer, to address acoustic mismatches between training and testing conditions.…

Audio and Speech Processing · Electrical Eng. & Systems 2022-02-22 Hu Hu , Sabato Marco Siniscalchi , Chao-Han Huck Yang , Chin-Hui Lee

Neural language models (LMs) have been proved to significantly outperform classical n-gram LMs for language modeling due to their superior abilities to model long-range dependencies in text and handle data sparsity problems. And recently,…

Computation and Language · Computer Science 2019-10-28 Hongzhao Huang , Fuchun Peng

We consider phrase based Language Models (LM), which generalize the commonly used word level models. Similar concept on phrase based LMs appears in speech recognition, which is rather specialized and thus less suitable for machine…

Computation and Language · Computer Science 2015-01-20 Jia Xu , Geliang Chen

Much theoretical work has described the ability of transformers to represent formal languages. However, linking theoretical results to empirical performance is not straightforward due to the complex interplay between the architecture, the…

Computation and Language · Computer Science 2024-10-07 Anej Svete , Nadav Borenstein , Mike Zhou , Isabelle Augenstein , Ryan Cotterell

While recurrent neural networks still largely define state-of-the-art speech recognition systems, the Transformer network has been proven to be a competitive alternative, especially in the offline condition. Most studies with Transformers…

Audio and Speech Processing · Electrical Eng. & Systems 2020-08-13 Liang Lu , Changliang Liu , Jinyu Li , Yifan Gong

There has been considerable interest in using surprisal from Transformer-based language models (LMs) as predictors of human sentence processing difficulty. Recent work has observed an inverse scaling relationship between Transformers'…

Computation and Language · Computer Science 2026-02-04 Yi-Chien Lin , William Schuler
‹ Prev 1 2 3 10 Next ›