Related papers: Attention-Based Models for Speech Recognition

Input-length-shortening and text generation via attention values

Identifying words that impact a task's performance more than others is a challenge in natural language processing. Transformers models have recently addressed this issue by incorporating an attention mechanism that assigns greater attention…

Computation and Language · Computer Science 2023-03-15 Neşet Özkan Tan, Alex Yuxuan Peng, Joshua Bensemann, Qiming Bao, Tim Hartill, Mark Gahegan, Michael Witbrock

Attention-Based Models for Text-Dependent Speaker Verification

Attention-based models have recently shown great performance on a range of tasks, such as speech recognition, machine translation, and image captioning due to their ability to summarize relevant information that expands through the entire…

Audio and Speech Processing · Electrical Eng. & Systems 2018-02-02 F A Rezaur Rahman Chowdhury, Quan Wang, Ignacio Lopez Moreno, Li Wan

Temporal Attention Model for Neural Machine Translation

Attention-based Neural Machine Translation (NMT) models suffer from attention deficiency issues as has been observed in recent research. We propose a novel mechanism to address some of these limitations and improve the NMT attention.…

Computation and Language · Computer Science 2016-08-10 Baskaran Sankaran, Haitao Mi, Yaser Al-Onaizan, Abe Ittycheriah

Attention Forcing for Sequence-to-sequence Model Training

Auto-regressive sequence-to-sequence models with attention mechanism have achieved state-of-the-art performance in many tasks such as machine translation and speech synthesis. These models can be difficult to train. The standard approach,…

Machine Learning · Computer Science 2019-10-04 Qingyun Dou, Yiting Lu, Joshua Efiong, Mark J. F. Gales

Attention-based Contextual Language Model Adaptation for Speech Recognition

Language modeling (LM) for automatic speech recognition (ASR) does not usually incorporate utterance level contextual information. For some domains like voice assistants, however, additional context, such as the time at which an utterance…

Computation and Language · Computer Science 2021-06-04 Richard Diehl Martinez, Scott Novotney, Ivan Bulyko, Ariya Rastrow, Andreas Stolcke, Ankur Gandhe

A neural attention model for speech command recognition

This paper introduces a convolutional recurrent network with attention for speech command recognition. Attention models are powerful tools to improve performance on natural language, image captioning and speech tasks. The proposed model…

Audio and Speech Processing · Electrical Eng. & Systems 2018-08-28 Douglas Coimbra de Andrade, Sabato Leo, Martin Loesener Da Silva Viana, Christoph Bernkopf

Neural Machine Translation with Recurrent Attention Modeling

Knowing which words have been attended to in previous time steps while generating a translation is a rich source of information for predicting what words will be attended to in the future. We improve upon the attention model of Bahdanau et…

Neural and Evolutionary Computing · Computer Science 2016-07-19 Zichao Yang, Zhiting Hu, Yuntian Deng, Chris Dyer, Alex Smola

Sequence-to-Sequence Learning via Attention Transfer for Incremental Speech Recognition

Attention-based sequence-to-sequence automatic speech recognition (ASR) requires a significant delay to recognize long utterances because the output is generated after receiving entire input sequences. Although several studies recently…

Computation and Language · Computer Science 2020-11-05 Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura

Streaming Attention-Based Models with Augmented Memory for End-to-End Speech Recognition

Attention-based models have been gaining popularity recently for their strong performance demonstrated in fields such as machine translation and automatic speech recognition. One major challenge of attention-based models is the need of…

Computation and Language · Computer Science 2020-11-17 Ching-Feng Yeh, Yongqiang Wang, Yangyang Shi, Chunyang Wu, Frank Zhang, Julian Chan, Michael L. Seltzer

Temporal Attention for Language Models

Pretrained language models based on the transformer architecture have shown great success in NLP. Textual training data often comes from the web and is thus tagged with time-specific information, but most language models ignore this…

Computation and Language · Computer Science 2022-05-05 Guy D. Rosin, Kira Radinsky

Improved training of end-to-end attention models for speech recognition

Sequence-to-sequence attention-based models on subword units allow simple open-vocabulary end-to-end speech recognition. In this work, we show that such models can achieve competitive results on the Switchboard 300h and LibriSpeech 1000h…

Computation and Language · Computer Science 2019-08-06 Albert Zeyer, Kazuki Irie, Ralf Schlüter, Hermann Ney

LASER: Attention with Exponential Transformation

Transformers have had tremendous impact for several sequence related tasks, largely due to their ability to retrieve from any part of the sequence via softmax based dot-product attention. This mechanism plays a crucial role in Transformer's…

Machine Learning · Computer Science 2025-07-15 Sai Surya Duvvuri, Inderjit S. Dhillon

Longer is (Not Necessarily) Stronger: Punctuated Long-Sequence Training for Enhanced Speech Recognition and Translation

This paper presents a new method for training sequence-to-sequence models for speech recognition and translation tasks. Instead of the traditional approach of training models on short segments containing only lowercase or partial…

Audio and Speech Processing · Electrical Eng. & Systems 2024-09-10 Nithin Rao Koluguri, Travis Bartley, Hainan Xu, Oleksii Hrinchuk, Jagadeesh Balam, Boris Ginsburg, Georg Kucsko

Look-ahead Attention for Generation in Neural Machine Translation

The attention model has become a standard component in neural machine translation (NMT) and it guides translation process by selectively focusing on parts of the source sentence when predicting each target word. However, we find that the…

Computation and Language · Computer Science 2017-08-31 Long Zhou, Jiajun Zhang, Chengqing Zong

Modeling Recurrence for Transformer

Recently, the Transformer model that is based solely on attention mechanisms, has advanced the state-of-the-art on various machine translation tasks. However, recent studies reveal that the lack of recurrence hinders its further improvement…

Computation and Language · Computer Science 2019-04-08 Jie Hao, Xing Wang, Baosong Yang, Longyue Wang, Jinfeng Zhang, Zhaopeng Tu

State-of-the-art Speech Recognition With Sequence-to-Sequence Models

Attention-based encoder-decoder architectures such as Listen, Attend, and Spell (LAS), subsume the acoustic, pronunciation and language model components of a traditional automatic speech recognition (ASR) system into a single neural…

Computation and Language · Computer Science 2018-02-26 Chung-Cheng Chiu, Tara N. Sainath, Yonghui Wu, Rohit Prabhavalkar, Patrick Nguyen, Zhifeng Chen, Anjuli Kannan, Ron J. Weiss, Kanishka Rao, Ekaterina Gonina, Navdeep Jaitly, Bo Li, Jan Chorowski, Michiel Bacchiani

Neural Attention Models for Sequence Classification: Analysis and Application to Key Term Extraction and Dialogue Act Detection

Recurrent neural network architectures combining with attention mechanism, or neural attention model, have shown promising performance recently for the tasks including speech recognition, image caption generation, visual question answering…

Computation and Language · Computer Science 2016-04-04 Sheng-syun Shen, Hung-yi Lee

Attention Forcing for Machine Translation

Auto-regressive sequence-to-sequence models with attention mechanisms have achieved state-of-the-art performance in various tasks including Text-To-Speech (TTS) and Neural Machine Translation (NMT). The standard training approach, teacher…

Computation and Language · Computer Science 2021-04-06 Qingyun Dou, Yiting Lu, Potsawee Manakul, Xixin Wu, Mark J. F. Gales

Neural Machine Translation with Key-Value Memory-Augmented Attention

Although attention-based Neural Machine Translation (NMT) has achieved remarkable progress in recent years, it still suffers from issues of repeating and dropping translations. To alleviate these issues, we propose a novel key-value…

Computation and Language · Computer Science 2018-07-02 Fandong Meng, Zhaopeng Tu, Yong Cheng, Haiyang Wu, Junjie Zhai, Yuekui Yang, Di Wang

XNOR-FORMER: Learning Accurate Approximations in Long Speech Transformers

Transformers are among the state of the art for many tasks in speech, vision, and natural language processing, among others. Self-attentions, which are crucial contributors to this performance have quadratic computational complexity, which…

Computation and Language · Computer Science 2022-12-21 Roshan Sharma, Bhiksha Raj