Related papers: Improving Context Aware Language Models

200 papers

A context-aware language model uses location, user and/or domain metadata (context) to adapt its predictions. In neural language models, context information is typically represented as an embedding and it is given to the RNN as an…

Computation and Language · Computer Science 2018-05-08 Aaron Jaech, Mari Ostendorf

The performance of Neural Network (NN)-based language models is steadily improving due to the emergence of new architectures, which are able to learn different natural language characteristics. This paper presents a novel framework, which…

Computation and Language · Computer Science 2017-08-24 Youssef Oualil, Dietrich Klakow

Conversational speech, while being unstructured at an utterance level, typically has a macro topic which provides larger context spanning multiple utterances. The current language models in speech recognition systems using recurrent neural…

Audio and Speech Processing · Electrical Eng. & Systems 2020-08-11 Srikanth Raj Chetupalli, Sriram Ganapathy

We present a new recurrent neural network topology to enhance state-of-the-art machine learning systems by incorporating a broader context. Our approach overcomes recent limitations with extended narratives through a multi-layered…

Computation and Language · Computer Science 2018-08-07 Patrick Huber, Jan Niehues, Alex Waibel

Current advances in Natural Language Processing (NLP) have made it increasingly feasible to build applications leveraging textual data. Generally, the core of these applications relies on having a good semantic representation of text into…

Computation and Language · Computer Science 2024-10-21 Thomas Uriot

We propose a principle for exploring context in machine learning models. Starting with a simple assumption that each observation may or may not depend on its context, a conditional probability distribution is decomposed into two parts:…

Machine Learning · Computer Science 2019-01-23 Yun Zeng

We propose several ways of reusing subword embeddings and other weights in subword-aware neural language models. The proposed techniques do not benefit a competitive character-aware model, but some of them improve the performance of…

Computation and Language · Computer Science 2018-04-26 Zhenisbek Assylbekov, Rustem Takhanov

Deep learning models such as convolutional neural networks and recurrent networks are widely applied in text classification. In spite of their great success, most deep learning models neglect the importance of modeling context information,…

Computation and Language · Computer Science 2019-06-05 Liuyu Xiang, Xiaoming Jin, Lan Yi, Guiguang Ding

Learning algorithms for natural language processing (NLP) tasks traditionally rely on manually defined relevant contextual features. On the other hand, neural network models using only a distributional representation of words have been…

Computation and Language · Computer Science 2017-11-30 Kushal Chawla, Sunil Kumar Sahu, Ashish Anand

Existing research suggests that automatic speech recognition (ASR) models can benefit from additional contexts (e.g., contact lists, user specified vocabulary). Rare words and named entities can be better recognized with contexts. In this…

Audio and Speech Processing · Electrical Eng. & Systems 2024-07-16 Ruizhe Huang, Mahsa Yarmohammadi, Sanjeev Khudanpur, Daniel Povey

Humor is a natural and fundamental component of human interactions. When correctly applied, humor allows us to express thoughts and feelings conveniently and effectively, increasing interpersonal affection, likeability, and trust. However,…

Computation and Language · Computer Science 2020-11-25 Felipe Godoy

It is today acknowledged that neural network language models outperform backoff language models in applications like speech recognition or statistical machine translation. However, training these models on large amounts of data can take…

Neural and Evolutionary Computing · Computer Science 2015-07-08 Aram Ter-Sarkisov, Holger Schwenk, Loic Barrault, Fethi Bougares

Modern machine learning models typically represent inputs as fixed points in a high-dimensional embedding space. While this approach has been proven powerful for a wide range of downstream tasks, it fundamentally differs from the way humans…

Computer Vision and Pattern Recognition · Computer Science 2026-04-16 Frieda Born, Tom Neuhäuser, Lukas Muttenthaler, Brett D. Roads, Bernhard Spitzer, Andrew K. Lampinen, Matt Jones, Klaus-Robert Müller, Michael C. Mozer

Transformer-based language models have recently been at the forefront of active research in text generation. However, these models' advances come at the price of prohibitive training costs, with parameter counts in the billions and compute…

Computation and Language · Computer Science 2025-02-04 Gabriel Lindenmaier, Sean Papay, Sebastian Padó

The integration of contextual embeddings into the optimization processes of large language models is an advancement in natural language processing. The Context-Aware Neural Gradient Mapping framework introduces a dynamic gradient adjustment…

Computation and Language · Computer Science 2025-04-25 David Boldo, Lily Pemberton, Gabriel Thistledown, Jacob Fairchild, Felix Kowalski

Multi-layer models with multiple attention heads per layer provide superior translation quality compared to simpler and shallower models, but determining what source context is most relevant to each target word is more challenging as a…

Computation and Language · Computer Science 2019-02-01 Thomas Zenkel, Joern Wuebker, John DeNero

We propose a neural machine translation architecture that models the surrounding text in addition to the source sentence. These models lead to better performance, both in terms of general translation quality and pronoun prediction, when…

Machine Learning · Statistics 2017-04-19 Sebastien Jean, Stanislas Lauly, Orhan Firat, Kyunghyun Cho

In the last few years, neural networks have been intensively used to develop meaningful distributed representations of words and contexts around them. When these representations, also known as "embeddings", are learned from unsupervised…

Computation and Language · Computer Science 2019-08-07 Giuseppe Marra, Andrea Zugarini, Stefano Melacci, Marco Maggini

Most state-of-the-art models in natural language processing (NLP) are neural models built on top of large, pre-trained, contextual language models that generate representations of words in context and are fine-tuned for the task at hand.…

Computation and Language · Computer Science 2020-10-13 Brian Lester, Daniel Pressel, Amy Hemmeter, Sagnik Ray Choudhury, Srinivas Bangalore

Context-aware neural machine translation aims to use the document-level context to improve translation quality. However, not all words in the context are helpful. The irrelevant or trivial words may bring some noise and distract the model…

Computation and Language · Computer Science 2023-04-20 Jian Yang, Yuwei Yin, Shuming Ma, Liqun Yang, Hongcheng Guo, Haoyang Huang, Dongdong Zhang, Yutao Zeng, Zhoujun Li, Furu Wei