Related papers: Improving Context Aware Language Models

Low-Rank RNN Adaptation for Context-Aware Language Modeling

A context-aware language model uses location, user and/or domain metadata (context) to adapt its predictions. In neural language models, context information is typically represented as an embedding and it is given to the RNN as an…

Computation and Language · Computer Science 2018-05-08 Aaron Jaech , Mari Ostendorf

A Neural Network Approach for Mixing Language Models

The performance of Neural Network (NN)-based language models is steadily improving due to the emergence of new architectures, which are able to learn different natural language characteristics. This paper presents a novel framework, which…

Computation and Language · Computer Science 2017-08-24 Youssef Oualil , Dietrich Klakow

Context Dependent RNNLM for Automatic Transcription of Conversations

Conversational speech, while being unstructured at an utterance level, typically has a macro topic which provides larger context spanning multiple utterances. The current language models in speech recognition systems using recurrent neural…

Audio and Speech Processing · Electrical Eng. & Systems 2020-08-11 Srikanth Raj Chetupalli , Sriram Ganapathy

A Hierarchical Approach to Neural Context-Aware Modeling

We present a new recurrent neural network topology to enhance state-of-the-art machine learning systems by incorporating a broader context. Our approach overcomes recent limitations with extended narratives through a multi-layered…

Computation and Language · Computer Science 2018-08-07 Patrick Huber , Jan Niehues , Alex Waibel

On Debiasing Text Embeddings Through Context Injection

Current advances in Natural Language Processing (NLP) have made it increasingly feasible to build applications leveraging textual data. Generally, the core of these applications rely on having a good semantic representation of text into…

Computation and Language · Computer Science 2024-10-21 Thomas Uriot

Context Aware Machine Learning

We propose a principle for exploring context in machine learning models. Starting with a simple assumption that each observation may or may not depend on its context, a conditional probability distribution is decomposed into two parts:…

Machine Learning · Computer Science 2019-01-23 Yun Zeng

Reusing Weights in Subword-aware Neural Language Models

We propose several ways of reusing subword embeddings and other weights in subword-aware neural language models. The proposed techniques do not benefit a competitive character-aware model, but some of them improve the performance of…

Computation and Language · Computer Science 2018-04-26 Zhenisbek Assylbekov , Rustem Takhanov

Adaptive Region Embedding for Text Classification

Deep learning models such as convolutional neural networks and recurrent networks are widely applied in text classification. In spite of their great success, most deep learning models neglect the importance of modeling context information,…

Computation and Language · Computer Science 2019-06-05 Liuyu Xiang , Xiaoming Jin , Lan Yi , Guiguang Ding

Investigating how well contextual features are captured by bi-directional recurrent neural network models

Learning algorithms for natural language processing (NLP) tasks traditionally rely on manually defined relevant contextual features. On the other hand, neural network models using an only distributional representation of words have been…

Computation and Language · Computer Science 2017-11-30 Kushal Chawla , Sunil Kumar Sahu , Ashish Anand

Improving Neural Biasing for Contextual Speech Recognition by Early Context Injection and Text Perturbation

Existing research suggests that automatic speech recognition (ASR) models can benefit from additional contexts (e.g., contact lists, user specified vocabulary). Rare words and named entities can be better recognized with contexts. In this…

Audio and Speech Processing · Electrical Eng. & Systems 2024-07-16 Ruizhe Huang , Mahsa Yarmohammadi , Sanjeev Khudanpur , Daniel Povey

Advancing Humor-Focused Sentiment Analysis through Improved Contextualized Embeddings and Model Architecture

Humor is a natural and fundamental component of human interactions. When correctly applied, humor allows us to express thoughts and feelings conveniently and effectively, increasing interpersonal affection, likeability, and trust. However,…

Computation and Language · Computer Science 2020-11-25 Felipe Godoy

Incremental Adaptation Strategies for Neural Network Language Models

It is today acknowledged that neural network language models outperform backoff language models in applications like speech recognition or statistical machine translation. However, training these models on large amounts of data can take…

Neural and Evolutionary Computing · Computer Science 2015-07-08 Aram Ter-Sarkisov , Holger Schwenk , Loic Barrault , Fethi Bougares

Context Sensitivity Improves Human-Machine Visual Alignment

Modern machine learning models typically represent inputs as fixed points in a high-dimensional embedding space. While this approach has been proven powerful for a wide range of downstream tasks, it fundamentally differs from the way humans…

Computer Vision and Pattern Recognition · Computer Science 2026-04-16 Frieda Born , Tom Neuhäuser , Lukas Muttenthaler , Brett D. Roads , Bernhard Spitzer , Andrew K. Lampinen , Matt Jones , Klaus-Robert Müller , Michael C. Mozer

Efficient Language Modeling for Low-Resource Settings with Hybrid RNN-Transformer Architectures

Transformer-based language models have recently been at the forefront of active research in text generation. However, these models' advances come at the price of prohibitive training costs, with parameter counts in the billions and compute…

Computation and Language · Computer Science 2025-02-04 Gabriel Lindenmaier , Sean Papay , Sebastian Padó

Context-Aware Neural Gradient Mapping for Fine-Grained Instruction Processing

The integration of contextual embeddings into the optimization processes of large language models is an advancement in natural language processing. The Context-Aware Neural Gradient Mapping framework introduces a dynamic gradient adjustment…

Computation and Language · Computer Science 2025-04-25 David Boldo , Lily Pemberton , Gabriel Thistledown , Jacob Fairchild , Felix Kowalski

Adding Interpretable Attention to Neural Translation Models Improves Word Alignment

Multi-layer models with multiple attention heads per layer provide superior translation quality compared to simpler and shallower models, but determining what source context is most relevant to each target word is more challenging as a…

Computation and Language · Computer Science 2019-02-01 Thomas Zenkel , Joern Wuebker , John DeNero

Does Neural Machine Translation Benefit from Larger Context?

We propose a neural machine translation architecture that models the surrounding text in addition to the source sentence. These models lead to better performance, both in terms of general translation quality and pronoun prediction, when…

Machine Learning · Statistics 2017-04-19 Sebastien Jean , Stanislas Lauly , Orhan Firat , Kyunghyun Cho

An Unsupervised Character-Aware Neural Approach to Word and Context Representation Learning

In the last few years, neural networks have been intensively used to develop meaningful distributed representations of words and contexts around them. When these representations, also known as "embeddings", are learned from unsupervised…

Computation and Language · Computer Science 2019-08-07 Giuseppe Marra , Andrea Zugarini , Stefano Melacci , Marco Maggini

Multiple Word Embeddings for Increased Diversity of Representation

Most state-of-the-art models in natural language processing (NLP) are neural models built on top of large, pre-trained, contextual language models that generate representations of words in context and are fine-tuned for the task at hand.…

Computation and Language · Computer Science 2020-10-13 Brian Lester , Daniel Pressel , Amy Hemmeter , Sagnik Ray Choudhury , Srinivas Bangalore

HanoiT: Enhancing Context-aware Translation via Selective Context

Context-aware neural machine translation aims to use the document-level context to improve translation quality. However, not all words in the context are helpful. The irrelevant or trivial words may bring some noise and distract the model…

Computation and Language · Computer Science 2023-04-20 Jian Yang , Yuwei Yin , Shuming Ma , Liqun Yang , Hongcheng Guo , Haoyang Huang , Dongdong Zhang , Yutao Zeng , Zhoujun Li , Furu Wei