Related papers: Decoupled Context Processing for Context Augmented…

Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts

In the era of large language models, applying techniques such as Retrieval Augmented Generation can better address Open-Domain Question-Answering problems. Due to constraints including model sizes and computing resources, the length of…

Computation and Language · Computer Science 2024-12-24 Zhuo Chen, Xinyu Wang, Yong Jiang, Pengjun Xie, Fei Huang, Kewei Tu

Resona: Improving Context Copying in Linear Recurrence Models with Retrieval

Recent shifts in the space of large language model (LLM) research have shown an increasing focus on novel architectures to compete with prototypical Transformer-based models that have long dominated this space. Linear recurrent models have…

Computation and Language · Computer Science 2025-07-24 Xinyu Wang, Linrui Ma, Jerry Huang, Peng Lu, Prasanna Parthasarathi, Xiao-Wen Chang, Boxing Chen, Yufei Cui

RAVEN: In-Context Learning with Retrieval-Augmented Encoder-Decoder Language Models

In this paper, we investigate the in-context learning ability of retrieval-augmented encoder-decoder language models. We first conduct a comprehensive analysis of existing models and identify their limitations in in-context learning,…

Computation and Language · Computer Science 2024-08-20 Jie Huang, Wei Ping, Peng Xu, Mohammad Shoeybi, Kevin Chen-Chuan Chang, Bryan Catanzaro

RARe: Retrieval Augmented Retrieval with In-Context Examples

While in-context learning is well-studied with decoder-only language models (LLMs), its utility for encoder-only models remains underexplored. We study in-context learning for encoder-only models for text retrieval tasks. Can incorporating…

Computation and Language · Computer Science 2026-02-10 Atula Tejaswi, Yoonsang Lee, Sujay Sanghavi, Eunsol Choi

Memory-Augmented Architecture for Long-Term Context Handling in Large Language Models

Large Language Models face significant challenges in maintaining coherent interactions over extended dialogues due to their limited contextual memory. This limitation often leads to fragmented exchanges and reduced relevance in responses,…

Machine Learning · Computer Science 2025-06-24 Haseeb Ullah Khan Shinwari, Muhammad Usama

Improving Context Aware Language Models

Increased adaptability of RNN language models leads to improved predictions that benefit many applications. However, current methods do not take full advantage of the RNN structure. We show that the most widely-used approach to adaptation…

Computation and Language · Computer Science 2017-04-24 Aaron Jaech, Mari Ostendorf

Learning to Retrieve In-Context Examples for Large Language Models

Large language models (LLMs) have demonstrated their ability to learn in-context, allowing them to perform various tasks based on a few input-output examples. However, the effectiveness of in-context learning is heavily reliant on the…

Computation and Language · Computer Science 2024-01-29 Liang Wang, Nan Yang, Furu Wei

In-Context Retrieval-Augmented Language Models

Retrieval-Augmented Language Modeling (RALM) methods, which condition a language model (LM) on relevant documents from a grounding corpus during generation, were shown to significantly improve language modeling performance. In addition,…

Computation and Language · Computer Science 2023-08-02 Ori Ram, Yoav Levine, Itay Dalmedigos, Dor Muhlgay, Amnon Shashua, Kevin Leyton-Brown, Yoav Shoham

Enriching Conversation Context in Retrieval-based Chatbots

Work on retrieval-based chatbots, like most sequence pair matching tasks, can be divided into Cross-encoders that perform word matching over the pair, and Bi-encoders that encode the pair separately. The latter has better performance,…

Computation and Language · Computer Science 2019-11-07 Amir Vakili Tahami, Azadeh Shakery

Machine translation considering context information using Encoder-Decoder model

In the task of machine translation, context information is one of the important factors. However, a model that considers context information has not been proposed. This paper proposes a new model which can integrate context information and make…

Computation and Language · Computer Science 2019-04-02 Tetsuto Takano, Satoshi Yamane

ACER: Automatic Language Model Context Extension via Retrieval

Long-context modeling is one of the critical capabilities of language AI for digesting and reasoning over complex information pieces. In practice, long-context capabilities are typically built into a pre-trained language model (LM) through…

Computation and Language · Computer Science 2024-10-15 Luyu Gao, Yunyi Zhang, Jamie Callan

More Room for Language: Investigating the Effect of Retrieval on Language Models

Retrieval-augmented language models pose a promising alternative to standard language modeling. During pretraining, these models search in a corpus of documents for contextually relevant information that could aid the language modeling…

Computation and Language · Computer Science 2024-04-18 David Samuel, Lucas Georges Gabriel Charpentier, Sondre Wold

Context-aware Decoder for Neural Machine Translation using a Target-side Document-Level Language Model

Although many context-aware neural machine translation models have been proposed to incorporate contexts in translation, most of those models are trained end-to-end on parallel documents aligned in sentence-level. Because only a few domains…

Computation and Language · Computer Science 2021-11-16 Amane Sugiyama, Naoki Yoshinaga

Augmenting Language Models with Long-Term Memory

Existing large language models (LLMs) can only afford fix-sized inputs due to the input length limit, preventing them from utilizing rich long-context information from past inputs. To address this, we propose a framework, Language Models…

Computation and Language · Computer Science 2023-06-13 Weizhi Wang, Li Dong, Hao Cheng, Xiaodong Liu, Xifeng Yan, Jianfeng Gao, Furu Wei

Exploring the context of recurrent neural network based conversational agents

Conversational agents have begun to rise both in the academic (in terms of research) and commercial (in terms of applications) world. This paper investigates the task of building a non-goal driven conversational agent, using neural network…

Computation and Language · Computer Science 2019-02-01 Raffaele Piccini, Gerasimos Spanakis

Rethinking embedding coupling in pre-trained language models

We re-evaluate the standard practice of sharing weights between input and output embeddings in state-of-the-art pre-trained language models. We show that decoupled embeddings provide increased modeling flexibility, allowing us to…

Computation and Language · Computer Science 2020-10-27 Hyung Won Chung, Thibault Févry, Henry Tsai, Melvin Johnson, Sebastian Ruder

Contextual Augmentation for Entity Linking using Large Language Models

Entity Linking involves detecting and linking entity mentions in natural language texts to a knowledge graph. Traditional methods use a two-step process with separate models for entity recognition and disambiguation, which can be…

Computation and Language · Computer Science 2025-10-23 Daniel Vollmers, Hamada M. Zahera, Diego Moussallem, Axel-Cyrille Ngonga Ngomo

Making Retrieval-Augmented Language Models Robust to Irrelevant Context

Retrieval-augmented language models (RALMs) hold promise to produce language understanding systems that are factual, efficient, and up-to-date. An important desideratum of RALMs is that retrieved information helps model performance…

Computation and Language · Computer Science 2024-05-07 Ori Yoran, Tomer Wolfson, Ori Ram, Jonathan Berant

Beyond the Parameters: A Technical Survey of Contextual Enrichment in Large Language Models: From In-Context Prompting to Causal Retrieval-Augmented Generation

Large language models (LLMs) encode vast world knowledge in their parameters, yet they remain fundamentally limited by static knowledge, finite context windows, and weakly structured causal reasoning. This survey provides a unified account…

Computation and Language · Computer Science 2026-04-06 Prakhar Bansal, Shivangi Agarwal

Focused Concatenation for Context-Aware Neural Machine Translation

A straightforward approach to context-aware neural machine translation consists in feeding the standard encoder-decoder architecture with a window of consecutive sentences, formed by the current sentence and a number of sentences from its…

Computation and Language · Computer Science 2022-10-25 Lorenzo Lupo, Marco Dinarelli, Laurent Besacier