English
Related papers

Related papers: Decoupled Context Processing for Context Augmented…

200 papers

In the era of large language models, applying techniques such as Retrieval Augmented Generation can better address Open-Domain Question-Answering problems. Due to constraints including model sizes and computing resources, the length of…

Computation and Language · Computer Science 2024-12-24 Zhuo Chen , Xinyu Wang , Yong Jiang , Pengjun Xie , Fei Huang , Kewei Tu

Recent shifts in the space of large language model (LLM) research have shown an increasing focus on novel architectures to compete with prototypical Transformer-based models that have long dominated this space. Linear recurrent models have…

Computation and Language · Computer Science 2025-07-24 Xinyu Wang , Linrui Ma , Jerry Huang , Peng Lu , Prasanna Parthasarathi , Xiao-Wen Chang , Boxing Chen , Yufei Cui

In this paper, we investigate the in-context learning ability of retrieval-augmented encoder-decoder language models. We first conduct a comprehensive analysis of existing models and identify their limitations in in-context learning,…

Computation and Language · Computer Science 2024-08-20 Jie Huang , Wei Ping , Peng Xu , Mohammad Shoeybi , Kevin Chen-Chuan Chang , Bryan Catanzaro

While in-context learning is well-studied with decoder-only language models (LLMs), its utility for encoder-only models remains underexplored. We study in-context learning for encoder-only models for text retrieval tasks. Can incorporating…

Computation and Language · Computer Science 2026-02-10 Atula Tejaswi , Yoonsang Lee , Sujay Sanghavi , Eunsol Choi

Large Language Models face significant challenges in maintaining coherent interactions over extended dialogues due to their limited contextual memory. This limitation often leads to fragmented exchanges and reduced relevance in responses,…

Machine Learning · Computer Science 2025-06-24 Haseeb Ullah Khan Shinwari , Muhammad Usama

Increased adaptability of RNN language models leads to improved predictions that benefit many applications. However, current methods do not take full advantage of the RNN structure. We show that the most widely-used approach to adaptation…

Computation and Language · Computer Science 2017-04-24 Aaron Jaech , Mari Ostendorf

Large language models (LLMs) have demonstrated their ability to learn in-context, allowing them to perform various tasks based on a few input-output examples. However, the effectiveness of in-context learning is heavily reliant on the…

Computation and Language · Computer Science 2024-01-29 Liang Wang , Nan Yang , Furu Wei

Retrieval-Augmented Language Modeling (RALM) methods, which condition a language model (LM) on relevant documents from a grounding corpus during generation, were shown to significantly improve language modeling performance. In addition,…

Computation and Language · Computer Science 2023-08-02 Ori Ram , Yoav Levine , Itay Dalmedigos , Dor Muhlgay , Amnon Shashua , Kevin Leyton-Brown , Yoav Shoham

Work on retrieval-based chatbots, like most sequence pair matching tasks, can be divided into Cross-encoders that perform word matching over the pair, and Bi-encoders that encode the pair separately. The latter has better performance,…

Computation and Language · Computer Science 2019-11-07 Amir Vakili Tahami , Azadeh Shakery

In the task of machine translation, context information is one of the important factor. But considering the context information model dose not proposed. The paper propose a new model which can integrate context information and make…

Computation and Language · Computer Science 2019-04-02 Tetsuto Takano , Satoshi Yamane

Long-context modeling is one of the critical capabilities of language AI for digesting and reasoning over complex information pieces. In practice, long-context capabilities are typically built into a pre-trained language model~(LM) through…

Computation and Language · Computer Science 2024-10-15 Luyu Gao , Yunyi Zhang , Jamie Callan

Retrieval-augmented language models pose a promising alternative to standard language modeling. During pretraining, these models search in a corpus of documents for contextually relevant information that could aid the language modeling…

Computation and Language · Computer Science 2024-04-18 David Samuel , Lucas Georges Gabriel Charpentier , Sondre Wold

Although many context-aware neural machine translation models have been proposed to incorporate contexts in translation, most of those models are trained end-to-end on parallel documents aligned in sentence-level. Because only a few domains…

Computation and Language · Computer Science 2021-11-16 Amane Sugiyama , Naoki Yoshinaga

Existing large language models (LLMs) can only afford fix-sized inputs due to the input length limit, preventing them from utilizing rich long-context information from past inputs. To address this, we propose a framework, Language Models…

Computation and Language · Computer Science 2023-06-13 Weizhi Wang , Li Dong , Hao Cheng , Xiaodong Liu , Xifeng Yan , Jianfeng Gao , Furu Wei

Conversational agents have begun to rise both in the academic (in terms of research) and commercial (in terms of applications) world. This paper investigates the task of building a non-goal driven conversational agent, using neural network…

Computation and Language · Computer Science 2019-02-01 Raffaele Piccini , Gerasimos Spanakis

We re-evaluate the standard practice of sharing weights between input and output embeddings in state-of-the-art pre-trained language models. We show that decoupled embeddings provide increased modeling flexibility, allowing us to…

Computation and Language · Computer Science 2020-10-27 Hyung Won Chung , Thibault Févry , Henry Tsai , Melvin Johnson , Sebastian Ruder

Entity Linking involves detecting and linking entity mentions in natural language texts to a knowledge graph. Traditional methods use a two-step process with separate models for entity recognition and disambiguation, which can be…

Computation and Language · Computer Science 2025-10-23 Daniel Vollmers , Hamada M. Zahera , Diego Moussallem , Axel-Cyrille Ngonga Ngomo

Retrieval-augmented language models (RALMs) hold promise to produce language understanding systems that are are factual, efficient, and up-to-date. An important desideratum of RALMs, is that retrieved information helps model performance…

Computation and Language · Computer Science 2024-05-07 Ori Yoran , Tomer Wolfson , Ori Ram , Jonathan Berant

Large language models (LLMs) encode vast world knowledge in their parameters, yet they remain fundamentally limited by static knowledge, finite context windows, and weakly structured causal reasoning. This survey provides a unified account…

Computation and Language · Computer Science 2026-04-06 Prakhar Bansal , Shivangi Agarwal

A straightforward approach to context-aware neural machine translation consists in feeding the standard encoder-decoder architecture with a window of consecutive sentences, formed by the current sentence and a number of sentences from its…

Computation and Language · Computer Science 2022-10-25 Lorenzo Lupo , Marco Dinarelli , Laurent Besacier
‹ Prev 1 2 3 10 Next ›