Related papers: The $\mathbf{Y}$-Combinator for LLMs: Solving Long…

Recursive Language Models

We study allowing large language models (LLMs) to process arbitrarily long prompts through the lens of inference-time scaling. We propose Recursive Language Models (RLMs), a general inference paradigm that treats long prompts as part of an…

Artificial Intelligence · Computer Science 2026-05-12 Alex L. Zhang, Tim Kraska, Omar Khattab

Recursive Language Models Meet Uncertainty: The Surprising Effectiveness of Self-Reflective Program Search for Long Context

Long-context handling remains a core challenge for language models: even with extended context windows, models often fail to reliably extract, reason over, and use the information across long contexts. Recent works like Recursive Language…

Computation and Language · Computer Science 2026-03-18 Keivan Alizadeh, Parshin Shojaee, Minsik Cho, Mehrdad Farajtabar

RecaLLM: Addressing the Lost-in-Thought Phenomenon with Explicit In-Context Retrieval

We propose RecaLLM, a set of reasoning language models post-trained to make effective use of long-context information. In-context retrieval, which identifies relevant evidence from context, and reasoning are deeply intertwined: retrieval…

Computation and Language · Computer Science 2026-04-13 Kyle Whitecross, Negin Rahimi

RelayLLM: Efficient Reasoning via Collaborative Decoding

Using Large Language Models (LLMs) for complex reasoning is often hindered by high computational costs and latency, while resource-efficient Small Language Models (SLMs) typically lack the necessary reasoning capacity. Existing collaborative…

Computation and Language · Computer Science 2026-01-09 Chengsong Huang, Tong Zheng, Langlin Huang, Jinyuan Li, Haolin Liu, Jiaxin Huang

Metacognitive Reuse: Turning Recurring LLM Reasoning Into Concise Behaviors

Large language models (LLMs) now solve multi-step problems by emitting extended chains of thought. During the process, they often re-derive the same intermediate steps across problems, inflating token usage and latency. This saturation of…

Machine Learning · Computer Science 2025-09-17 Aniket Didolkar, Nicolas Ballas, Sanjeev Arora, Anirudh Goyal

Tandem: Riding Together with Large and Small Language Models for Efficient Reasoning

Recent advancements in large language models (LLMs) have catalyzed the rise of reasoning-intensive inference paradigms, where models perform explicit step-by-step reasoning before generating final answers. While such approaches improve…

Artificial Intelligence · Computer Science 2026-04-28 Zichuan Fu, Xian Wu, Guojing Li, Yejing Wang, Yijun Chen, Zihao Zhao, Yixuan Luo, Hanyu Yan, Yefeng Zheng, Xiangyu Zhao

Long-context Language Models Fail in Basic Retrieval Tasks Without Sufficient Reasoning Steps

Long-context language models (LCLMs), characterized by their extensive context window, are becoming popular. However, despite the fact that they are nearly perfect at standard long-context retrieval tasks, our evaluations demonstrate they…

Computation and Language · Computer Science 2025-08-27 Yijiong Yu, Yongfeng Huang, Zhixiao Qi, Wei Wang, Weifeng Liu, Ran Chen, Ji Pei

Think, But Don't Overthink: Reproducing Recursive Language Models

This project reproduces and extends the recently proposed "Recursive Language Models" (RLMs) framework by Zhang et al. (2026). This framework enables Large Language Models (LLMs) to process near-infinite contexts by offloading the prompt…

Computation and Language · Computer Science 2026-03-04 Daren Wang

When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs

Recent Long-Context Language Models (LCLMs) can process hundreds of thousands of tokens in a single prompt, enabling new opportunities for knowledge-intensive multi-hop reasoning by integrating large sets of retrieved documents or, in some…

Computation and Language · Computer Science 2026-04-29 Soyeong Jeong, Taehee Jung, Sung Ju Hwang, Joo-Kyung Kim, Dongyeop Kang

LLM Guided Inductive Inference for Solving Compositional Problems

While large language models (LLMs) have demonstrated impressive performance in question-answering tasks, their performance is limited when the questions require knowledge that is not included in the model's training data and can only be…

Computation and Language · Computer Science 2023-09-22 Abhigya Sodani, Lauren Moos, Matthew Mirman

Can Long-Context Language Models Subsume Retrieval, RAG, SQL, and More?

Long-context language models (LCLMs) have the potential to revolutionize our approach to tasks traditionally reliant on external tools like retrieval systems or databases. Leveraging LCLMs' ability to natively ingest and process entire…

Computation and Language · Computer Science 2024-06-21 Jinhyuk Lee, Anthony Chen, Zhuyun Dai, Dheeru Dua, Devendra Singh Sachan, Michael Boratko, Yi Luan, Sébastien M. R. Arnold, Vincent Perot, Siddharth Dalmia, Hexiang Hu, Xudong Lin, Panupong Pasupat, Aida Amini, Jeremy R. Cole, Sebastian Riedel, Iftekhar Naim, Ming-Wei Chang, Kelvin Guu

Enhancing Robustness of Retrieval-Augmented Language Models with In-Context Learning

Retrieval-Augmented Language Models (RALMs) have significantly improved performance in open-domain question answering (QA) by leveraging external knowledge. However, RALMs still struggle with unanswerable queries, where the retrieved…

Computation and Language · Computer Science 2024-08-09 Seong-Il Park, Seung-Woo Choi, Na-Hyun Kim, Jay-Yoon Lee

RAPID: Long-Context Inference with Retrieval-Augmented Speculative Decoding

The emergence of long-context large language models (LLMs) offers a promising alternative to traditional retrieval-augmented generation (RAG) for processing extensive documents. However, the computational overhead of long-context inference…

Computation and Language · Computer Science 2025-06-24 Guanzheng Chen, Qilong Feng, Jinjie Ni, Xin Li, Michael Qizhe Shieh

ReCAP: Recursive Context-Aware Reasoning and Planning for Large Language Model Agents

Long-horizon tasks requiring multi-step reasoning and dynamic re-planning remain challenging for large language models (LLMs). Sequential prompting methods are prone to context drift, loss of goal information, and recurrent failure cycles,…

Artificial Intelligence · Computer Science 2025-10-30 Zhenyu Zhang, Tianyi Chen, Weiran Xu, Alex Pentland, Jiaxin Pei

Does RAG Really Perform Bad For Long-Context Processing?

The efficient processing of long context poses a serious challenge for large language models (LLMs). Recently, retrieval-augmented generation (RAG) has emerged as a promising strategy for this problem, as it enables LLMs to make selective…

Computation and Language · Computer Science 2025-02-18 Kun Luo, Zheng Liu, Peitian Zhang, Hongjin Qian, Jun Zhao, Kang Liu

LCM: Lossless Context Management

We introduce Lossless Context Management (LCM), a deterministic architecture for LLM memory that outperforms Claude Code on long-context tasks. When benchmarked using Opus 4.6, our LCM-augmented coding agent, Volt, achieves higher scores…

Artificial Intelligence · Computer Science 2026-05-07 Clint Ehrlich, Theodore Blackman

ALR$^2$: A Retrieve-then-Reason Framework for Long-context Question Answering

The context window of large language models (LLMs) has been extended significantly in recent years. However, while the context length that the LLM can process has grown, the capability of the model to accurately reason over that context…

Computation and Language · Computer Science 2024-10-07 Huayang Li, Pat Verga, Priyanka Sen, Bowen Yang, Vijay Viswanathan, Patrick Lewis, Taro Watanabe, Yixuan Su

Teaching Smaller Language Models To Generalise To Unseen Compositional Questions (Full Thesis)

Pretrained large Language Models (LLMs) are able to answer questions that are unlikely to have been encountered during training. However, a diversity of potential applications exists in the broad domain of reasoning systems and considerations…

Computation and Language · Computer Science 2024-11-27 Tim Hartill

Reasoning Language Models: A Blueprint

Reasoning language models (RLMs), also known as Large Reasoning Models (LRMs), such as OpenAI's o1 and o3, DeepSeek-R1, and Alibaba's QwQ, have redefined AI's problem-solving capabilities by extending LLMs with advanced reasoning…

Artificial Intelligence · Computer Science 2025-06-12 Maciej Besta, Julia Barth, Eric Schreiber, Ales Kubicek, Afonso Catarino, Robert Gerstenberger, Piotr Nyczyk, Patrick Iff, Yueling Li, Sam Houliston, Tomasz Sternal, Marcin Copik, Grzegorz Kwaśniewski, Jürgen Müller, Łukasz Flis, Hannes Eberhard, Zixuan Chen, Hubert Niewiadomski, Torsten Hoefler

Prompt-R1: Collaborative Automatic Prompting Framework via End-to-end Reinforcement Learning

Recently, advanced large language models (LLMs) have emerged at an increasingly rapid pace. However, when faced with complex problems, most users are often unable to provide accurate and effective prompts to interact with LLMs, thus…

Computation and Language · Computer Science 2026-04-17 Wenjin Liu, Haoran Luo, Xueyuan Lin, Haoming Liu, Tiesunlong Shen, Jiapu Wang, Rui Mao, Erik Cambria