Related papers: The $\mathbf{Y}$-Combinator for LLMs: Solving Long…

200 papers

We study allowing large language models (LLMs) to process arbitrarily long prompts through the lens of inference-time scaling. We propose Recursive Language Models (RLMs), a general inference paradigm that treats long prompts as part of an…

Artificial Intelligence · Computer Science 2026-05-12 Alex L. Zhang, Tim Kraska, Omar Khattab

Long-context handling remains a core challenge for language models: even with extended context windows, models often fail to reliably extract, reason over, and use the information across long contexts. Recent works like Recursive Language…

Computation and Language · Computer Science 2026-03-18 Keivan Alizadeh, Parshin Shojaee, Minsik Cho, Mehrdad Farajtabar

We propose RecaLLM, a set of reasoning language models post-trained to make effective use of long-context information. In-context retrieval, which identifies relevant evidence from context, and reasoning are deeply intertwined: retrieval…

Computation and Language · Computer Science 2026-04-13 Kyle Whitecross, Negin Rahimi

The use of Large Language Models (LLMs) for complex reasoning is often hindered by high computational costs and latency, while resource-efficient Small Language Models (SLMs) typically lack the necessary reasoning capacity. Existing collaborative…

Computation and Language · Computer Science 2026-01-09 Chengsong Huang, Tong Zheng, Langlin Huang, Jinyuan Li, Haolin Liu, Jiaxin Huang

Large language models (LLMs) now solve multi-step problems by emitting extended chains of thought. During the process, they often re-derive the same intermediate steps across problems, inflating token usage and latency. This saturation of…

Machine Learning · Computer Science 2025-09-17 Aniket Didolkar, Nicolas Ballas, Sanjeev Arora, Anirudh Goyal

Recent advancements in large language models (LLMs) have catalyzed the rise of reasoning-intensive inference paradigms, where models perform explicit step-by-step reasoning before generating final answers. While such approaches improve…

Artificial Intelligence · Computer Science 2026-04-28 Zichuan Fu, Xian Wu, Guojing Li, Yejing Wang, Yijun Chen, Zihao Zhao, Yixuan Luo, Hanyu Yan, Yefeng Zheng, Xiangyu Zhao

Long-context language models (LCLMs), characterized by their extensive context window, are becoming popular. However, despite the fact that they are nearly perfect at standard long-context retrieval tasks, our evaluations demonstrate they…

Computation and Language · Computer Science 2025-08-27 Yijiong Yu, Yongfeng Huang, Zhixiao Qi, Wei Wang, Weifeng Liu, Ran Chen, Ji Pei

This project reproduces and extends the recently proposed "Recursive Language Models" (RLMs) framework by Zhang et al. (2026). This framework enables Large Language Models (LLMs) to process near-infinite contexts by offloading the prompt…

Computation and Language · Computer Science 2026-03-04 Daren Wang

Recent Long-Context Language Models (LCLMs) can process hundreds of thousands of tokens in a single prompt, enabling new opportunities for knowledge-intensive multi-hop reasoning by integrating large sets of retrieved documents or, in some…

Computation and Language · Computer Science 2026-04-29 Soyeong Jeong, Taehee Jung, Sung Ju Hwang, Joo-Kyung Kim, Dongyeop Kang

While large language models (LLMs) have demonstrated impressive performance in question-answering tasks, their performance is limited when the questions require knowledge that is not included in the model's training data and can only be…

Computation and Language · Computer Science 2023-09-22 Abhigya Sodani, Lauren Moos, Matthew Mirman

Long-context language models (LCLMs) have the potential to revolutionize our approach to tasks traditionally reliant on external tools like retrieval systems or databases. Leveraging LCLMs' ability to natively ingest and process entire…

Retrieval-Augmented Language Models (RALMs) have significantly improved performance in open-domain question answering (QA) by leveraging external knowledge. However, RALMs still struggle with unanswerable queries, where the retrieved…

Computation and Language · Computer Science 2024-08-09 Seong-Il Park, Seung-Woo Choi, Na-Hyun Kim, Jay-Yoon Lee

The emergence of long-context large language models (LLMs) offers a promising alternative to traditional retrieval-augmented generation (RAG) for processing extensive documents. However, the computational overhead of long-context inference…

Computation and Language · Computer Science 2025-06-24 Guanzheng Chen, Qilong Feng, Jinjie Ni, Xin Li, Michael Qizhe Shieh

Long-horizon tasks requiring multi-step reasoning and dynamic re-planning remain challenging for large language models (LLMs). Sequential prompting methods are prone to context drift, loss of goal information, and recurrent failure cycles,…

Artificial Intelligence · Computer Science 2025-10-30 Zhenyu Zhang, Tianyi Chen, Weiran Xu, Alex Pentland, Jiaxin Pei

The efficient processing of long context poses a serious challenge for large language models (LLMs). Recently, retrieval-augmented generation (RAG) has emerged as a promising strategy for this problem, as it enables LLMs to make selective…

Computation and Language · Computer Science 2025-02-18 Kun Luo, Zheng Liu, Peitian Zhang, Hongjin Qian, Jun Zhao, Kang Liu

We introduce Lossless Context Management (LCM), a deterministic architecture for LLM memory that outperforms Claude Code on long-context tasks. When benchmarked using Opus 4.6, our LCM-augmented coding agent, Volt, achieves higher scores…

Artificial Intelligence · Computer Science 2026-05-07 Clint Ehrlich, Theodore Blackman

The context window of large language models (LLMs) has been extended significantly in recent years. However, while the context length that the LLM can process has grown, the capability of the model to accurately reason over that context…

Computation and Language · Computer Science 2024-10-07 Huayang Li, Pat Verga, Priyanka Sen, Bowen Yang, Vijay Viswanathan, Patrick Lewis, Taro Watanabe, Yixuan Su

Pretrained large language models (LLMs) are able to answer questions that are unlikely to have been encountered during training. However, a diversity of potential applications exists in the broad domain of reasoning systems and considerations…

Computation and Language · Computer Science 2024-11-27 Tim Hartill

Reasoning language models (RLMs), also known as Large Reasoning Models (LRMs), such as OpenAI's o1 and o3, DeepSeek-R1, and Alibaba's QwQ, have redefined AI's problem-solving capabilities by extending LLMs with advanced reasoning…

Recently, advanced large language models (LLMs) have emerged at an increasingly rapid pace. However, when faced with complex problems, most users are often unable to provide accurate and effective prompts to interact with LLMs, thus…

Computation and Language · Computer Science 2026-04-17 Wenjin Liu, Haoran Luo, Xueyuan Lin, Haoming Liu, Tiesunlong Shen, Jiapu Wang, Rui Mao, Erik Cambria