Related papers: Coding Agents are Effective Long-Context Processor…

An Effective Framework to Help Large Language Models Handle Numeric-involved Long-context Tasks

Large Language Models (LLMs) have demonstrated remarkable capabilities in handling long texts and have almost perfect performance in traditional retrieval tasks. However, their performance significantly degrades when it comes to numerical…

Computation and Language · Computer Science 2024-12-05 Yijiong Yu

Shifting Long-Context LLMs Research from Input to Output

Recent advancements in long-context Large Language Models (LLMs) have primarily concentrated on processing extended input contexts, resulting in significant strides in long-context comprehension. However, the equally critical aspect of…

Computation and Language · Computer Science 2025-03-10 Yuhao Wu , Yushi Bai , Zhiqing Hu , Shangqing Tu , Ming Shan Hee , Juanzi Li , Roy Ka-Wei Lee

Context-Aware or Context-Insensitive? Assessing LLMs' Performance in Document-Level Translation

Large language models (LLMs) are increasingly strong contenders in machine translation. In this work, we focus on document-level translation, where some words cannot be translated without context from outside the sentence. Specifically, we…

Computation and Language · Computer Science 2025-02-17 Wafaa Mohammed , Vlad Niculae

A Survey on Transformer Context Extension: Approaches and Evaluation

Large language models (LLMs) based on Transformer have been widely applied in the filed of natural language processing (NLP), demonstrating strong performance, particularly in handling short text tasks. However, when it comes to long…

Computation and Language · Computer Science 2025-07-09 Yijun Liu , Jinzheng Yu , Yang Xu , Zhongyang Li , Qingfu Zhu

Chain of Agents: Large Language Models Collaborating on Long-Context Tasks

Addressing the challenge of effectively processing long contexts has become a critical issue for Large Language Models (LLMs). Two common strategies have emerged: 1) reducing the input length, such as retrieving relevant chunks by…

Computation and Language · Computer Science 2024-06-06 Yusen Zhang , Ruoxi Sun , Yanfei Chen , Tomas Pfister , Rui Zhang , Sercan Ö. Arik

GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models

Long-context capabilities are essential for large language models (LLMs) to tackle complex and long-input tasks. Despite numerous efforts made to optimize LLMs for long contexts, challenges persist in robustly processing long inputs. In…

Computation and Language · Computer Science 2024-11-06 Shilong Li , Yancheng He , Hangyu Guo , Xingyuan Bu , Ge Bai , Jie Liu , Jiaheng Liu , Xingwei Qu , Yangguang Li , Wanli Ouyang , Wenbo Su , Bo Zheng

Walking Down the Memory Maze: Beyond Context Limit through Interactive Reading

Large language models (LLMs) have advanced in large strides due to the effectiveness of the self-attention mechanism that processes and compares all tokens at once. However, this mechanism comes with a fundamental issue -- the predetermined…

Computation and Language · Computer Science 2023-10-10 Howard Chen , Ramakanth Pasunuru , Jason Weston , Asli Celikyilmaz

Long Context Scaling: Divide and Conquer via Multi-Agent Question-driven Collaboration

Processing long contexts has become a critical capability for modern large language models (LLMs). Existing works leverage agent-based divide-and-conquer methods for processing long contexts. But these methods face crucial limitations,…

Computation and Language · Computer Science 2025-09-30 Sibo Xiao , Zixin Lin , Wenyang Gao , Hui Chen , Yue Zhang

Solving Context Window Overflow in AI Agents

Large Language Models (LLMs) have become increasingly capable of interacting with external tools, granting access to specialized knowledge beyond their training data - critical in dynamic, knowledge-intensive domains such as Chemistry and…

Artificial Intelligence · Computer Science 2025-12-01 Anton Bulle Labate , Valesca Moura de Sousa , Sandro Rama Fiorini , Leonardo Guerreiro Azevedo , Raphael Melo Thiago , Viviane Torres da Silva

Evaluating Long-Context Reasoning in LLM-Based WebAgents

As large language model (LLM)-based agents become increasingly integrated into daily digital interactions, their ability to reason across long interaction histories becomes crucial for providing personalized and contextually aware…

Machine Learning · Computer Science 2025-12-05 Andy Chung , Yichi Zhang , Kaixiang Lin , Aditya Rawal , Qiaozi Gao , Joyce Chai

Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration

With the rapid advancement of post-training techniques for reasoning and information seeking, large language models (LLMs) can incorporate a large quantity of retrieved knowledge to solve complex tasks. However, the limited context window…

Computation and Language · Computer Science 2026-04-21 Zijun Liu , Zhennan Wan , Peng Li , Ming Yan , Fei Huang , Yang Liu

The Limits of Long-Context Reasoning in Automated Bug Fixing

Rapidly increasing context lengths have led to the assumption that large language models (LLMs) can directly reason over entire codebases. Concurrently, recent advances in LLMs have enabled strong performance on software engineering…

Software Engineering · Computer Science 2026-03-09 Ravi Raju , Mengmeng Ji , Shubhangi Upasani , Bo Li , Urmish Thakker

Beyond the Limits: A Survey of Techniques to Extend the Context Length in Large Language Models

Recently, large language models (LLMs) have shown remarkable capabilities including understanding context, engaging in logical reasoning, and generating responses. However, this is achieved at the expense of stringent computational and…

Computation and Language · Computer Science 2024-05-30 Xindi Wang , Mahsa Salmani , Parsa Omidi , Xiangyu Ren , Mehdi Rezagholizadeh , Armaghan Eshaghi

Multilingual Contextualization of Large Language Models for Document-Level Machine Translation

Large language models (LLMs) have demonstrated strong performance in sentence-level machine translation, but scaling to document-level translation remains challenging, particularly in modeling long-range dependencies and discourse phenomena…

Computation and Language · Computer Science 2025-08-29 Miguel Moura Ramos , Patrick Fernandes , Sweta Agrawal , André F. T. Martins

Evaluating Large Language Models as Expert Annotators

Textual data annotation, the process of labeling or tagging text with relevant information, is typically costly, time-consuming, and labor-intensive. While large language models (LLMs) have demonstrated their potential as direct…

Computation and Language · Computer Science 2025-08-12 Yu-Min Tseng , Wei-Lin Chen , Chung-Chi Chen , Hsin-Hsi Chen

Scaling Long-Horizon LLM Agent via Context-Folding

Large language model (LLM) agents are fundamentally constrained by context length on long-horizon tasks. We introduce Context-Folding, a framework that empowers agents to actively manage their working context. An agent can procedurally…

Computation and Language · Computer Science 2025-10-15 Weiwei Sun , Miao Lu , Zhan Ling , Kang Liu , Xuesong Yao , Yiming Yang , Jiecao Chen

A Controlled Study on Long Context Extension and Generalization in LLMs

Broad textual understanding and in-context learning require language models that utilize full document contexts. Due to the implementation challenges associated with directly training long-context models, many methods have been proposed for…

Computation and Language · Computer Science 2024-09-24 Yi Lu , Jing Nathan Yan , Songlin Yang , Justin T. Chiu , Siyu Ren , Fei Yuan , Wenting Zhao , Zhiyong Wu , Alexander M. Rush

When Refusals Fail: Unstable Safety Mechanisms in Long-Context LLM Agents

Solving complex or long-horizon problems often requires large language models (LLMs) to use external tools and operate over a significantly longer context window. New LLMs enable longer context windows and support tool calling capabilities.…

Machine Learning · Computer Science 2025-12-03 Tsimur Hadeliya , Mohammad Ali Jauhar , Nidhi Sakpal , Diogo Cruz

Long-context Language Models Fail in Basic Retrieval Tasks Without Sufficient Reasoning Steps

Long-context language models (LCLMs), characterized by their extensive context window, are becoming popular. However, despite the fact that they are nearly perfect at standard long-context retrieval tasks, our evaluations demonstrate they…

Computation and Language · Computer Science 2025-08-27 Yijiong Yu , Yongfeng Huang , Zhixiao Qi , Wei Wang , Weifeng Liu , Ran Chen , Ji Pei

LongAgent: Scaling Language Models to 128k Context through Multi-Agent Collaboration

Large language models (LLMs) have demonstrated impressive performance in understanding language and executing complex reasoning tasks. However, LLMs with long context windows have been notorious for their expensive training costs and high…

Computation and Language · Computer Science 2024-03-14 Jun Zhao , Can Zu , Hao Xu , Yi Lu , Wei He , Yiwen Ding , Tao Gui , Qi Zhang , Xuanjing Huang