English
Related papers

Related papers: Coding Agents are Effective Long-Context Processor…

200 papers

Large Language Models (LLMs) have demonstrated remarkable capabilities in handling long texts and have almost perfect performance in traditional retrieval tasks. However, their performance significantly degrades when it comes to numerical…

Computation and Language · Computer Science 2024-12-05 Yijiong Yu

Recent advancements in long-context Large Language Models (LLMs) have primarily concentrated on processing extended input contexts, resulting in significant strides in long-context comprehension. However, the equally critical aspect of…

Computation and Language · Computer Science 2025-03-10 Yuhao Wu , Yushi Bai , Zhiqing Hu , Shangqing Tu , Ming Shan Hee , Juanzi Li , Roy Ka-Wei Lee

Large language models (LLMs) are increasingly strong contenders in machine translation. In this work, we focus on document-level translation, where some words cannot be translated without context from outside the sentence. Specifically, we…

Computation and Language · Computer Science 2025-02-17 Wafaa Mohammed , Vlad Niculae

Large language models (LLMs) based on Transformer have been widely applied in the filed of natural language processing (NLP), demonstrating strong performance, particularly in handling short text tasks. However, when it comes to long…

Computation and Language · Computer Science 2025-07-09 Yijun Liu , Jinzheng Yu , Yang Xu , Zhongyang Li , Qingfu Zhu

Addressing the challenge of effectively processing long contexts has become a critical issue for Large Language Models (LLMs). Two common strategies have emerged: 1) reducing the input length, such as retrieving relevant chunks by…

Computation and Language · Computer Science 2024-06-06 Yusen Zhang , Ruoxi Sun , Yanfei Chen , Tomas Pfister , Rui Zhang , Sercan Ö. Arik

Long-context capabilities are essential for large language models (LLMs) to tackle complex and long-input tasks. Despite numerous efforts made to optimize LLMs for long contexts, challenges persist in robustly processing long inputs. In…

Computation and Language · Computer Science 2024-11-06 Shilong Li , Yancheng He , Hangyu Guo , Xingyuan Bu , Ge Bai , Jie Liu , Jiaheng Liu , Xingwei Qu , Yangguang Li , Wanli Ouyang , Wenbo Su , Bo Zheng

Large language models (LLMs) have advanced in large strides due to the effectiveness of the self-attention mechanism that processes and compares all tokens at once. However, this mechanism comes with a fundamental issue -- the predetermined…

Computation and Language · Computer Science 2023-10-10 Howard Chen , Ramakanth Pasunuru , Jason Weston , Asli Celikyilmaz

Processing long contexts has become a critical capability for modern large language models (LLMs). Existing works leverage agent-based divide-and-conquer methods for processing long contexts. But these methods face crucial limitations,…

Computation and Language · Computer Science 2025-09-30 Sibo Xiao , Zixin Lin , Wenyang Gao , Hui Chen , Yue Zhang

Large Language Models (LLMs) have become increasingly capable of interacting with external tools, granting access to specialized knowledge beyond their training data - critical in dynamic, knowledge-intensive domains such as Chemistry and…

As large language model (LLM)-based agents become increasingly integrated into daily digital interactions, their ability to reason across long interaction histories becomes crucial for providing personalized and contextually aware…

Machine Learning · Computer Science 2025-12-05 Andy Chung , Yichi Zhang , Kaixiang Lin , Aditya Rawal , Qiaozi Gao , Joyce Chai

With the rapid advancement of post-training techniques for reasoning and information seeking, large language models (LLMs) can incorporate a large quantity of retrieved knowledge to solve complex tasks. However, the limited context window…

Computation and Language · Computer Science 2026-04-21 Zijun Liu , Zhennan Wan , Peng Li , Ming Yan , Fei Huang , Yang Liu

Rapidly increasing context lengths have led to the assumption that large language models (LLMs) can directly reason over entire codebases. Concurrently, recent advances in LLMs have enabled strong performance on software engineering…

Software Engineering · Computer Science 2026-03-09 Ravi Raju , Mengmeng Ji , Shubhangi Upasani , Bo Li , Urmish Thakker

Recently, large language models (LLMs) have shown remarkable capabilities including understanding context, engaging in logical reasoning, and generating responses. However, this is achieved at the expense of stringent computational and…

Computation and Language · Computer Science 2024-05-30 Xindi Wang , Mahsa Salmani , Parsa Omidi , Xiangyu Ren , Mehdi Rezagholizadeh , Armaghan Eshaghi

Large language models (LLMs) have demonstrated strong performance in sentence-level machine translation, but scaling to document-level translation remains challenging, particularly in modeling long-range dependencies and discourse phenomena…

Computation and Language · Computer Science 2025-08-29 Miguel Moura Ramos , Patrick Fernandes , Sweta Agrawal , André F. T. Martins

Textual data annotation, the process of labeling or tagging text with relevant information, is typically costly, time-consuming, and labor-intensive. While large language models (LLMs) have demonstrated their potential as direct…

Computation and Language · Computer Science 2025-08-12 Yu-Min Tseng , Wei-Lin Chen , Chung-Chi Chen , Hsin-Hsi Chen

Large language model (LLM) agents are fundamentally constrained by context length on long-horizon tasks. We introduce Context-Folding, a framework that empowers agents to actively manage their working context. An agent can procedurally…

Computation and Language · Computer Science 2025-10-15 Weiwei Sun , Miao Lu , Zhan Ling , Kang Liu , Xuesong Yao , Yiming Yang , Jiecao Chen

Broad textual understanding and in-context learning require language models that utilize full document contexts. Due to the implementation challenges associated with directly training long-context models, many methods have been proposed for…

Computation and Language · Computer Science 2024-09-24 Yi Lu , Jing Nathan Yan , Songlin Yang , Justin T. Chiu , Siyu Ren , Fei Yuan , Wenting Zhao , Zhiyong Wu , Alexander M. Rush

Solving complex or long-horizon problems often requires large language models (LLMs) to use external tools and operate over a significantly longer context window. New LLMs enable longer context windows and support tool calling capabilities.…

Machine Learning · Computer Science 2025-12-03 Tsimur Hadeliya , Mohammad Ali Jauhar , Nidhi Sakpal , Diogo Cruz

Long-context language models (LCLMs), characterized by their extensive context window, are becoming popular. However, despite the fact that they are nearly perfect at standard long-context retrieval tasks, our evaluations demonstrate they…

Computation and Language · Computer Science 2025-08-27 Yijiong Yu , Yongfeng Huang , Zhixiao Qi , Wei Wang , Weifeng Liu , Ran Chen , Ji Pei

Large language models (LLMs) have demonstrated impressive performance in understanding language and executing complex reasoning tasks. However, LLMs with long context windows have been notorious for their expensive training costs and high…

Computation and Language · Computer Science 2024-03-14 Jun Zhao , Can Zu , Hao Xu , Yi Lu , Wei He , Yiwen Ding , Tao Gui , Qi Zhang , Xuanjing Huang
‹ Prev 1 2 3 10 Next ›