English
Related papers

Related papers: LLM$\times$MapReduce: Simplified Long-Sequence Pro…

200 papers

We investigate the challenge of applying Large Language Models (LLMs) to long texts. We propose a theoretical framework that distinguishes the failure modes of long context tasks into three categories: cross-chunk dependence (task noise),…

Computation and Language · Computer Science 2026-03-03 Zhen Xu , Shang Zhu , Jue Wang , Junlin Wang , Ben Athiwaratkun , Chi Wang , James Zou , Ce Zhang

Large Language Models (LLMs), constrained by limited context windows, often face significant performance degradation when reasoning over long contexts. To address this, Retrieval-Augmented Generation (RAG) retrieves and reasons over chunks…

Computation and Language · Computer Science 2025-11-04 Jiani Guo , Zuchao Li , Jie Wu , Qianren Wang , Yun Li , Lefei Zhang , Hai Zhao , Yujiu Yang

Long-form generation is crucial for a wide range of practical applications, typically categorized into short-to-long and long-to-long generation. While short-to-long generations have received considerable attention, generating long texts…

Computation and Language · Computer Science 2025-04-16 Haoyu Wang , Yujia Fu , Zhu Zhang , Shuo Wang , Zirui Ren , Xiaorong Wang , Zhili Li , Chaoqun He , Bo An , Zhiyuan Liu , Maosong Sun

Processing long contexts has become a critical capability for modern large language models (LLMs). Existing works leverage agent-based divide-and-conquer methods for processing long contexts. But these methods face crucial limitations,…

Computation and Language · Computer Science 2025-09-30 Sibo Xiao , Zixin Lin , Wenyang Gao , Hui Chen , Yue Zhang

Long-term time-series forecasting is essential for planning and decision-making in economics, energy, and transportation, where long foresight is required. To obtain such long foresight, models must be both efficient and effective in…

Machine Learning · Computer Science 2025-09-05 Chao Ma , Yikai Hou , Xiang Li , Yinggang Sun , Haining Yu , Zhou Fang , Jiaxing Qu

Large language models (LLMs) face significant challenges in handling long-context tasks because of their limited effective context window size during pretraining, which restricts their ability to generalize over extended sequences.…

Computation and Language · Computer Science 2024-09-05 Zhiyuan Hu , Yuliang Liu , Jinman Zhao , Suyuchen Wang , Yan Wang , Wei Shen , Qing Gu , Anh Tuan Luu , See-Kiong Ng , Zhiwei Jiang , Bryan Hooi

Large Language Models (LLMs) have demonstrated remarkable capabilities in handling long texts and have almost perfect performance in traditional retrieval tasks. However, their performance significantly degrades when it comes to numerical…

Computation and Language · Computer Science 2024-12-05 Yijiong Yu

Large Language Models (LLMs) have been widely used as general-purpose AI agents showing comparable performance on many downstream tasks. However, existing work shows that it is challenging for LLMs to integrate structured data (e.g. KG,…

Computation and Language · Computer Science 2024-02-23 Younghun Lee , Sungchul Kim , Tong Yu , Ryan A. Rossi , Xiang Chen

Long document summarization poses a significant challenge in natural language processing due to input lengths that exceed the capacity of most state-of-the-art pre-trained language models. This study proposes a hierarchical framework that…

Computation and Language · Computer Science 2024-10-10 Yuan-Jhe Yin , Bo-Yu Chen , Berlin Chen

Large language models (LLMs) often struggle to accurately read and comprehend extremely long texts. Current methods for improvement typically rely on splitting long contexts into fixed-length chunks. However, fixed truncation risks…

Computation and Language · Computer Science 2025-06-04 Boheng Sheng , Jiacheng Yao , Meicong Zhang , Guoxiu He

Large language models (LLMs) have been a disruptive innovation in recent years, and they play a crucial role in our daily lives due to their ability to understand and generate human-like text. Their capabilities include natural language…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-10-17 Akrit Mudvari , Yuang Jiang , Leandros Tassiulas

The development of Long-Context Large Language Models (LLMs) has markedly advanced natural language processing by facilitating the process of textual data across long documents and multiple corpora. However, Long-Context LLMs still face two…

Computation and Language · Computer Science 2024-10-10 Jingyang Deng , Zhengyang Shen , Boyang Wang , Lixin Su , Suqi Cheng , Ying Nie , Junfeng Wang , Dawei Yin , Jinwen Ma

Recently, large language models (LLMs) have shown remarkable capabilities including understanding context, engaging in logical reasoning, and generating responses. However, this is achieved at the expense of stringent computational and…

Computation and Language · Computer Science 2024-05-30 Xindi Wang , Mahsa Salmani , Parsa Omidi , Xiangyu Ren , Mehdi Rezagholizadeh , Armaghan Eshaghi

The limited context window of contemporary large language models (LLMs) remains a primary bottleneck for their broader application across diverse domains. Although continual pre-training on long-context data offers a straightforward…

Computation and Language · Computer Science 2026-04-10 Wei Han , Pan Zhou , Soujanya Poria , Shuicheng Yan

Large language models (LLMs) based on Transformer have been widely applied in the filed of natural language processing (NLP), demonstrating strong performance, particularly in handling short text tasks. However, when it comes to long…

Computation and Language · Computer Science 2025-07-09 Yijun Liu , Jinzheng Yu , Yang Xu , Zhongyang Li , Qingfu Zhu

The rapid increase in unstructured data across various fields has made multi-document comprehension and summarization a critical task. Traditional approaches often fail to capture relevant context, maintain logical consistency, and extract…

Computation and Language · Computer Science 2024-09-30 Aditi Godbole , Jabin Geevarghese George , Smita Shandilya

Large language models (LLMs) have demonstrated strong performance in sentence-level machine translation, but scaling to document-level translation remains challenging, particularly in modeling long-range dependencies and discourse phenomena…

Computation and Language · Computer Science 2025-08-29 Miguel Moura Ramos , Patrick Fernandes , Sweta Agrawal , André F. T. Martins

While numerous architectures for long-range language models (LRLMs) have recently been proposed, a meaningful evaluation of their discourse-level language understanding capabilities has not yet followed. To this end, we introduce…

Computation and Language · Computer Science 2022-04-26 Simeng Sun , Katherine Thai , Mohit Iyyer

The advent of Large Language Models (LLMs) has provided unprecedented capabilities for analyzing unstructured text data. However, deploying these models as reliable, robust, and scalable classifiers in production environments presents…

Computation and Language · Computer Science 2025-08-25 Doohee You , Andy Parisi , Zach Vander Velden , Lara Dantas Inojosa

Large language models (LLMs) have been increasingly used to analyze text. However, they are often plagued with contextual reasoning limitations when analyzing long documents. When long documents are processed sequentially, early or dominant…

Computation and Language · Computer Science 2026-05-21 Aisvarya Adeseye , Jouni Isoaho , Adeyemi Adeseye
‹ Prev 1 2 3 10 Next ›