English
Related papers

Related papers: SEGMENT+: Long Text Processing with Short-Context …

200 papers

Transformer-based Large Language Models (LLMs) often impose limitations on the length of the text input to ensure the generation of fluent and relevant responses. This constraint restricts their applicability in scenarios involving long…

Computation and Language · Computer Science 2023-12-18 Weizhi Fei , Xueyan Niu , Pingyi Zhou , Lu Hou , Bo Bai , Lei Deng , Wei Han

Large Language Models (LLMs) have demonstrated remarkable capabilities in comprehending and analyzing lengthy sequential inputs, owing to their extensive context windows that allow processing millions of tokens in a single forward pass.…

Computation and Language · Computer Science 2024-12-23 Peyman Hosseini , Ignacio Castro , Iacopo Ghinassi , Matthew Purver

Large language models (LLMs) achieved remarkable performance across various tasks. However, they face challenges in managing long documents and extended conversations, due to significantly increased computational requirements, both in…

Computation and Language · Computer Science 2023-10-11 Yucheng Li , Bo Dong , Chenghua Lin , Frank Guerin

Large language models (LLMs) have demonstrated strong performance in sentence-level machine translation, but scaling to document-level translation remains challenging, particularly in modeling long-range dependencies and discourse phenomena…

Computation and Language · Computer Science 2025-08-29 Miguel Moura Ramos , Patrick Fernandes , Sweta Agrawal , André F. T. Martins

Recently, large language models (LLMs) have shown remarkable capabilities including understanding context, engaging in logical reasoning, and generating responses. However, this is achieved at the expense of stringent computational and…

Computation and Language · Computer Science 2024-05-30 Xindi Wang , Mahsa Salmani , Parsa Omidi , Xiangyu Ren , Mehdi Rezagholizadeh , Armaghan Eshaghi

Large language models (LLMs) have revolutionized AI, but are constrained by limited context windows, hindering their utility in tasks like extended conversations and document analysis. To enable using context beyond limited context windows,…

Artificial Intelligence · Computer Science 2024-02-13 Charles Packer , Sarah Wooders , Kevin Lin , Vivian Fang , Shishir G. Patil , Ion Stoica , Joseph E. Gonzalez

Large language models (LLMs) have received significant attention by achieving remarkable performance across various tasks. However, their fixed context length poses challenges when processing long documents or maintaining extended…

Computation and Language · Computer Science 2023-04-25 Yucheng Li

Large Multimodal Models (LMMs) have demonstrated impressive performance in short video understanding tasks but face great challenges when applied to long video understanding. In contrast, Large Language Models (LLMs) exhibit outstanding…

Computer Vision and Pattern Recognition · Computer Science 2024-10-03 Hongchen Wei , Zhenzhong Chen

Long context understanding remains challenging for large language models due to their limited context windows. This paper introduces Long Input Fine-Tuning (LIFT), a novel framework for long-context modeling that can enhance the…

Computation and Language · Computer Science 2026-04-14 Yansheng Mao , Yufei Xu , Jiaqi Li , Fanxu Meng , Haotong Yang , Zilong Zheng , Xiyuan Wang , Muhan Zhang

Large language models (LLMs) have advanced in large strides due to the effectiveness of the self-attention mechanism that processes and compares all tokens at once. However, this mechanism comes with a fundamental issue -- the predetermined…

Computation and Language · Computer Science 2023-10-10 Howard Chen , Ramakanth Pasunuru , Jason Weston , Asli Celikyilmaz

Document-level context for neural machine translation (NMT) is crucial to improve the translation consistency and cohesion, the translation of ambiguous inputs, as well as several other linguistic phenomena. Many works have been published…

Computation and Language · Computer Science 2023-06-09 Christian Herold , Hermann Ney

Large Language Models (LLMs) have been widely used as general-purpose AI agents showing comparable performance on many downstream tasks. However, existing work shows that it is challenging for LLMs to integrate structured data (e.g. KG,…

Computation and Language · Computer Science 2024-02-23 Younghun Lee , Sungchul Kim , Tong Yu , Ryan A. Rossi , Xiang Chen

The rapid increase in unstructured data across various fields has made multi-document comprehension and summarization a critical task. Traditional approaches often fail to capture relevant context, maintain logical consistency, and extract…

Computation and Language · Computer Science 2024-09-30 Aditi Godbole , Jabin Geevarghese George , Smita Shandilya

In recent years, Large Language Models (LLMs) have demonstrated significant improvements across a variety of tasks, one of which is the long-context capability. The key to improving long-context performance lies in effective data…

Computation and Language · Computer Science 2024-10-03 Keer Lu , Xiaonan Nie , Zheng Liang , Da Pan , Shusen Zhang , Keshi Zhao , Weipeng Chen , Zenan Zhou , Guosheng Dong , Bin Cui , Wentao Zhang

Enlarging the context window of large language models (LLMs) has become a crucial research area, particularly for applications involving extremely long texts. In this work, we propose a novel training-free framework for processing long…

Computation and Language · Computer Science 2024-10-15 Zihan Zhou , Chong Li , Xinyi Chen , Shuo Wang , Yu Chao , Zhili Li , Haoyu Wang , Rongqiao An , Qi Shi , Zhixing Tan , Xu Han , Xiaodong Shi , Zhiyuan Liu , Maosong Sun

The development of Long-Context Large Language Models (LLMs) has markedly advanced natural language processing by facilitating the process of textual data across long documents and multiple corpora. However, Long-Context LLMs still face two…

Computation and Language · Computer Science 2024-10-10 Jingyang Deng , Zhengyang Shen , Boyang Wang , Lixin Su , Suqi Cheng , Ying Nie , Junfeng Wang , Dawei Yin , Jinwen Ma

Long-context large language models (LC LLMs) promise to increase reliability of LLMs in real-world tasks requiring processing and understanding of long input documents. However, this ability of LC LLMs to reliably utilize their growing…

Computation and Language · Computer Science 2024-12-23 Lavanya Gupta , Saket Sharma , Yiyun Zhao

Large Language Models (LLMs) have demonstrated remarkable capabilities in handling long texts and have almost perfect performance in traditional retrieval tasks. However, their performance significantly degrades when it comes to numerical…

Computation and Language · Computer Science 2024-12-05 Yijiong Yu

Existing large language models (LLMs) can only afford fix-sized inputs due to the input length limit, preventing them from utilizing rich long-context information from past inputs. To address this, we propose a framework, Language Models…

Computation and Language · Computer Science 2023-06-13 Weizhi Wang , Li Dong , Hao Cheng , Xiaodong Liu , Xifeng Yan , Jianfeng Gao , Furu Wei

The limited context window of contemporary large language models (LLMs) remains a primary bottleneck for their broader application across diverse domains. Although continual pre-training on long-context data offers a straightforward…

Computation and Language · Computer Science 2026-04-10 Wei Han , Pan Zhou , Soujanya Poria , Shuicheng Yan
‹ Prev 1 2 3 10 Next ›