SEGMENT+: Long Text Processing with Short-Context Language Models

Wei Shi; Shuang Li; Kerun Yu; Jinglei Chen; Zujie Liang; Xinhui Wu; Yuxi Qian; Feng Wei; Bo Zheng; Jiaqing Liang; Jiangjie Chen; Yanghua Xiao

SEGMENT+: Long Text Processing with Short-Context Language Models

Computation and Language 2024-10-10 v1

Authors: Wei Shi , Shuang Li , Kerun Yu , Jinglei Chen , Zujie Liang , Xinhui Wu , Yuxi Qian , Feng Wei , Bo Zheng , Jiaqing Liang , Jiangjie Chen , Yanghua Xiao

View on arXiv ↗ PDF ↗

Abstract

There is a growing interest in expanding the input capacity of language models (LMs) across various domains. However, simply increasing the context window does not guarantee robust performance across diverse long-input processing tasks, such as understanding extensive documents and extracting detailed information from lengthy and noisy data. In response, we introduce SEGMENT+, a general framework that enables LMs to handle extended inputs within limited context windows efficiently. SEGMENT+ utilizes structured notes and a filtering module to manage information flow, resulting in a system that is both controllable and interpretable. Our extensive experiments across various model sizes, focusing on long-document question-answering and Needle-in-a-Haystack tasks, demonstrate the effectiveness of SEGMENT+ in improving performance.

Keywords

long-context modeling in-context learning natural language parsing

Cite

@article{arxiv.2410.06519,
  title  = {SEGMENT+: Long Text Processing with Short-Context Language Models},
  author = {Wei Shi and Shuang Li and Kerun Yu and Jinglei Chen and Zujie Liang and Xinhui Wu and Yuxi Qian and Feng Wei and Bo Zheng and Jiaqing Liang and Jiangjie Chen and Yanghua Xiao},
  journal= {arXiv preprint arXiv:2410.06519},
  year   = {2024}
}

Comments

EMNLP 2024

SEGMENT+: Long Text Processing with Short-Context Language Models

Abstract

Keywords

Cite

Comments

Related papers