Related papers: Systematically Exploring Redundancy Reduction in S…

Survey on Multi-Document Summarization: Systematic Literature Review

In this era of information technology, abundant information is available on the internet in the form of web pages and documents on any given topic. Finding the most relevant and informative content out of these huge number of documents,…

Computers and Society · Computer Science 2023-12-21 Uswa Ihsan , Humaira Ashraf , NZ Jhanjhi

On the Trade-off between Redundancy and Local Coherence in Summarization

Extractive summaries are usually presented as lists of sentences with no expected cohesion between them and with plenty of redundant information if not accounted for. In this paper, we investigate the trade-offs incurred when aiming to…

Computation and Language · Computer Science 2024-06-07 Ronald Cardenas , Matthias Galle , Shay B. Cohen

A Divide-and-Conquer Approach to the Summarization of Long Documents

We present a novel divide-and-conquer method for the neural summarization of long documents. Our method exploits the discourse structure of the document and uses sentence similarity to split the problem into an ensemble of smaller…

Computation and Language · Computer Science 2020-09-24 Alexios Gidiotis , Grigorios Tsoumakas

An Empirical Survey on Long Document Summarization: Datasets, Models and Metrics

Long documents such as academic articles and business reports have been the standard format to detail out important issues and complicated subjects that require extra attention. An automatic summarization system that can effectively…

Computation and Language · Computer Science 2022-07-05 Huan Yee Koh , Jiaxin Ju , Ming Liu , Shirui Pan

Summary Refinement through Denoising

We propose a simple method for post-processing the outputs of a text summarization system in order to refine its overall quality. Our approach is to train text-to-text rewriting models to correct information redundancy errors that may arise…

Computation and Language · Computer Science 2019-07-26 Nikola I. Nikolov , Alessandro Calmanovici , Richard H. R. Hahnloser

How Far are We from Robust Long Abstractive Summarization?

Abstractive summarization has made tremendous progress in recent years. In this work, we perform fine-grained human annotations to evaluate long document abstractive summarization systems (i.e., models and metrics) with the aim of…

Computation and Language · Computer Science 2022-11-01 Huan Yee Koh , Jiaxin Ju , He Zhang , Ming Liu , Shirui Pan

On Generating Extended Summaries of Long Documents

Prior work in document summarization has mainly focused on generating short summaries of a document. While this type of summary helps get a high-level view of a given document, it is desirable in some cases to know more detailed information…

Computation and Language · Computer Science 2020-12-29 Sajad Sotudeh , Arman Cohan , Nazli Goharian

Improving the Robustness of Summarization Models by Detecting and Removing Input Noise

The evaluation of abstractive summarization models typically uses test data that is identically distributed as training data. In real-world practice, documents to be summarized may contain input noise caused by text extraction artifacts or…

Computation and Language · Computer Science 2023-12-05 Kundan Krishna , Yao Zhao , Jie Ren , Balaji Lakshminarayanan , Jiaming Luo , Mohammad Saleh , Peter J. Liu

Towards Personalized and Human-in-the-Loop Document Summarization

The ubiquitous availability of computing devices and the widespread use of the internet have generated a large amount of data continuously. Therefore, the amount of available information on any given topic is far beyond humans' processing…

Artificial Intelligence · Computer Science 2023-07-11 Samira Ghodratnama

Long Text and Multi-Table Summarization: Dataset and Method

Automatic document summarization aims to produce a concise summary covering the input document's salient information. Within a report document, the salient information can be scattered in the textual and non-textual content. However,…

Computation and Language · Computer Science 2023-02-09 Shuaiqi Liu , Jiannong Cao , Ruosong Yang , Zhiyuan Wen

RefSum: Refactoring Neural Summarization

Although some recent works show potential complementarity among different state-of-the-art systems, few works try to investigate this problem in text summarization. Researchers in other areas commonly refer to the techniques of reranking or…

Computation and Language · Computer Science 2021-04-16 Yixin Liu , Zi-Yi Dou , Pengfei Liu

AREDSUM: Adaptive Redundancy-Aware Iterative Sentence Ranking for Extractive Document Summarization

Redundancy-aware extractive summarization systems score the redundancy of the sentences to be included in a summary either jointly with their salience information or separately as an additional sentence scoring step. Previous work shows the…

Computation and Language · Computer Science 2021-04-06 Keping Bi , Rahul Jha , W. Bruce Croft , Asli Celikyilmaz

Markov-Enhanced Clustering for Long Document Summarization: Tackling the 'Lost in the Middle' Challenge with Large Language Models

The rapid expansion of information from diverse sources has heightened the need for effective automatic text summarization, which condenses documents into shorter, coherent texts. Summarization methods generally fall into two categories:…

Computation and Language · Computer Science 2025-06-24 Aziz Amari , Mohamed Achref Ben Ammar

Improving Abstraction in Text Summarization

Abstractive text summarization aims to shorten long text documents into a human readable form that contains the most important facts from the original document. However, the level of actual abstraction as measured by novel phrases that do…

Computation and Language · Computer Science 2018-08-27 Wojciech Kryściński , Romain Paulus , Caiming Xiong , Richard Socher

A Supervised Approach to Extractive Summarisation of Scientific Papers

Automatic summarisation is a popular approach to reduce a document to its main arguments. Recent research in the area has focused on neural approaches to summarisation, which can be very data-hungry. However, few large datasets exist and…

Computation and Language · Computer Science 2017-06-14 Ed Collins , Isabelle Augenstein , Sebastian Riedel

MRGSEM-Sum: An Unsupervised Multi-document Summarization Framework based on Multi-Relational Graphs and Structural Entropy Minimization

The core challenge faced by multi-document summarization is the complexity of relationships among documents and the presence of information redundancy. Graph clustering is an effective paradigm for addressing this issue, as it models the…

Computation and Language · Computer Science 2025-08-01 Yongbing Zhang , Fang Nan , Shengxiang Gao , Yuxin Huang , Kaiwen Tan , Zhengtao Yu

The Influence of Data Pre-processing and Post-processing on Long Document Summarization

Long document summarization is an important and hard task in the field of natural language processing. A good performance of the long document summarization reveals the model has a decent understanding of the human language. Currently, most…

Computation and Language · Computer Science 2021-12-06 Xinwei Du , Kailun Dong , Yuchen Zhang , Yongsheng Li , Ruei-Yu Tsay

Neural Text Summarization: A Critical Evaluation

Text summarization aims at compressing long documents into a shorter form that conveys the most important parts of the original document. Despite increased interest in the community and notable research effort, progress on benchmark…

Computation and Language · Computer Science 2019-08-27 Wojciech Kryściński , Nitish Shirish Keskar , Bryan McCann , Caiming Xiong , Richard Socher

Scaling Multi-Document Event Summarization: Evaluating Compression vs. Full-Text Approaches

Automatically summarizing large text collections is a valuable tool for document research, with applications in journalism, academic research, legal work, and many other fields. In this work, we contrast two classes of systems for…

Computation and Language · Computer Science 2025-02-11 Adithya Pratapa , Teruko Mitamura

Read Top News First: A Document Reordering Approach for Multi-Document News Summarization

A common method for extractive multi-document news summarization is to re-formulate it as a single-document summarization problem by concatenating all documents as a single meta-document. However, this method neglects the relative…

Computation and Language · Computer Science 2022-03-22 Chao Zhao , Tenghao Huang , Somnath Basu Roy Chowdhury , Muthu Kumar Chandrasekaran , Kathleen McKeown , Snigdha Chaturvedi