Related papers: Document Summarization with Text Segmentation

Toward Unifying Text Segmentation and Long Document Summarization

Text segmentation is important for signaling a document's structure. Without segmenting a long document into topically coherent sections, it is difficult for readers to comprehend the text, let alone find important information. The problem…

Computation and Language · Computer Science 2022-11-01 Sangwoo Cho , Kaiqiang Song , Xiaoyang Wang , Fei Liu , Dong Yu

Enhancing Extractive Text Summarization with Topic-Aware Graph Neural Networks

Text summarization aims to compress a textual document to a short summary while keeping salient information. Extractive approaches are widely used in text summarization because of their fluency and efficiency. However, most of existing…

Computation and Language · Computer Science 2020-10-14 Peng Cui , Le Hu , Yuanchao Liu

Automatic Text Document Summarization using Semantic-based Analysis

Since the advent of the web, the amount of data on wen has been increased several million folds. In recent years web data generated is more than data stored for years. One important data format is text. To answer user queries over the…

Information Retrieval · Computer Science 2018-11-19 Chandra Shekhar Yadav

Countering the Effects of Lead Bias in News Summarization via Multi-Stage Training and Auxiliary Losses

Sentence position is a strong feature for news summarization, since the lead often (but not always) summarizes the key points of the article. In this paper, we show that recent neural systems excessively exploit this trend, which although…

Computation and Language · Computer Science 2019-09-11 Matt Grenander , Yue Dong , Jackie Chi Kit Cheung , Annie Louis

Structured Summarization: Unified Text Segmentation and Segment Labeling as a Generation Task

Text segmentation aims to divide text into contiguous, semantically coherent segments, while segment labeling deals with producing labels for each segment. Past work has shown success in tackling segmentation and labeling for documents and…

Computation and Language · Computer Science 2022-09-29 Hakan Inan , Rashi Rungta , Yashar Mehdad

Modeling, comprehending and summarizing textual content by graphs

Automatic Text Summarization strategies have been successfully employed to digest text collections and extract its essential content. Usually, summaries are generated using textual corpora that belongs to the same domain area where the…

Computation and Language · Computer Science 2018-07-03 Vinicius Woloszyn , Guilherme Medeiros Machado , Leandro Krug Wives , José Palazzo Moreira de Oliveira

Extractive Multi-document Summarization Using Multilayer Networks

Huge volumes of textual information has been produced every single day. In order to organize and understand such large datasets, in recent years, summarization techniques have become popular. These techniques aims at finding relevant,…

Computation and Language · Computer Science 2018-03-26 Jorge V. Tohalino , Diego R. Amancio

A Divide-and-Conquer Approach to the Summarization of Long Documents

We present a novel divide-and-conquer method for the neural summarization of long documents. Our method exploits the discourse structure of the document and uses sentence similarity to split the problem into an ensemble of smaller…

Computation and Language · Computer Science 2020-09-24 Alexios Gidiotis , Grigorios Tsoumakas

Distraction-Based Neural Networks for Document Summarization

Distributed representation learned with neural networks has recently shown to be effective in modeling natural languages at fine granularities such as words, phrases, and even sentences. Whether and how such an approach can be extended to…

Computation and Language · Computer Science 2016-10-27 Qian Chen , Xiaodan Zhu , Zhenhua Ling , Si Wei , Hui Jiang

Leveraging Word Embeddings for Spoken Document Summarization

Owing to the rapidly growing multimedia content available on the Internet, extractive spoken document summarization, with the purpose of automatically selecting a set of representative sentences from a spoken document to concisely express…

Computation and Language · Computer Science 2015-06-16 Kuan-Yu Chen , Shih-Hung Liu , Hsin-Min Wang , Berlin Chen , Hsin-Hsi Chen

Text Segmentation as a Supervised Learning Task

Text segmentation, the task of dividing a document into contiguous segments based on its semantic structure, is a longstanding challenge in language understanding. Previous work on text segmentation focused on unsupervised methods such as…

Computation and Language · Computer Science 2018-03-28 Omri Koshorek , Adir Cohen , Noam Mor , Michael Rotman , Jonathan Berant

At Which Level Should We Extract? An Empirical Analysis on Extractive Document Summarization

Extractive methods have been proven effective in automatic document summarization. Previous works perform this task by identifying informative contents at sentence level. However, it is unclear whether performing extraction at sentence…

Computation and Language · Computer Science 2020-10-27 Qingyu Zhou , Furu Wei , Ming Zhou

Markov-Enhanced Clustering for Long Document Summarization: Tackling the 'Lost in the Middle' Challenge with Large Language Models

The rapid expansion of information from diverse sources has heightened the need for effective automatic text summarization, which condenses documents into shorter, coherent texts. Summarization methods generally fall into two categories:…

Computation and Language · Computer Science 2025-06-24 Aziz Amari , Mohamed Achref Ben Ammar

SumHiS: Extractive Summarization Exploiting Hidden Structure

Extractive summarization is a task of highlighting the most important parts of the text. We introduce a new approach to extractive summarization task using hidden clustering structure of the text. Experimental results on CNN/DailyMail…

Computation and Language · Computer Science 2024-06-13 Tikhonov Pavel , Anastasiya Ianina , Valentin Malykh

On Generating Extended Summaries of Long Documents

Prior work in document summarization has mainly focused on generating short summaries of a document. While this type of summary helps get a high-level view of a given document, it is desirable in some cases to know more detailed information…

Computation and Language · Computer Science 2020-12-29 Sajad Sotudeh , Arman Cohan , Nazli Goharian

Topic Modeling Based Extractive Text Summarization

Text summarization is an approach for identifying important information present within text documents. This computational technique aims to generate shorter versions of the source text, by including only the relevant and salient information…

Computation and Language · Computer Science 2021-06-30 Kalliath Abdul Rasheed Issam , Shivam Patel , Subalalitha C. N

Automated Text Summarization Base on Lexicales Chain and graph Using of WordNet and Wikipedia Knowledge Base

The technology of automatic document summarization is maturing and may provide a solution to the information overload problem. Nowadays, document summarization plays an important role in information retrieval. With a large volume of…

Information Retrieval · Computer Science 2012-04-10 Mohsen Pourvali , Mohammad Saniee Abadeh

Sentence Embeddings as an intermediate target in end-to-end summarisation

Current neural network-based methods to the problem of document summarisation struggle when applied to datasets containing large inputs. In this paper we propose a new approach to the challenge of content-selection when dealing with…

Computation and Language · Computer Science 2025-05-07 Maciej Zembrzuski , Saad Mahamood

Sequence-Based Extractive Summarisation for Scientific Articles

This paper presents the results of research on supervised extractive text summarisation for scientific articles. We show that a simple sequential tagging model based only on the text within a document achieves high results against a simple…

Computation and Language · Computer Science 2022-04-08 Daniel Kershaw , Rob Koeling

Text segmentation on multilabel documents: A distant-supervised approach

Segmenting text into semantically coherent segments is an important task with applications in information retrieval and text summarization. Developing accurate topical segmentation requires the availability of training data with ground…

Computation and Language · Computer Science 2019-04-16 Saurav Manchanda , George Karypis