Related papers: Document-aware Positional Encoding and Linguistic-…

A Hierarchical Encoding-Decoding Scheme for Abstractive Multi-document Summarization

Pre-trained language models (PLMs) have achieved outstanding achievements in abstractive single-document summarization (SDS). However, such benefits may not fully extend to multi-document summarization (MDS), where the handling of…

Computation and Language · Computer Science 2023-11-02 Chenhui Shen , Liying Cheng , Xuan-Phi Nguyen , Yang You , Lidong Bing

Absformer: Transformer-based Model for Unsupervised Multi-Document Abstractive Summarization

Multi-document summarization (MDS) refers to the task of summarizing the text in multiple documents into a concise summary. The generated summary can save the time of reading many documents by providing the important content in the form of…

Computation and Language · Computer Science 2023-06-09 Mohamed Trabelsi , Huseyin Uzunalioglu

Topic-Guided Abstractive Multi-Document Summarization

A critical point of multi-document summarization (MDS) is to learn the relations among various documents. In this paper, we propose a novel abstractive MDS model, in which we represent multiple documents as a heterogeneous graph, taking…

Computation and Language · Computer Science 2021-10-22 Peng Cui , Le Hu

Leveraging Graph to Improve Abstractive Multi-Document Summarization

Graphs that capture relations between textual units have great benefits for detecting salient information from multiple documents and generating overall coherent summaries. In this paper, we develop a neural abstractive multi-document…

Computation and Language · Computer Science 2020-05-21 Wei Li , Xinyan Xiao , Jiachen Liu , Hua Wu , Haifeng Wang , Junping Du

Rethinking Transformer-based Multi-document Summarization: An Empirical Investigation

The utilization of Transformer-based models prospers the growth of multi-document summarization (MDS). Given the huge impact and widespread adoption of Transformer-based models in various natural language processing tasks, investigating…

Computation and Language · Computer Science 2024-07-17 Congbo Ma , Wei Emma Zhang , Dileepa Pitawela , Haojie Zhuang , Yanfeng Shu

Hierarchical Transformers for Multi-Document Summarization

In this paper, we develop a neural summarization model which can effectively process multiple input documents and distill Transformer architecture with the ability to encode documents in a hierarchical manner. We represent cross-document…

Computation and Language · Computer Science 2019-05-31 Yang Liu , Mirella Lapata

Learning Syntactic and Dynamic Selective Encoding for Document Summarization

Text summarization aims to generate a headline or a short summary consisting of the major information of the source text. Recent studies employ the sequence-to-sequence framework to encode the input with a neural network and generate…

Computation and Language · Computer Science 2020-03-26 Haiyang Xu , Yahao He , Kun Han , Junwen Chen , Xiangang Li

Topic-Aware Encoding for Extractive Summarization

Document summarization provides an instrument for faster understanding the collection of text documents and has several real-life applications. With the growth of online text data, numerous summarization models have been proposed recently.…

Computation and Language · Computer Science 2022-04-01 Mingyang Song , Liping Jing

Multilevel Text Alignment with Cross-Document Attention

Text alignment finds application in tasks such as citation recommendation and plagiarism detection. Existing alignment methods operate at a single, predefined level and cannot learn to align texts at, for example, sentence and document…

Computation and Language · Computer Science 2020-10-06 Xuhui Zhou , Nikolaos Pappas , Noah A. Smith

A Topic-aware Summarization Framework with Different Modal Side Information

Automatic summarization plays an important role in the exponential document growth on the Web. On content websites such as CNN.com and WikiHow.com, there often exist various kinds of side information along with the main document for…

Computation and Language · Computer Science 2023-05-22 Xiuying Chen , Mingzhe Li , Shen Gao , Xin Cheng , Qiang Yang , Qishen Zhang , Xin Gao , Xiangliang Zhang

Improving Multi-Document Summarization through Referenced Flexible Extraction with Credit-Awareness

A notable challenge in Multi-Document Summarization (MDS) is the extremely-long length of the input. In this paper, we present an extract-then-abstract Transformer framework to overcome the problem. Specifically, we leverage pre-trained…

Computation and Language · Computer Science 2022-05-05 Yun-Zhu Song , Yi-Syuan Chen , Hong-Han Shuai

Large Scale Multi-Lingual Multi-Modal Summarization Dataset

Significant developments in techniques such as encoder-decoder models have enabled us to represent information comprising multiple modalities. This information can further enhance many downstream tasks in the field of information retrieval…

Computation and Language · Computer Science 2023-02-14 Yash Verma , Anubhav Jangra , Raghvendra Kumar , Sriparna Saha

Multi-document Summarization via Deep Learning Techniques: A Survey

Multi-document summarization (MDS) is an effective tool for information aggregation that generates an informative and concise summary from a cluster of topic-related documents. Our survey, the first of its kind, systematically overviews the…

Computation and Language · Computer Science 2021-12-10 Congbo Ma , Wei Emma Zhang , Mingyu Guo , Hu Wang , Quan Z. Sheng

The Power of Summary-Source Alignments

Multi-document summarization (MDS) is a challenging task, often decomposed to subtasks of salience and redundancy detection, followed by text generation. In this context, alignment of corresponding sentences between a reference summary and…

Computation and Language · Computer Science 2024-06-04 Ori Ernst , Ori Shapira , Aviv Slobodkin , Sharon Adar , Mohit Bansal , Jacob Goldberger , Ran Levy , Ido Dagan

DynE: Dynamic Ensemble Decoding for Multi-Document Summarization

Sequence-to-sequence (s2s) models are the basis for extensive work in natural language processing. However, some applications, such as multi-document summarization, multi-modal machine translation, and the automatic post-editing of machine…

Computation and Language · Computer Science 2020-06-17 Chris Hokamp , Demian Gholipour Ghalandari , Nghia The Pham , John Glover

Deconstructing Positional Information: From Attention Logits to Training Biases

Positional encodings enable Transformers to incorporate sequential information, yet their theoretical understanding remains limited to two properties: distance attenuation and translation invariance. Because natural language lacks purely…

Machine Learning · Computer Science 2026-02-11 Zihan Gu , Ruoyu Chen , Han Zhang , Hua Zhang , Yue Hu

Encoding Sentence Position in Context-Aware Neural Machine Translation with Concatenation

Context-aware translation can be achieved by processing a concatenation of consecutive sentences with the standard Transformer architecture. This paper investigates the intuitive idea of providing the model with explicit information about…

Computation and Language · Computer Science 2023-04-06 Lorenzo Lupo , Marco Dinarelli , Laurent Besacier

Adapting the Neural Encoder-Decoder Framework from Single to Multi-Document Summarization

Generating a text abstract from a set of documents remains a challenging task. The neural encoder-decoder framework has recently been exploited to summarize single documents, but its success can in part be attributed to the availability of…

Computation and Language · Computer Science 2018-08-29 Logan Lebanoff , Kaiqiang Song , Fei Liu

StructSum: Summarization via Structured Representations

Abstractive text summarization aims at compressing the information of a long source document into a rephrased, condensed summary. Despite advances in modeling techniques, abstractive summarization models still suffer from several key…

Computation and Language · Computer Science 2021-02-17 Vidhisha Balachandran , Artidoro Pagnoni , Jay Yoon Lee , Dheeraj Rajagopal , Jaime Carbonell , Yulia Tsvetkov

A Transformer-based Approach for Source Code Summarization

Generating a readable summary that describes the functionality of a program is known as source code summarization. In this task, learning code representation by modeling the pairwise relationship between code tokens to capture their…

Software Engineering · Computer Science 2020-05-05 Wasi Uddin Ahmad , Saikat Chakraborty , Baishakhi Ray , Kai-Wei Chang