AgreeSum: Agreement-Oriented Multi-Document Summarization

Richard Yuanzhe Pang; Adam D. Lelkes; Vinh Q. Tran; Cong Yu

AgreeSum: Agreement-Oriented Multi-Document Summarization

Computation and Language 2021-06-07 v1

Authors: Richard Yuanzhe Pang , Adam D. Lelkes , Vinh Q. Tran , Cong Yu

Abstract

We aim to renew interest in a particular multi-document summarization (MDS) task which we call AgreeSum: agreement-oriented multi-document summarization. Given a cluster of articles, the goal is to provide abstractive summaries that represent information common and faithful to all input articles. Given the lack of existing datasets, we create a dataset for AgreeSum, and provide annotations on article-summary entailment relations for a subset of the clusters in the dataset. We aim to create strong baselines for the task by applying the top-performing pretrained single-document summarization model PEGASUS onto AgreeSum, leveraging both annotated clusters by supervised losses, and unannotated clusters by T5-based entailment-related and language-related losses. Compared to other baselines, both automatic evaluation and human evaluation show better article-summary and cluster-summary entailment in generated summaries. On a separate note, we hope that our article-summary entailment annotations contribute to the community's effort in improving abstractive summarization faithfulness.

Keywords

text summarization data annotation information retrieval

Cite

@article{arxiv.2106.02278,
  title  = {AgreeSum: Agreement-Oriented Multi-Document Summarization},
  author = {Richard Yuanzhe Pang and Adam D. Lelkes and Vinh Q. Tran and Cong Yu},
  journal= {arXiv preprint arXiv:2106.02278},
  year   = {2021}
}

Comments

Findings of ACL 2021

AgreeSum: Agreement-Oriented Multi-Document Summarization

Abstract

Keywords

Cite

Comments

Related papers