Related papers: Summary-Source Proposition-level Alignment: Task, …

The Power of Summary-Source Alignments

Multi-document summarization (MDS) is a challenging task, often decomposed to subtasks of salience and redundancy detection, followed by text generation. In this context, alignment of corresponding sentences between a reference summary and…

Computation and Language · Computer Science 2024-06-04 Ori Ernst, Ori Shapira, Aviv Slobodkin, Sharon Adar, Mohit Bansal, Jacob Goldberger, Ran Levy, Ido Dagan

Improving Sentence Similarity Estimation for Unsupervised Extractive Summarization

Unsupervised extractive summarization aims to extract salient sentences from a document as the summary without labeled data. Recent literature mostly studies how to leverage sentence similarity to rank sentences in the order of salience.…

Computation and Language · Computer Science 2023-02-27 Shichao Sun, Ruifeng Yuan, Wenjie Li, Sujian Li

Align then Summarize: Automatic Alignment Methods for Summarization Corpus Creation

Summarizing texts is not a straightforward task. Before even considering text summarization, one should determine what kind of summary is expected. How much should the information be compressed? Is it relevant to reformulate or should the…

Computation and Language · Computer Science 2020-07-16 Paul Tardy, David Janiszek, Yannick Estève, Vincent Nguyen

Unsupervised Text Summarization via Mixed Model Back-Translation

Back-translation based approaches have recently led to significant progress in unsupervised sequence-to-sequence tasks such as machine translation or style transfer. In this work, we extend the paradigm to the problem of learning a…

Computation and Language · Computer Science 2019-08-26 Yacine Jernite

SumRank: Aligning Summarization Models for Long-Document Listwise Reranking

Large Language Models (LLMs) have demonstrated superior performance in listwise passage reranking task. However, directly applying them to rank long-form documents introduces both effectiveness and efficiency issues due to the substantially…

Information Retrieval · Computer Science 2026-03-26 Jincheng Feng, Wenhan Liu, Zhicheng Dou

Simple Unsupervised Summarization by Contextual Matching

We propose an unsupervised method for sentence summarization using only language modeling. The approach employs two language models, one that is generic (i.e. pretrained), and the other that is specific to the target domain. We show that by…

Computation and Language · Computer Science 2019-08-01 Jiawei Zhou, Alexander M. Rush

Evaluating the Factual Consistency of Abstractive Text Summarization

Currently used metrics for assessing summarization algorithms do not account for whether summaries are factually consistent with source documents. We propose a weakly-supervised, model-based approach for verifying factual consistency and…

Computation and Language · Computer Science 2019-10-29 Wojciech Kryściński, Bryan McCann, Caiming Xiong, Richard Socher

Unsupervised Opinion Summarization with Noising and Denoising

The supervised training of high-capacity models on large datasets containing hundreds of thousands of document-summary pairs is critical to the recent success of deep learning techniques for abstractive summarization. Unfortunately, in most…

Computation and Language · Computer Science 2020-04-22 Reinald Kim Amplayo, Mirella Lapata

Centrality Meets Centroid: A Graph-based Approach for Unsupervised Document Summarization

Unsupervised document summarization has re-acquired lots of attention in recent years thanks to its simplicity and data independence. In this paper, we propose a graph-based unsupervised approach for extractive document summarization.…

Computation and Language · Computer Science 2021-04-23 Haopeng Zhang, Jiawei Zhang

HighRES: Highlight-based Reference-less Evaluation of Summarization

There has been substantial progress in summarization research enabled by the availability of novel, often large-scale, datasets and recent advances on neural network-based approaches. However, manual evaluation of the system generated…

Computation and Language · Computer Science 2019-06-05 Hardy, Shashi Narayan, Andreas Vlachos

SALT: Subspace Alignment as an Auxiliary Learning Task for Domain Adaptation

Unsupervised domain adaptation aims to transfer and adapt knowledge learned from a labeled source domain to an unlabeled target domain. Key components of unsupervised domain adaptation include: (a) maximizing performance on the target, and…

Machine Learning · Statistics 2019-12-20 Kowshik Thopalli, Jayaraman J. Thiagarajan, Rushil Anirudh, Pavan Turaga

Unsupervised Extractive Summarization by Pre-training Hierarchical Transformers

Unsupervised extractive document summarization aims to select important sentences from a document without using labeled summaries during training. Existing methods are mostly graph-based with sentences as nodes and edge weights measured by…

Computation and Language · Computer Science 2021-12-14 Shusheng Xu, Xingxing Zhang, Yi Wu, Furu Wei, Ming Zhou

Mask-Align: Self-Supervised Neural Word Alignment

Word alignment, which aims to align translationally equivalent words between source and target sentences, plays an important role in many natural language processing tasks. Current unsupervised neural alignment methods focus on inducing…

Computation and Language · Computer Science 2021-05-18 Chi Chen, Maosong Sun, Yang Liu

Query-focused Multi-document Summarization: Combining a Novel Topic Model with Graph-based Semi-supervised Learning

Graph-based semi-supervised learning has proven to be an effective approach for query-focused multi-document summarization. The problem of previous semi-supervised learning is that sentences are ranked without considering the higher level…

Computation and Language · Computer Science 2014-01-03 Jiwei Li, Sujian Li

Proposition-Level Clustering for Multi-Document Summarization

Text clustering methods were traditionally incorporated into multi-document summarization (MDS) as a means for coping with considerable information repetition. Particularly, clusters were leveraged to indicate information saliency as well…

Computation and Language · Computer Science 2022-05-23 Ori Ernst, Avi Caciularu, Ori Shapira, Ramakanth Pasunuru, Mohit Bansal, Jacob Goldberger, Ido Dagan

Unsupervised Extractive Summarization with Learnable Length Control Strategies

Unsupervised extractive summarization is an important technique in information extraction and retrieval. Compared with supervised methods, it does not require high-quality human-labelled summaries for training and thus can be easily applied…

Artificial Intelligence · Computer Science 2023-12-19 Renlong Jie, Xiaojun Meng, Xin Jiang, Qun Liu

Controlled Text Reduction

Producing a reduced version of a source text, as in generic or focused summarization, inherently involves two distinct subtasks: deciding on targeted content and generating a coherent text conveying it. While some popular approaches address…

Computation and Language · Computer Science 2022-10-25 Aviv Slobodkin, Paul Roit, Eran Hirsch, Ori Ernst, Ido Dagan

Focused Meeting Summarization via Unsupervised Relation Extraction

We present a novel unsupervised framework for focused meeting summarization that views the problem as an instance of relation extraction. We adapt an existing in-domain relation learner (Chen et al., 2011) by exploiting a set of…

Computation and Language · Computer Science 2016-06-28 Lu Wang, Claire Cardie

New Alignment Methods for Discriminative Book Summarization

We consider the unsupervised alignment of the full text of a book with a human-written summary. This presents challenges not seen in other text alignment problems, including a disparity in length and, consequent to this, a violation of the…

Computation and Language · Computer Science 2013-05-08 David Bamman, Noah A. Smith

Discrete Optimization for Unsupervised Sentence Summarization with Word-Level Extraction

Automatic sentence summarization produces a shorter version of a sentence, while preserving its most important information. A good summary is characterized by language fluency and high information overlap with the source sentence. We model…

Computation and Language · Computer Science 2020-05-06 Raphael Schumann, Lili Mou, Yao Lu, Olga Vechtomova, Katja Markert