Related papers: ExtraPhrase: Efficient Data Augmentation for Abstr…

Abstractive Document Summarization without Parallel Data

Abstractive summarization typically relies on large collections of paired articles and summaries. However, in many cases, parallel data is scarce and costly to obtain. We develop an abstractive summarization system that relies only on large…

Computation and Language · Computer Science 2020-03-04 Nikola I. Nikolov , Richard H. R. Hahnloser

Rethinking Data Augmentation for Low-Resource Neural Machine Translation: A Multi-Task Learning Approach

In the context of neural machine translation, data augmentation (DA) techniques may be used for generating additional training samples when the available parallel data are scarce. Many DA approaches aim at expanding the support of the…

Computation and Language · Computer Science 2021-09-09 Víctor M. Sánchez-Cartagena , Miquel Esplà-Gomis , Juan Antonio Pérez-Ortiz , Felipe Sánchez-Martínez

On Extractive and Abstractive Neural Document Summarization with Transformer Language Models

We present a method to produce abstractive summaries of long documents that exceed several thousand words via neural abstractive summarization. We perform a simple extractive step before generating a summary, which is then used to condition…

Computation and Language · Computer Science 2020-04-29 Sandeep Subramanian , Raymond Li , Jonathan Pilault , Christopher Pal

Exploring Multitask Learning for Low-Resource AbstractiveSummarization

This paper explores the effect of using multitask learning for abstractive summarization in the context of small training corpora. In particular, we incorporate four different tasks (extractive summarization, language modeling, concept…

Computation and Language · Computer Science 2021-09-20 Ahmed Magooda , Mohamed Elaraby , Diane Litman

Generating Multiple-Length Summaries via Reinforcement Learning for Unsupervised Sentence Summarization

Sentence summarization shortens given texts while maintaining core contents of the texts. Unsupervised approaches have been studied to summarize texts without human-written summaries. However, recent unsupervised models are extractive,…

Computation and Language · Computer Science 2022-12-22 Dongmin Hyun , Xiting Wang , Chanyoung Park , Xing Xie , Hwanjo Yu

Extract and Edit: An Alternative to Back-Translation for Unsupervised Neural Machine Translation

The overreliance on large parallel corpora significantly limits the applicability of machine translation systems to the majority of language pairs. Back-translation has been dominantly used in previous approaches for unsupervised neural…

Computation and Language · Computer Science 2019-04-05 Jiawei Wu , Xin Wang , William Yang Wang

Multi-stage Pretraining for Abstractive Summarization

Neural models for abstractive summarization tend to achieve the best performance in the presence of highly specialized, summarization specific modeling add-ons such as pointer-generator, coverage-modeling, and inferencetime heuristics. We…

Computation and Language · Computer Science 2019-09-25 Sebastian Goodman , Zhenzhong Lan , Radu Soricut

STRASS: A Light and Effective Method for Extractive Summarization Based on Sentence Embeddings

This paper introduces STRASS: Summarization by TRAnsformation Selection and Scoring. It is an extractive text summarization method which leverages the semantic information in existing sentence embedding spaces. Our method creates an…

Computation and Language · Computer Science 2019-07-18 Léo Bouscarrat , Antoine Bonnefoy , Thomas Peel , Cécile Pereira

Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting

Inspired by how humans summarize long documents, we propose an accurate and fast summarization model that first selects salient sentences and then rewrites them abstractively (i.e., compresses and paraphrases) to generate a concise overall…

Computation and Language · Computer Science 2018-05-29 Yen-Chun Chen , Mohit Bansal

Meta-Transfer Learning for Low-Resource Abstractive Summarization

Neural abstractive summarization has been studied in many pieces of literature and achieves great success with the aid of large corpora. However, when encountering novel tasks, one may not always benefit from transfer learning due to the…

Computation and Language · Computer Science 2021-06-01 Yi-Syuan Chen , Hong-Han Shuai

Enhancing Coherence of Extractive Summarization with Multitask Learning

This study proposes a multitask learning architecture for extractive summarization with coherence boosting. The architecture contains an extractive summarizer and coherent discriminator module. The coherent discriminator is trained online…

Computation and Language · Computer Science 2023-07-24 Renlong Jie , Xiaojun Meng , Lifeng Shang , Xin Jiang , Qun Liu

Improving Multi-Document Summarization through Referenced Flexible Extraction with Credit-Awareness

A notable challenge in Multi-Document Summarization (MDS) is the extremely-long length of the input. In this paper, we present an extract-then-abstract Transformer framework to overcome the problem. Specifically, we leverage pre-trained…

Computation and Language · Computer Science 2022-05-05 Yun-Zhu Song , Yi-Syuan Chen , Hong-Han Shuai

PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization

Recent work pre-training Transformers with self-supervised objectives on large text corpora has shown great success when fine-tuned on downstream NLP tasks including text summarization. However, pre-training objectives tailored for…

Computation and Language · Computer Science 2020-07-21 Jingqing Zhang , Yao Zhao , Mohammad Saleh , Peter J. Liu

A New Sentence Extraction Strategy for Unsupervised Extractive Summarization Methods

In recent years, text summarization methods have attracted much attention again thanks to the researches on neural network models. Most of the current text summarization methods based on neural network models are supervised methods which…

Computation and Language · Computer Science 2024-01-25 Dehao Tao , Yingzhu Xiong , Zhongliang Yang , Yongfeng Huang

Mitigating Data Scarceness through Data Synthesis, Augmentation and Curriculum for Abstractive Summarization

This paper explores three simple data manipulation techniques (synthesis, augmentation, curriculum) for improving abstractive summarization models without the need for any additional data. We introduce a method of data synthesis with…

Computation and Language · Computer Science 2021-09-20 Ahmed Magooda , Diane Litman

Pre-training End-to-end ASR Models with Augmented Speech Samples Queried by Text

In end-to-end automatic speech recognition system, one of the difficulties for language expansion is the limited paired speech and text training data. In this paper, we propose a novel method to generate augmented samples with unpaired…

Audio and Speech Processing · Electrical Eng. & Systems 2023-08-01 Eric Sun , Jinyu Li , Jian Xue , Yifan Gong

Make More of Your Data: Minimal Effort Data Augmentation for Automatic Speech Recognition and Translation

Data augmentation is a technique to generate new training data based on existing data. We evaluate the simple and cost-effective method of concatenating the original data examples to build new training instances. Continued training with…

Computation and Language · Computer Science 2023-06-12 Tsz Kin Lam , Shigehiko Schamoni , Stefan Riezler

Self-Supervised Learning for Contextualized Extractive Summarization

Existing models for extractive summarization are usually trained from scratch with a cross-entropy loss, which does not explicitly capture the global context at the document level. In this paper, we aim to improve this task by introducing…

Computation and Language · Computer Science 2019-06-12 Hong Wang , Xin Wang , Wenhan Xiong , Mo Yu , Xiaoxiao Guo , Shiyu Chang , William Yang Wang

Improving Sentence Similarity Estimation for Unsupervised Extractive Summarization

Unsupervised extractive summarization aims to extract salient sentences from a document as the summary without labeled data. Recent literatures mostly research how to leverage sentence similarity to rank sentences in the order of salience.…

Computation and Language · Computer Science 2023-02-27 Shichao Sun , Ruifeng Yuan , Wenjie Li , Sujian Li

A Neural Attention Model for Abstractive Sentence Summarization

Summarization based on text extraction is inherently limited, but generation-style abstractive methods have proven challenging to build. In this work, we propose a fully data-driven approach to abstractive sentence summarization. Our method…

Computation and Language · Computer Science 2015-09-04 Alexander M. Rush , Sumit Chopra , Jason Weston