Related papers: Bootstrapping Generators from Noisy Data

Towards Content Transfer through Grounded Text Generation

Recent work in neural generation has attracted significant interest in controlling the form of text, such as style, persona, and politeness. However, there has been less work on controlling neural text generation for content. This paper…

Computation and Language · Computer Science 2019-05-15 Shrimai Prabhumoye , Chris Quirk , Michel Galley

WikiGraphs: A Wikipedia Text - Knowledge Graph Paired Dataset

We present a new dataset of Wikipedia articles each paired with a knowledge graph, to facilitate the research in conditional text generation, graph generation and graph representation learning. Existing graph-text paired datasets typically…

Computation and Language · Computer Science 2021-07-21 Luyu Wang , Yujia Li , Ozlem Aslan , Oriol Vinyals

Data-to-text Generation with Variational Sequential Planning

We consider the task of data-to-text generation, which aims to create textual output from non-linguistic input. We focus on generating long-form text, i.e., documents with multiple paragraphs, and propose a neural model enhanced with a…

Computation and Language · Computer Science 2022-03-01 Ratish Puduppully , Yao Fu , Mirella Lapata

Incorporating Consistency Verification into Neural Data-to-Document Generation

Recent neural models for data-to-document generation have achieved remarkable progress in producing fluent and informative texts. However, large proportions of generated texts do not actually conform to the input data. To address this…

Computation and Language · Computer Science 2018-08-21 Feng Nie , Hailin Chen , Jinpeng Wang , Jin-Ge Yao , Chin-Yew Lin , Rong Pan

Data-to-Text Generation with Style Imitation

Recent neural approaches to data-to-text generation have mostly focused on improving content fidelity while lacking explicit control over writing styles (e.g., word choices, sentence structures). More traditional systems use templates to…

Computation and Language · Computer Science 2020-10-12 Shuai Lin , Wentao Wang , Zichao Yang , Xiaodan Liang , Frank F. Xu , Eric Xing , Zhiting Hu

Search and Learn: Improving Semantic Coverage for Data-to-Text Generation

Data-to-text generation systems aim to generate text descriptions based on input data (often represented in the tabular form). A typical system uses huge training samples for learning the correspondence between tables and texts. However,…

Computation and Language · Computer Science 2021-12-07 Shailza Jolly , Zi Xuan Zhang , Andreas Dengel , Lili Mou

Neural Data-to-Text Generation with Dynamic Content Planning

Neural data-to-text generation models have achieved significant advancement in recent years. However, these models have two shortcomings: the generated texts tend to miss some vital information, and they often generate descriptions that are…

Computation and Language · Computer Science 2020-04-21 Kai Chen , Fayuan Li , Baotian Hu , Weihua Peng , Qingcai Chen , Hong Yu

Data-to-text Generation with Macro Planning

Recent approaches to data-to-text generation have adopted the very successful encoder-decoder architecture or variants thereof. These models generate text which is fluent (but often imprecise) and perform quite poorly at selecting…

Computation and Language · Computer Science 2021-02-05 Ratish Puduppully , Mirella Lapata

Uncertainty-Aware Bootstrap Learning for Joint Extraction on Distantly-Supervised Data

Jointly extracting entity pairs and their relations is challenging when working on distantly-supervised data with ambiguous or noisy labels. To mitigate such impact, we propose uncertainty-aware bootstrap learning, which is motivated by the…

Computation and Language · Computer Science 2023-06-12 Yufei Li , Xiao Yu , Yanchi Liu , Haifeng Chen , Cong Liu

End-to-End Content and Plan Selection for Data-to-Text Generation

Learning to generate fluent natural language from structured data with neural networks has become an common approach for NLG. This problem can be challenging when the form of the structured data varies between examples. This paper presents…

Computation and Language · Computer Science 2018-10-12 Sebastian Gehrmann , Falcon Z. Dai , Henry Elder , Alexander M. Rush

Exploring In-Context Learning Capabilities of Foundation Models for Generating Knowledge Graphs from Text

Knowledge graphs can represent information about the real-world using entities and their relations in a structured and semantically rich manner and they enable a variety of downstream applications such as question-answering, recommendation…

Computation and Language · Computer Science 2023-05-16 Hanieh Khorashadizadeh , Nandana Mihindukulasooriya , Sanju Tiwari , Jinghua Groppe , Sven Groppe

RetGen: A Joint framework for Retrieval and Grounded Text Generation Modeling

Recent advances in large-scale pre-training such as GPT-3 allow seemingly high quality text to be generated from a given prompt. However, such generation systems often suffer from problems of hallucinated facts, and are not inherently…

Computation and Language · Computer Science 2022-02-25 Yizhe Zhang , Siqi Sun , Xiang Gao , Yuwei Fang , Chris Brockett , Michel Galley , Jianfeng Gao , Bill Dolan

Focused Attention Improves Document-Grounded Generation

Document grounded generation is the task of using the information provided in a document to improve text generation. This work focuses on two different document grounded generation tasks: Wikipedia Update Generation task and Dialogue…

Computation and Language · Computer Science 2021-04-27 Shrimai Prabhumoye , Kazuma Hashimoto , Yingbo Zhou , Alan W Black , Ruslan Salakhutdinov

DiffSampling: Enhancing Diversity and Accuracy in Neural Text Generation

Despite their growing capabilities, language models still frequently reproduce content from their training data, generate repetitive text, and favor common grammatical patterns and vocabulary. A possible cause is the decoding strategy: the…

Computation and Language · Computer Science 2026-01-15 Giorgio Franceschelli , Mirco Musolesi

Learning to Describe Differences Between Pairs of Similar Images

In this paper, we introduce the task of automatically generating text to describe the differences between two similar images. We collect a new dataset by crowd-sourcing difference descriptions for pairs of image frames extracted from…

Computation and Language · Computer Science 2018-09-03 Harsh Jhamtani , Taylor Berg-Kirkpatrick

Constructing Datasets for Multi-hop Reading Comprehension Across Documents

Most Reading Comprehension methods limit themselves to queries which can be answered using a single sentence, paragraph, or document. Enabling models to combine disjoint pieces of textual evidence would extend the scope of machine…

Computation and Language · Computer Science 2018-06-12 Johannes Welbl , Pontus Stenetorp , Sebastian Riedel

A Single Example Can Improve Zero-Shot Data Generation

Sub-tasks of intent classification, such as robustness to distribution shift, adaptation to specific user groups and personalization, out-of-domain detection, require extensive and flexible datasets for experiments and evaluation. As…

Computation and Language · Computer Science 2021-08-17 Pavel Burnyshev , Valentin Malykh , Andrey Bout , Ekaterina Artemova , Irina Piontkovskaya

WikiTableT: A Large-Scale Data-to-Text Dataset for Generating Wikipedia Article Sections

Datasets for data-to-text generation typically focus either on multi-domain, single-sentence generation or on single-domain, long-form generation. In this work, we cast generating Wikipedia sections as a data-to-text generation task and…

Computation and Language · Computer Science 2021-06-03 Mingda Chen , Sam Wiseman , Kevin Gimpel

Challenges in Data-to-Document Generation

Recent neural models have shown significant progress on the problem of generating short descriptive texts conditioned on a small number of database records. In this work, we suggest a slightly more difficult data-to-text generation task,…

Computation and Language · Computer Science 2017-07-26 Sam Wiseman , Stuart M. Shieber , Alexander M. Rush

Weaver: Deep Co-Encoding of Questions and Documents for Machine Reading

This paper aims at improving how machines can answer questions directly from text, with the focus of having models that can answer correctly multiple types of questions and from various types of texts, documents or even from large…

Computation and Language · Computer Science 2018-04-30 Martin Raison , Pierre-Emmanuel Mazaré , Rajarshi Das , Antoine Bordes