English
Related papers

Related papers: Challenges in Data-to-Document Generation

200 papers

We consider the task of data-to-text generation, which aims to create textual output from non-linguistic input. We focus on generating long-form text, i.e., documents with multiple paragraphs, and propose a neural model enhanced with a…

Computation and Language · Computer Science 2022-03-01 Ratish Puduppully , Yao Fu , Mirella Lapata

Recent approaches to data-to-text generation have adopted the very successful encoder-decoder architecture or variants thereof. These models generate text which is fluent (but often imprecise) and perform quite poorly at selecting…

Computation and Language · Computer Science 2021-02-05 Ratish Puduppully , Mirella Lapata

Recent neural approaches to data-to-text generation have mostly focused on improving content fidelity while lacking explicit control over writing styles (e.g., word choices, sentence structures). More traditional systems use templates to…

Computation and Language · Computer Science 2020-10-12 Shuai Lin , Wentao Wang , Zichao Yang , Xiaodan Liang , Frank F. Xu , Eric Xing , Zhiting Hu

Recent advances in deep neural language models combined with the capacity of large scale datasets have accelerated the development of natural language generation systems that produce fluent and coherent texts (to various degrees of success)…

Computation and Language · Computer Science 2025-04-15 Cristina Garbacea , Qiaozhu Mei

We propose a novel conditioned text generation model. It draws inspiration from traditional template-based text generation techniques, where the source provides the content (i.e., what to say), and the template influences how to say it.…

Computation and Language · Computer Science 2019-04-12 Hao Peng , Ankur P. Parikh , Manaal Faruqui , Bhuwan Dhingra , Dipanjan Das

Recent approaches to data-to-text generation have shown great promise thanks to the use of large-scale datasets and the application of neural network architectures which are trained end-to-end. These models rely on representation learning…

Computation and Language · Computer Science 2019-06-10 Ratish Puduppully , Li Dong , Mirella Lapata

Neural models have recently been used in text summarization including headline generation. The model can be trained using a set of document-headline pairs. However, the model does not explicitly consider topical similarities and differences…

Computation and Language · Computer Science 2016-08-23 Lei Xu , Ziyun Wang , Ayana , Zhiyuan Liu , Maosong Sun

We present a comparison of word-based and character-based sequence-to-sequence models for data-to-text natural language generation, which generate natural language descriptions for structured inputs. On the datasets of two recent generation…

Computation and Language · Computer Science 2018-10-12 Glorianna Jagfeld , Sabrina Jenne , Ngoc Thang Vu

Recent neural models for data-to-document generation have achieved remarkable progress in producing fluent and informative texts. However, large proportions of generated texts do not actually conform to the input data. To address this…

Computation and Language · Computer Science 2018-08-21 Feng Nie , Hailin Chen , Jinpeng Wang , Jin-Ge Yao , Chin-Yew Lin , Rong Pan

Neural data-to-text generation models have achieved significant advancement in recent years. However, these models have two shortcomings: the generated texts tend to miss some vital information, and they often generate descriptions that are…

Computation and Language · Computer Science 2020-04-21 Kai Chen , Fayuan Li , Baotian Hu , Weihua Peng , Qingcai Chen , Hong Yu

We follow the step-by-step approach to neural data-to-text generation we proposed in Moryossef et al (2019), in which the generation process is divided into a text-planning stage followed by a plan-realization stage. We suggest four…

Computation and Language · Computer Science 2019-09-24 Amit Moryossef , Ido Dagan , Yoav Goldberg

Recent advances in data-to-text generation have led to the use of large-scale datasets and neural network models which are trained end-to-end, without explicitly modeling what to say and in what order. In this work, we present a neural…

Computation and Language · Computer Science 2019-04-15 Ratish Puduppully , Li Dong , Mirella Lapata

Text generation system has made massive promising progress contributed by deep learning techniques and has been widely applied in our life. However, existing end-to-end neural models suffer from the problem of tending to generate…

Artificial Intelligence · Computer Science 2020-03-03 Hao Wang , Bin Guo , Wei Wu , Zhiwen Yu

Deep learning methods have recently achieved great empirical success on machine translation, dialogue response generation, summarization, and other text generation tasks. At a high level, the technique has been to train end-to-end neural…

Computation and Language · Computer Science 2017-11-28 Ziang Xie

We propose a data-to-text generation model with two modules, one for tracking and the other for text generation. Our tracking module selects and keeps track of salient information and memorizes which record has been mentioned. Our…

Computation and Language · Computer Science 2021-04-05 Hayate Iso , Yui Uehara , Tatsuya Ishigaki , Hiroshi Noji , Eiji Aramaki , Ichiro Kobayashi , Yusuke Miyao , Naoaki Okazaki , Hiroya Takamura

Recent advances in neural network-based generative modeling have reignited the hopes in having computer systems capable of seamlessly conversing with humans and able to understand natural language. Neural architectures have been employed to…

Computation and Language · Computer Science 2020-08-03 Cristina Garbacea , Qiaozhu Mei

In many cases of machine learning, research suggests that the development of training data might have a higher relevance than the choice and modelling of classifiers themselves. Thus, data augmentation methods have been developed to improve…

Computation and Language · Computer Science 2022-07-25 Markus Bayer , Marc-André Kaufhold , Björn Buchhold , Marcel Keller , Jörg Dallmeyer , Christian Reuter

The generation of precise and detailed Table-Of-Contents (TOC) from a document is a problem of major importance for document understanding and information extraction. Despite its importance, it is still a challenging task, especially for…

Computation and Language · Computer Science 2019-11-21 Najah-Imane Bentabet , Rémi Juge , Sira Ferradans

For many new application domains for data-to-text generation, the main obstacle in training neural models consists of a lack of training data. While usually large numbers of instances are available on the data side, often only very few text…

Computation and Language · Computer Science 2021-02-09 Ernie Chang , Xiaoyu Shen , Dawei Zhu , Vera Demberg , Hui Su

Sequence-to-sequence models have lead to significant progress in keyphrase generation, but it remains unknown whether they are reliable enough to be beneficial for document retrieval. This study provides empirical evidence that such models…

Information Retrieval · Computer Science 2021-06-29 Florian Boudin , Ygor Gallina , Akiko Aizawa
‹ Prev 1 2 3 10 Next ›