Control, Generate, Augment: A Scalable Framework for Multi-Attribute Text Generation

Giuseppe Russo; Nora Hollenstein; Claudiu Musat; Ce Zhang

Control, Generate, Augment: A Scalable Framework for Multi-Attribute Text Generation

Computation and Language 2020-10-05 v2

Authors: Giuseppe Russo , Nora Hollenstein , Claudiu Musat , Ce Zhang

Abstract

We introduce CGA, a conditional VAE architecture, to control, generate, and augment text. CGA is able to generate natural English sentences controlling multiple semantic and syntactic attributes by combining adversarial learning with a context-aware loss and a cyclical word dropout routine. We demonstrate the value of the individual model components in an ablation study. The scalability of our approach is ensured through a single discriminator, independently of the number of attributes. We show high quality, diversity and attribute control in the generated sentences through a series of automatic and human assessments. As the main application of our work, we test the potential of this new NLG model in a data augmentation scenario. In a downstream NLP task, the sentences generated by our CGA model show significant improvements over a strong baseline, and a classification performance often comparable to adding same amount of additional real data.

Keywords

text generation grammatical error correction natural language processing

Cite

@article{arxiv.2004.14983,
  title  = {Control, Generate, Augment: A Scalable Framework for Multi-Attribute Text Generation},
  author = {Giuseppe Russo and Nora Hollenstein and Claudiu Musat and Ce Zhang},
  journal= {arXiv preprint arXiv:2004.14983},
  year   = {2020}
}

Comments

Accepted at Findings of EMNLP 2020

Control, Generate, Augment: A Scalable Framework for Multi-Attribute Text Generation

Abstract

Keywords

Cite

Comments

Related papers