Insertion-Deletion Transformer

Laura Ruis; Mitchell Stern; Julia Proskurnia; William Chan

Insertion-Deletion Transformer

Machine Learning 2020-01-17 v1 Computation and Language Machine Learning

Authors: Laura Ruis , Mitchell Stern , Julia Proskurnia , William Chan

Abstract

We propose the Insertion-Deletion Transformer, a novel transformer-based neural architecture and training method for sequence generation. The model consists of two phases that are executed iteratively, 1) an insertion phase and 2) a deletion phase. The insertion phase parameterizes a distribution of insertions on the current output hypothesis, while the deletion phase parameterizes a distribution of deletions over the current output hypothesis. The training method is a principled and simple algorithm, where the deletion model obtains its signal directly on-policy from the insertion model output. We demonstrate the effectiveness of our Insertion-Deletion Transformer on synthetic translation tasks, obtaining significant BLEU score improvement over an insertion-only model.

Keywords

transformer deep learning model transformation

Cite

@article{arxiv.2001.05540,
  title  = {Insertion-Deletion Transformer},
  author = {Laura Ruis and Mitchell Stern and Julia Proskurnia and William Chan},
  journal= {arXiv preprint arXiv:2001.05540},
  year   = {2020}
}

Comments

Accepted as an Extended Abstract at the Workshop of Neural Generation and Translation (WNGT 2019) at EMNLP 2019

Insertion-Deletion Transformer

Abstract

Keywords

Cite

Comments

Related papers