English

Document Graph for Neural Machine Translation

Computation and Language 2021-09-15 v3

Abstract

Previous works have shown that contextual information can improve the performance of neural machine translation (NMT). However, most existing document-level NMT methods only consider a few number of previous sentences. How to make use of the whole document as global contexts is still a challenge. To address this issue, we hypothesize that a document can be represented as a graph that connects relevant contexts regardless of their distances. We employ several types of relations, including adjacency, syntactic dependency, lexical consistency, and coreference, to construct the document graph. Then, we incorporate both source and target graphs into the conventional Transformer architecture with graph convolutional networks. Experiments on various NMT benchmarks, including IWSLT English--French, Chinese-English, WMT English--German and Opensubtitle English--Russian, demonstrate that using document graphs can significantly improve the translation quality. Extensive analysis verifies that the document graph is beneficial for capturing discourse phenomena.

Keywords

Cite

@article{arxiv.2012.03477,
  title  = {Document Graph for Neural Machine Translation},
  author = {Mingzhou Xu and Liangyou Li and Derek. F. Wong and Qun Liu and Lidia S. Chao},
  journal= {arXiv preprint arXiv:2012.03477},
  year   = {2021}
}

Comments

Accepted by EMNLP2021

R2 v1 2026-06-23T20:46:16.166Z