English

Zero-Shot Translation using Diffusion Models

Computation and Language 2021-11-03 v1 Machine Learning

Abstract

In this work, we show a novel method for neural machine translation (NMT), using a denoising diffusion probabilistic model (DDPM), adjusted for textual data, following recent advances in the field. We show that it's possible to translate sentences non-autoregressively using a diffusion model conditioned on the source sentence. We also show that our model is able to translate between pairs of languages unseen during training (zero-shot learning).

Keywords

Cite

@article{arxiv.2111.01471,
  title  = {Zero-Shot Translation using Diffusion Models},
  author = {Eliya Nachmani and Shaked Dovrat},
  journal= {arXiv preprint arXiv:2111.01471},
  year   = {2021}
}

Comments

preprint

R2 v1 2026-06-24T07:22:19.226Z