Related papers: Reference Language based Unsupervised Neural Machi…

200 papers

Neural machine translation (NMT) is ineffective for zero-resource languages. Recent works exploring the possibility of unsupervised neural machine translation (UNMT) with only monolingual data can achieve promising results. However, there…

Computation and Language · Computer Science 2021-04-02 Mingxuan Wang , Hongxiao Bai , Hai Zhao , Lei Li

Conventional retrieval-augmented neural machine translation (RANMT) systems leverage bilingual corpora, e.g., translation memories (TMs). Yet, in many settings, monolingual corpora in the target language are often available. This work…

Computation and Language · Computer Science 2025-10-02 Maxime Bouthors , Josep Crego , François Yvon

Unsupervised Neural Machine Translation (UNMT) focuses on improving NMT results under the assumption there is no human translated parallel data, yet little work has been done so far in highlighting its advantages compared to supervised…

Computation and Language · Computer Science 2023-12-21 Isidora Chara Tourni , Derry Wijaya

Parallel corpora are indispensable for training neural machine translation (NMT) models, and parallel corpora for most language pairs do not exist or are scarce. In such cases, pivot language NMT can be helpful where a pivot language is…

Computation and Language · Computer Science 2021-04-16 Raj Dabre , Aizhan Imankulova , Masahiro Kaneko , Abhisek Chakrabarty

Back-translation is a critical component of Unsupervised Neural Machine Translation (UNMT), which generates pseudo parallel data from target monolingual data. A UNMT model is trained on the pseudo parallel data with translated source, and…

Computation and Language · Computer Science 2022-03-24 Zhiwei He , Xing Wang , Rui Wang , Shuming Shi , Zhaopeng Tu

Recent work achieved remarkable results in training neural machine translation (NMT) systems in a fully unsupervised way, with new and dedicated architectures that rely on monolingual corpora only. In this work, we propose to define…

Computation and Language · Computer Science 2018-10-31 Benjamin Marie , Atsushi Fujita

Unsupervised neural machine translation (UNMT) has recently achieved remarkable results with only large monolingual corpora in each language. However, the uncertainty of associating target with source sentences makes UNMT theoretically an…

Computer Vision and Pattern Recognition · Computer Science 2019-05-28 Yuanhang Su , Kai Fan , Nguyen Bach , C.-C. Jay Kuo , Fei Huang

In spite of the recent success of neural machine translation (NMT) in standard benchmarks, the lack of large parallel corpora poses a major practical problem for many language pairs. There have been several proposals to alleviate this issue…

Computation and Language · Computer Science 2018-02-27 Mikel Artetxe , Gorka Labaka , Eneko Agirre , Kyunghyun Cho

Despite the growing variety of languages supported by existing multilingual neural machine translation (MNMT) models, most of the world's languages are still being left behind. We aim to extend large-scale MNMT models to incorporate a new…

Computation and Language · Computer Science 2025-12-02 Wen Lai , Viktor Hangya , Yingli Shen , Alexander Fraser

Despite impressive progress in high-resource settings, Neural Machine Translation (NMT) still struggles in low-resource and out-of-domain scenarios, often failing to match the quality of phrase-based translation. We propose a novel…

Computation and Language · Computer Science 2018-05-31 Xing Niu , Michael Denkowski , Marine Carpuat

Back-translation (BT) has become one of the de facto components in unsupervised neural machine translation (UNMT), and it explicitly makes UNMT have translation ability. However, all the pseudo bi-texts generated by BT are treated equally…

Computation and Language · Computer Science 2021-09-24 Jinliang Lu , Jiajun Zhang

We present effective pre-training strategies for neural machine translation (NMT) using parallel corpora involving a pivot language, i.e., source-pivot and pivot-target, leading to a significant improvement in source-target translation. We…

Computation and Language · Computer Science 2019-09-23 Yunsu Kim , Petre Petrov , Pavel Petrushkov , Shahram Khadivi , Hermann Ney

For most language combinations, parallel data is either scarce or simply unavailable. To address this, unsupervised machine translation (UMT) exploits large amounts of monolingual data by using synthetic data generation techniques such as…

Computation and Language · Computer Science 2021-07-20 Dana Ruiter , Dietrich Klakow , Josef van Genabith , Cristina España-Bonet

Prior work has proved that Translation memory (TM) can boost the performance of Neural Machine Translation (NMT). In contrast to existing work that uses bilingual corpus as TM and employs source-side similarity search for memory retrieval,…

Computation and Language · Computer Science 2021-06-03 Deng Cai , Yan Wang , Huayang Li , Wai Lam , Lemao Liu

Using a language model (LM) pretrained on two languages with large monolingual data in order to initialize an unsupervised neural machine translation (UNMT) system yields state-of-the-art results. When limited data is available for one…

Computation and Language · Computer Science 2020-10-07 Alexandra Chronopoulou , Dario Stojanovski , Alexander Fraser

Pivot-based neural machine translation (NMT) is commonly used in low-resource setups, especially for translation between non-English language pairs. It benefits from using high resource source-pivot and pivot-target language pairs and an…

Computation and Language · Computer Science 2021-09-28 Evgeniia Tokarchuk , Jan Rosendahl , Weiyue Wang , Pavel Petrushkov , Tomer Lancewicki , Shahram Khadivi , Hermann Ney

We present an ensemble-driven self-training framework for unsupervised neural machine translation (UNMT). Starting from a primary language pair, we train multiple UNMT models that share the same translation task but differ in an auxiliary…

Computation and Language · Computer Science 2026-03-19 Ido Aharon , Jonathan Shaki , Sarit Kraus

While end-to-end neural machine translation (NMT) has made remarkable progress recently, NMT systems only rely on parallel corpora for parameter estimation. Since parallel corpora are usually limited in quantity, quality, and coverage,…

Computation and Language · Computer Science 2016-12-13 Yong Cheng , Wei Xu , Zhongjun He , Wei He , Hua Wu , Maosong Sun , Yang Liu

A prerequisite for training corpus-based machine translation (MT) systems -- either Statistical MT (SMT) or Neural MT (NMT) -- is the availability of high-quality parallel data. This is arguably more important today than ever before, as NMT…

Computation and Language · Computer Science 2018-04-18 Alberto Poncelas , Dimitar Shterionov , Andy Way , Gideon Maillette de Buy Wenniger , Peyman Passban

Unsupervised neural machine translation (NMT) has attracted a lot of attention recently. While state-of-the-art methods for unsupervised translation usually perform well between similar languages (e.g., English-German translation), they…

Computation and Language · Computer Science 2019-06-26 Yichong Leng , Xu Tan , Tao Qin , Xiang-Yang Li , Tie-Yan Liu