Related papers: Simulated Multiple Reference Training Improves Low…

Neural machine translation for low-resource languages

Neural machine translation (NMT) approaches have improved the state of the art in many machine translation settings over the last couple of years, but they require large amounts of training data to produce sensible output. We demonstrate…

Computation and Language · Computer Science 2017-08-22 Robert Östling, Jörg Tiedemann

Simultaneous Machine Translation with Tailored Reference

Simultaneous machine translation (SiMT) generates translation while reading the whole source sentence. However, existing SiMT models are typically trained using the same reference disregarding the varying amounts of available source…

Computation and Language · Computer Science 2023-10-27 Shoutao Guo, Shaolei Zhang, Yang Feng

Low-Resource Machine Translation Training Curriculum Fit for Low-Resource Languages

We conduct an empirical study of neural machine translation (NMT) for truly low-resource languages, and propose a training curriculum fit for cases when both parallel training data and compute resource are lacking, reflecting the reality of…

Computation and Language · Computer Science 2021-11-30 Garry Kuwanto, Afra Feyza Akyürek, Isidora Chara Tourni, Siyang Li, Alexander Gregory Jones, Derry Wijaya

Multiple References with Meaningful Variations Improve Literary Machine Translation

While a source sentence can be translated in many ways, most machine translation (MT) models are trained with only a single reference. Previous work has shown that using synthetic paraphrases can improve MT. This paper investigates best…

Computation and Language · Computer Science 2025-02-27 Si Wu, John Wieting, David A. Smith

Revisiting Low-Resource Neural Machine Translation: A Case Study

It has been shown that the performance of neural machine translation (NMT) drops starkly in low-resource conditions, underperforming phrase-based statistical machine translation (PBSMT) and requiring large amounts of auxiliary data to…

Computation and Language · Computer Science 2019-05-29 Rico Sennrich, Biao Zhang

Multi-Reference Training with Pseudo-References for Neural Translation and Text Generation

Neural text generation, including neural machine translation, image captioning, and summarization, has been quite successful recently. However, during training time, typically only one reference is considered for each example, even though…

Computation and Language · Computer Science 2018-08-30 Renjie Zheng, Mingbo Ma, Liang Huang

NSL-MT: Linguistically Informed Negative Samples for Efficient Machine Translation in Low-Resource Languages

We introduce negative space learning machine translation (NSL-MT), a training method for underresourced languages, that augments limited parallel data with synthetically generated violations of the target language's grammar and explicitly…

Machine Learning · Computer Science 2026-05-07 Mamadou K. Keita, Christopher Homan, Huy Le

SMRT Chatbots: Improving Non-Task-Oriented Dialog with Simulated Multiple Reference Training

Non-task-oriented dialog models suffer from poor quality and non-diverse responses. To overcome limited conversational data, we apply Simulated Multiple Reference Training (SMRT; Khayrallah et al., 2020), and use a paraphraser to simulate…

Computation and Language · Computer Science 2021-04-23 Huda Khayrallah, João Sedoc

Enhanced back-translation for low resource neural machine translation using self-training

Improving neural machine translation (NMT) models using the back-translations of the monolingual target data (synthetic parallel data) is currently the state-of-the-art approach for training improved translation systems. The quality of the…

Computation and Language · Computer Science 2021-02-16 Idris Abdulmumin, Bashir Shehu Galadanci, Abubakar Isa

Learning Policies for Multilingual Training of Neural Machine Translation Systems

Low-resource Multilingual Neural Machine Translation (MNMT) is typically tasked with improving the translation performance on one or more language pairs with the aid of high-resource language pairs. In this paper, we propose two simple…

Computation and Language · Computer Science 2021-03-15 Gaurav Kumar, Philipp Koehn, Sanjeev Khudanpur

Neural Machine Translation for Low-Resource Languages: A Survey

Neural Machine Translation (NMT) has seen a tremendous spurt of growth in less than ten years, and has already entered a mature phase. While considered as the most widely used solution for Machine Translation, its performance on…

Computation and Language · Computer Science 2021-06-30 Surangika Ranathunga, En-Shiun Annie Lee, Marjana Prifti Skenduli, Ravi Shekhar, Mehreen Alam, Rishemjit Kaur

HLT-MT: High-resource Language-specific Training for Multilingual Neural Machine Translation

Multilingual neural machine translation (MNMT) trained in multiple language pairs has attracted considerable attention due to fewer model parameters and lower training costs by sharing knowledge among multiple languages. Nonetheless,…

Computation and Language · Computer Science 2022-07-21 Jian Yang, Yuwei Yin, Shuming Ma, Dongdong Zhang, Zhoujun Li, Furu Wei

Combining SMT and NMT Back-Translated Data for Efficient NMT

Neural Machine Translation (NMT) models achieve their best performance when large sets of parallel data are used for training. Consequently, techniques for augmenting the training set have become popular recently. One of these methods is…

Computation and Language · Computer Science 2019-09-10 Alberto Poncelas, Maja Popovic, Dimitar Shterionov, Gideon Maillette de Buy Wenniger, Andy Way

Bi-Directional Neural Machine Translation with Synthetic Parallel Data

Despite impressive progress in high-resource settings, Neural Machine Translation (NMT) still struggles in low-resource and out-of-domain scenarios, often failing to match the quality of phrase-based translation. We propose a novel…

Computation and Language · Computer Science 2018-05-31 Xing Niu, Michael Denkowski, Marine Carpuat

Improving Low-Resource Machine Translation via Round-Trip Reinforcement Learning

Low-resource machine translation (MT) has gained increasing attention as parallel data from low-resource language communities is collected, but many approaches for improving low-resource MT remain underexplored. We investigate a…

Computation and Language · Computer Science 2026-03-19 Ahmed Attia, Alham Fikri Aji

Handling Syntactic Divergence in Low-resource Machine Translation

Despite impressive empirical successes of neural machine translation (NMT) on standard benchmarks, limited parallel data impedes the application of NMT models to many language pairs. Data augmentation methods such as back-translation make…

Computation and Language · Computer Science 2019-10-08 Chunting Zhou, Xuezhe Ma, Junjie Hu, Graham Neubig

On the Weaknesses of Reinforcement Learning for Neural Machine Translation

Reinforcement learning (RL) is frequently used to increase performance in text generation tasks, including machine translation (MT), notably through the use of Minimum Risk Training (MRT) and Generative Adversarial Networks (GAN). However,…

Computation and Language · Computer Science 2020-01-16 Leshem Choshen, Lior Fox, Zohar Aizenbud, Omri Abend

Neural Machine Translation For Low Resource Languages

Neural Machine translation is a challenging task due to the inherent complex nature and the fluidity that natural languages bring. Nonetheless, in recent years, it has achieved state-of-the-art performance in several language pairs.…

Computation and Language · Computer Science 2023-04-19 Vakul Goyle, Parvathy Krishnaswamy, Kannan Girija Ravikumar, Utsa Chattopadhyay, Kartikay Goyle

Semi-supervised Neural Machine Translation with Consistency Regularization for Low-Resource Languages

The advent of deep learning has led to a significant gain in machine translation. However, most of the studies required a large parallel dataset which is scarce and expensive to construct and even unavailable for some languages. This paper…

Computation and Language · Computer Science 2023-04-04 Viet H. Pham, Thang M. Pham, Giang Nguyen, Long Nguyen, Dien Dinh

Meta-Learning for Low-Resource Neural Machine Translation

In this paper, we propose to extend the recently introduced model-agnostic meta-learning algorithm (MAML) for low-resource neural machine translation (NMT). We frame low-resource translation as a meta-learning problem, and we learn to adapt…

Computation and Language · Computer Science 2018-08-28 Jiatao Gu, Yong Wang, Yun Chen, Kyunghyun Cho, Victor O. K. Li