English
Related papers

Related papers: When Does Unsupervised Machine Translation Work?

200 papers

This paper studies the practicality of the current state-of-the-art unsupervised methods in neural machine translation (NMT). In ten translation tasks with various data settings, we analyze the conditions under which the unsupervised…

Computation and Language · Computer Science 2020-04-23 Yunsu Kim , Miguel Graça , Hermann Ney

Unsupervised machine translation---i.e., not assuming any cross-lingual supervision signal, whether a dictionary, translations, or comparable corpora---seems impossible, but nevertheless, Lample et al. (2018) recently proposed a fully…

Computation and Language · Computer Science 2018-05-10 Anders Søgaard , Sebastian Ruder , Ivan Vulić

Modern unsupervised machine translation (MT) systems reach reasonable translation quality under clean and controlled data conditions. As the performance gap between supervised and unsupervised MT narrows, it is interesting to ask whether…

Computation and Language · Computer Science 2022-04-15 Kelly Marchisio , Markus Freitag , David Grangier

This work focuses on comparing different solutions for machine translation on low resource language pairs, namely, with zero-shot transfer learning and unsupervised machine translation. We discuss how the data size affects the performance…

Computation and Language · Computer Science 2021-04-02 Aviral Joshi , Chengzhi Huang , Har Simrat Singh

Neural machine translation~(NMT) is ineffective for zero-resource languages. Recent works exploring the possibility of unsupervised neural machine translation (UNMT) with only monolingual data can achieve promising results. However, there…

Computation and Language · Computer Science 2021-04-02 Mingxuan Wang , Hongxiao Bai , Hai Zhao , Lei Li

While machine translation has traditionally relied on large amounts of parallel corpora, a recent research line has managed to train both Neural Machine Translation (NMT) and Statistical Machine Translation (SMT) systems using monolingual…

Computation and Language · Computer Science 2021-12-28 Mikel Artetxe , Gorka Labaka , Eneko Agirre

Achieving universal translation between all human language pairs is the holy-grail of machine translation (MT) research. While recent progress in massively multilingual MT is one step closer to reaching this goal, it is becoming evident…

Computation and Language · Computer Science 2022-01-14 Aditya Siddhant , Ankur Bapna , Orhan Firat , Yuan Cao , Mia Xu Chen , Isaac Caswell , Xavier Garcia

Unsupervised neural machine translation (UNMT) that relies solely on massive monolingual corpora has achieved remarkable results in several translation tasks. However, in real-world scenarios, massive monolingual corpora do not exist for…

Computation and Language · Computer Science 2021-05-25 Haipeng Sun , Rui Wang , Kehai Chen , Masao Utiyama , Eiichiro Sumita , Tiejun Zhao

Unsupervised Neural Machine Translation (UNMT) focuses on improving NMT results under the assumption there is no human translated parallel data, yet little work has been done so far in highlighting its advantages compared to supervised…

Computation and Language · Computer Science 2023-12-21 Isidora Chara Tourni , Derry Wijaya

Unsupervised machine translation, which utilizes unpaired monolingual corpora as training data, has achieved comparable performance against supervised machine translation. However, it still suffers from data-scarce domains. To address this…

Computation and Language · Computer Science 2021-05-10 Cheonbok Park , Yunwon Tae , Taehee Kim , Soyoung Yang , Mohammad Azam Khan , Eunjeong Park , Jaegul Choo

Domain adaptation has been well-studied in supervised neural machine translation (SNMT). However, it has not been well-studied for unsupervised neural machine translation (UNMT), although UNMT has recently achieved remarkable results in…

Computation and Language · Computer Science 2020-05-06 Haipeng Sun , Rui Wang , Kehai Chen , Masao Utiyama , Eiichiro Sumita , Tiejun Zhao , Chenhui Chu

Even with the latest developments in deep learning and large-scale language modeling, the task of machine translation (MT) of low-resource languages remains a challenge. Neural MT systems can be trained in an unsupervised way without any…

Computation and Language · Computer Science 2023-10-24 Ivana Kvapilíková , Ondřej Bojar

The vast majority of evaluation metrics for machine translation are supervised, i.e., (i) are trained on human scores, (ii) assume the existence of reference translations, or (iii) leverage parallel data. This hinders their applicability to…

Computation and Language · Computer Science 2024-03-05 Jonas Belouadi , Steffen Eger

Unsupervised neural machine translation (UNMT) has recently achieved remarkable results with only large monolingual corpora in each language. However, the uncertainty of associating target with source sentences makes UNMT theoretically an…

Computer Vision and Pattern Recognition · Computer Science 2019-05-28 Yuanhang Su , Kai Fan , Nguyen Bach , C. -C. Jay Kuo , Fei Huang

With the advent of new and advanced programming languages, it becomes imperative to migrate legacy software to new programming languages. Unsupervised Machine Learning-based Program Translation could play an essential role in such…

Programming Languages · Computer Science 2023-02-22 Aniketh Malyala , Katelyn Zhou , Baishakhi Ray , Saikat Chakraborty

Automatic machine translation (MT) metrics are widely used to distinguish the translation qualities of machine translation systems across relatively large test sets (system-level evaluation). However, it is unclear if automatic metrics are…

Computation and Language · Computer Science 2023-06-21 Nikita Moghe , Tom Sherborne , Mark Steedman , Alexandra Birch

Recent work achieved remarkable results in training neural machine translation (NMT) systems in a fully unsupervised way, with new and dedicated architectures that rely on monolingual corpora only. In this work, we propose to define…

Computation and Language · Computer Science 2018-10-31 Benjamin Marie , Atsushi Fujita

Unsupervised on-the-fly back-translation, in conjunction with multilingual pretraining, is the dominant method for unsupervised neural machine translation. Theoretically, however, the method should not work in general. We therefore conduct…

Computation and Language · Computer Science 2024-03-28 Nicolas Guerin , Shane Steinert-Threlkeld , Emmanuel Chemla

Without real bilingual corpus available, unsupervised Neural Machine Translation (NMT) typically requires pseudo parallel data generated with the back-translation method for the model training. However, due to weak supervision, the pseudo…

Computation and Language · Computer Science 2019-01-15 Shuo Ren , Zhirui Zhang , Shujie Liu , Ming Zhou , Shuai Ma

Unsupervised cross-lingual pretraining has achieved strong results in neural machine translation (NMT), by drastically reducing the need for large parallel data. Most approaches adapt masked-language modeling (MLM) to sequence-to-sequence…

Computation and Language · Computer Science 2021-06-11 Christos Baziotis , Ivan Titov , Alexandra Birch , Barry Haddow
‹ Prev 1 2 3 10 Next ›