Related papers: Using Machine Translation to Augment Multilingual …

Revisiting Machine Translation for Cross-lingual Classification

Machine Translation (MT) has been widely used for cross-lingual classification, either by translating the test set into English and running inference with a monolingual model (translate-test), or translating the training set into the target…

Computation and Language · Computer Science 2023-05-24 Mikel Artetxe, Vedanuj Goswami, Shruti Bhosale, Angela Fan, Luke Zettlemoyer

Can Machine Translation Bridge Multilingual Pretraining and Cross-lingual Transfer Learning?

Multilingual pretraining and fine-tuning have remarkably succeeded in various natural language processing tasks. Transferring representations from one language to another is especially crucial for cross-lingual learning. One can expect…

Computation and Language · Computer Science 2024-03-26 Shaoxiong Ji, Timothee Mickus, Vincent Segonne, Jörg Tiedemann

Transductive Data-Selection Algorithms for Fine-Tuning Neural Machine Translation

Machine Translation models are trained to translate a variety of documents from one language into another. However, models specifically trained for particular characteristics of the documents tend to perform better. Fine-tuning is a…

Computation and Language · Computer Science 2019-10-09 Alberto Poncelas, Gideon Maillette de Buy Wenniger, Andy Way

Analyzing and Improving Cross-lingual Knowledge Transfer for Machine Translation

Multilingual machine translation systems aim to make knowledge accessible across languages, yet learning effective cross-lingual representations remains challenging. These challenges are especially pronounced for low-resource languages,…

Computation and Language · Computer Science 2026-01-08 David Stap

The Reality of Multi-Lingual Machine Translation

Our book "The Reality of Multi-Lingual Machine Translation" discusses the benefits and perils of using more than two languages in machine translation systems. While focused on the particular task of sequence-to-sequence processing and…

Computation and Language · Computer Science 2022-02-28 Tom Kocmi, Dominik Macháček, Ondřej Bojar

The Effect of Language Diversity When Fine-Tuning Large Language Models for Translation

Prior research diverges on language diversity in LLM fine-tuning: Some studies report benefits while others find no advantages. Through controlled fine-tuning experiments across 132 translation directions, we systematically resolve these…

Computation and Language · Computer Science 2025-09-22 David Stap, Christof Monz

Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models?

Multilingual large language models are designed, claimed, and expected to cater to speakers of varied languages. We hypothesise that the current practices of fine-tuning and evaluating these models may not perfectly align with this…

Computation and Language · Computer Science 2024-09-27 Pinzhen Chen, Simon Yu, Zhicheng Guo, Barry Haddow

Improving Language Model Integration for Neural Machine Translation

The integration of language models for neural machine translation has been extensively studied in the past. It has been shown that an external language model, trained on additional target-side monolingual data, can help improve translation…

Computation and Language · Computer Science 2023-06-09 Christian Herold, Yingbo Gao, Mohammad Zeineldeen, Hermann Ney

Automatic Discrimination of Human and Neural Machine Translation in Multilingual Scenarios

We tackle the task of automatically discriminating between human and machine translations. As opposed to most previous work, we perform experiments in a multilingual setting, considering multiple languages and multilingual pretrained…

Computation and Language · Computer Science 2023-06-01 Malina Chichirau, Rik van Noord, Antonio Toral

Fine-Tuning Large Language Models to Translate: Will a Touch of Noisy Data in Misaligned Languages Suffice?

Traditionally, success in multilingual machine translation can be attributed to three key factors in training data: large volume, diverse translation directions, and high quality. In the current practice of fine-tuning large language models…

Computation and Language · Computer Science 2024-10-07 Dawei Zhu, Pinzhen Chen, Miaoran Zhang, Barry Haddow, Xiaoyu Shen, Dietrich Klakow

How Multilingual Are Large Language Models Fine-Tuned for Translation?

A new paradigm for machine translation has recently emerged: fine-tuning large language models (LLMs) on parallel text has been shown to outperform dedicated translation systems trained in a supervised fashion on much larger amounts of…

Computation and Language · Computer Science 2024-06-03 Aquia Richburg, Marine Carpuat

Machine Translation: A Literature Review

Machine translation (MT) plays an important role in benefiting linguists, sociologists, computer scientists, etc. by processing natural language to translate it into some other natural language. And this demand has grown exponentially over…

Computation and Language · Computer Science 2019-01-07 Ankush Garg, Mayank Agarwal

Multilingual Translation with Extensible Multilingual Pretraining and Finetuning

Recent work demonstrates the potential of multilingual pretraining to create one model that can be used for various tasks in different languages. Previous work in multilingual pretraining has demonstrated that machine translation systems…

Computation and Language · Computer Science 2020-08-04 Yuqing Tang, Chau Tran, Xian Li, Peng-Jen Chen, Naman Goyal, Vishrav Chaudhary, Jiatao Gu, Angela Fan

Document-Level Language Models for Machine Translation

Despite the known limitations, most machine translation systems today still operate on the sentence level. One reason for this is that most parallel training data is only sentence-level aligned, without document-level meta information…

Computation and Language · Computer Science 2023-10-20 Frithjof Petrick, Christian Herold, Pavel Petrushkov, Shahram Khadivi, Hermann Ney

Multilingual Neural Machine Translation with Task-Specific Attention

Multilingual machine translation addresses the task of translating between multiple source and target languages. We propose task-specific attention models, a simple but effective technique for improving the quality of sequence-to-sequence…

Computation and Language · Computer Science 2018-06-11 Graeme Blackwood, Miguel Ballesteros, Todd Ward

Conditions for Catastrophic Forgetting in Multilingual Translation

Fine-tuning multilingual foundation models on specific languages often induces catastrophic forgetting, degrading performance on languages unseen in fine-tuning. While this phenomenon is widely documented, the literature presents fragmented…

Computation and Language · Computer Science 2025-10-23 Danni Liu, Jan Niehues

Do Multilingual Language Models Think Better in English?

Translate-test is a popular technique to improve the performance of multilingual language models. This approach works by translating the input into English using an external machine translation system, and running inference over the…

Computation and Language · Computer Science 2023-08-03 Julen Etxaniz, Gorka Azkune, Aitor Soroa, Oier Lopez de Lacalle, Mikel Artetxe

Balancing Training for Multilingual Neural Machine Translation

When training multilingual machine translation (MT) models that can translate to/from multiple languages, we are faced with imbalanced training sets: some languages have much more training data than others. Standard practice is to up-sample…

Computation and Language · Computer Science 2020-09-08 Xinyi Wang, Yulia Tsvetkov, Graham Neubig

Understanding the effects of language-specific class imbalance in multilingual fine-tuning

We study the effect of one type of imbalance often present in real-life multilingual classification datasets: an uneven distribution of labels across languages. We show evidence that fine-tuning a transformer-based Large Language Model…

Computation and Language · Computer Science 2024-02-21 Vincent Jung, Lonneke van der Plas

The Fine-Tuning Paradox: Boosting Translation Quality Without Sacrificing LLM Abilities

Fine-tuning large language models (LLMs) for machine translation has shown improvements in overall translation quality. However, it is unclear what impact fine-tuning has on desirable LLM behaviors that are not present in neural…

Computation and Language · Computer Science 2024-08-07 David Stap, Eva Hasler, Bill Byrne, Christof Monz, Ke Tran