Related papers: Adaptive Scheduling for Multi-Task Learning

Learning to Multi-Task Learn for Better Neural Machine Translation

Scarcity of parallel sentence pairs is a major challenge for training high quality neural machine translation (NMT) models in bilingually low-resource scenarios, as NMT is data-hungry. Multi-task learning is an elegant approach to inject…

Computation and Language · Computer Science 2020-01-13 Poorya Zaremoodi , Gholamreza Haffari

Balancing Training for Multilingual Neural Machine Translation

When training multilingual machine translation (MT) models that can translate to/from multiple languages, we are faced with imbalanced training sets: some languages have much more training data than others. Standard practice is to up-sample…

Computation and Language · Computer Science 2020-09-08 Xinyi Wang , Yulia Tsvetkov , Graham Neubig

Temporally Correlated Task Scheduling for Sequence Learning

Sequence learning has attracted much research attention from the machine learning community in recent years. In many applications, a sequence learning task is usually associated with multiple temporally correlated auxiliary tasks, which are…

Computation and Language · Computer Science 2021-07-05 Xueqing Wu , Lewen Wang , Yingce Xia , Weiqing Liu , Lijun Wu , Shufang Xie , Tao Qin , Tie-Yan Liu

Self-Paced Learning for Neural Machine Translation

Recent studies have proven that the training of neural machine translation (NMT) can be facilitated by mimicking the learning process of humans. Nevertheless, achievements of such kind of curriculum learning rely on the quality of…

Computation and Language · Computer Science 2022-10-20 Yu Wan , Baosong Yang , Derek F. Wong , Yikai Zhou , Lidia S. Chao , Haibo Zhang , Boxing Chen

Adaptive Scheduling for Machine Learning Tasks over Networks

A key functionality of emerging connected autonomous systems such as smart transportation systems, smart cities, and the industrial Internet-of-Things, is the ability to process and learn from data collected at different physical locations.…

Machine Learning · Computer Science 2021-01-26 Konstantinos Gatsis

Multilingual Neural Machine Translation with Task-Specific Attention

Multilingual machine translation addresses the task of translating between multiple source and target languages. We propose task-specific attention models, a simple but effective technique for improving the quality of sequence-to-sequence…

Computation and Language · Computer Science 2018-06-11 Graeme Blackwood , Miguel Ballesteros , Todd Ward

Adaptive and Robust Multi-Task Learning

We study the multi-task learning problem that aims to simultaneously analyze multiple datasets collected from different sources and learn one model for each of them. We propose a family of adaptive methods that automatically utilize…

Machine Learning · Statistics 2023-09-19 Yaqi Duan , Kaizheng Wang

An Empirical Exploration of Curriculum Learning for Neural Machine Translation

Machine translation systems based on deep neural networks are expensive to train. Curriculum learning aims to address this issue by choosing the order in which samples are presented during training to help train better models faster. We…

Computation and Language · Computer Science 2018-11-05 Xuan Zhang , Gaurav Kumar , Huda Khayrallah , Kenton Murray , Jeremy Gwinnup , Marianna J Martindale , Paul McNamee , Kevin Duh , Marine Carpuat

Incremental Adaptation Strategies for Neural Network Language Models

It is today acknowledged that neural network language models outperform backoff language models in applications like speech recognition or statistical machine translation. However, training these models on large amounts of data can take…

Neural and Evolutionary Computing · Computer Science 2015-07-08 Aram Ter-Sarkisov , Holger Schwenk , Loic Barrault , Fethi Bougares

A Model-based Approach for Sample-efficient Multi-task Reinforcement Learning

The aim of multi-task reinforcement learning is two-fold: (1) efficiently learn by training against multiple tasks and (2) quickly adapt, using limited samples, to a variety of new tasks. In this work, the tasks correspond to reward…

Machine Learning · Computer Science 2019-11-05 Nicholas C. Landolfi , Garrett Thomas , Tengyu Ma

From Bilingual to Multilingual Neural Machine Translation by Incremental Training

Multilingual Neural Machine Translation approaches are based on the use of task-specific models and the addition of one more language can only be done by retraining the whole system. In this work, we propose a new training schedule that…

Computation and Language · Computer Science 2019-07-12 Carlos Escolano , Marta R. Costa-Jussà , José A. R. Fonollosa

Neural Machine Translation Training in a Multi-Domain Scenario

In this paper, we explore alternative ways to train a neural machine translation system in a multi-domain scenario. We investigate data concatenation (with fine tuning), model stacking (multi-level fine tuning), data selection and…

Computation and Language · Computer Science 2018-11-21 Hassan Sajjad , Nadir Durrani , Fahim Dalvi , Yonatan Belinkov , Stephan Vogel

Order Matters in the Presence of Dataset Imbalance for Multilingual Learning

In this paper, we empirically study the optimization dynamics of multi-task learning, particularly focusing on those that govern a collection of tasks with significant data imbalance. We present a simple yet effective method of pre-training…

Computation and Language · Computer Science 2023-12-12 Dami Choi , Derrick Xin , Hamid Dadkhahi , Justin Gilmer , Ankush Garg , Orhan Firat , Chih-Kuan Yeh , Andrew M. Dai , Behrooz Ghorbani

Transductive Data-Selection Algorithms for Fine-Tuning Neural Machine Translation

Machine Translation models are trained to translate a variety of documents from one language into another. However, models specifically trained for a particular characteristics of the documents tend to perform better. Fine-tuning is a…

Computation and Language · Computer Science 2019-10-09 Alberto Poncelas , Gideon Maillette de Buy Wenniger , Andy Way

Meta-learning with an Adaptive Task Scheduler

To benefit the learning of a new task, meta-learning has been proposed to transfer a well-generalized meta-model learned from various meta-training tasks. Existing meta-learning algorithms randomly sample meta-training tasks with a uniform…

Machine Learning · Computer Science 2021-10-28 Huaxiu Yao , Yu Wang , Ying Wei , Peilin Zhao , Mehrdad Mahdavi , Defu Lian , Chelsea Finn

Accelerating Machine Learning Algorithms with Adaptive Sampling

The era of huge data necessitates highly efficient machine learning algorithms. Many common machine learning algorithms, however, rely on computationally intensive subroutines that are prohibitively expensive on large datasets. Oftentimes,…

Machine Learning · Computer Science 2023-09-26 Mo Tiwari

Transfer learning and subword sampling for asymmetric-resource one-to-many neural translation

There are several approaches for improving neural machine translation for low-resource languages: Monolingual data can be exploited via pretraining or data augmentation; Parallel corpora on related language pairs can be used via parameter…

Computation and Language · Computer Science 2020-12-10 Stig-Arne Grönroos , Sami Virpioja , Mikko Kurimo

Simple, Scalable Adaptation for Neural Machine Translation

Fine-tuning pre-trained Neural Machine Translation (NMT) models is the dominant approach for adapting to new languages and domains. However, fine-tuning requires adapting and maintaining a separate model for each target task. We propose a…

Computation and Language · Computer Science 2019-09-19 Ankur Bapna , Naveen Arivazhagan , Orhan Firat

Adaptive scheduling for adaptive sampling in POS taggers construction

We introduce an adaptive scheduling for adaptive sampling as a novel way of machine learning in the construction of part-of-speech taggers. The goal is to speed up the training on large data sets, without significant loss of performance…

Computation and Language · Computer Science 2024-02-06 Manuel Vilares Ferro , Victor M. Darriba Bilbao , Jesús Vilares Ferro

Tricks for Training Sparse Translation Models

Multi-task learning with an unbalanced data distribution skews model learning towards high resource tasks, especially when model capacity is fixed and fully shared across all tasks. Sparse scaling architectures, such as BASELayers, provide…

Computation and Language · Computer Science 2021-10-18 Dheeru Dua , Shruti Bhosale , Vedanuj Goswami , James Cross , Mike Lewis , Angela Fan