Related papers: Task Selection Policies for Multitask Learning

Adaptive Scheduling for Multi-Task Learning

To train neural machine translation models simultaneously on multiple tasks (languages), it is common to sample each task uniformly or in proportion to dataset sizes. As these methods offer little control over performance trade-offs, we…

Machine Learning · Computer Science 2019-09-17 Sébastien Jean , Orhan Firat , Melvin Johnson

Learning to Multi-Task Learn for Better Neural Machine Translation

Scarcity of parallel sentence pairs is a major challenge for training high quality neural machine translation (NMT) models in bilingually low-resource scenarios, as NMT is data-hungry. Multi-task learning is an elegant approach to inject…

Computation and Language · Computer Science 2020-01-13 Poorya Zaremoodi , Gholamreza Haffari

Task Selection and Assignment for Multi-modal Multi-task Dialogue Act Classification with Non-stationary Multi-armed Bandits

Multi-task learning (MTL) aims to improve the performance of a primary task by jointly learning with related auxiliary tasks. Traditional MTL methods select tasks randomly during training. However, both previous studies and our results…

Computation and Language · Computer Science 2024-01-12 Xiangheng He , Junjie Chen , Björn W. Schuller

Adaptive and Robust Multi-Task Learning

We study the multi-task learning problem that aims to simultaneously analyze multiple datasets collected from different sources and learn one model for each of them. We propose a family of adaptive methods that automatically utilize…

Machine Learning · Statistics 2023-09-19 Yaqi Duan , Kaizheng Wang

Efficient Evaluation of Multi-Task Robot Policies With Active Experiment Selection

Evaluating learned robot control policies to determine their physical task-level capabilities costs experimenter time and effort. The growing number of policies and tasks exacerbates this issue. It is impractical to test every policy on…

Robotics · Computer Science 2025-02-17 Abrar Anwar , Rohan Gupta , Zain Merchant , Sayan Ghosh , Willie Neiswanger , Jesse Thomason

Balancing Synthetic Data and Replay for Enhancing Task-Specific Capabilities

Adapting language models to new tasks through continued pretraining faces a fundamental trade-off: models must learn new capabilities while avoiding catastrophic forgetting of existing knowledge. While prior work has studied synthetic data…

Machine Learning · Computer Science 2025-10-15 Urs Spiegelhalter , Jörg K. H. Franke , Frank Hutter

Simulating Learners' Task-Selection Strategies and System Constraints in Mastery Learning

Intelligent Tutoring Systems often grant learners shared control over skill and problem selection. This choice brings motivational and metacognitive benefits. At the same time, past literature suggests that learners exhibit diverse…

Human-Computer Interaction · Computer Science 2026-05-26 Haley Noh , Aarna Chowdhary , Jeroen Ooge , Vincent Aleven , Conrad Borchers

Online Multi-task Learning with Hard Constraints

We discuss multi-task online learning when a decision maker has to deal simultaneously with M tasks. The tasks are related, which is modeled by imposing that the M-tuple of actions taken by the decision maker needs to satisfy certain…

Machine Learning · Statistics 2009-03-27 Gabor Lugosi , Omiros Papaspiliopoulos , Gilles Stoltz

Online Budgeted Learning for Classifier Induction

In real-world machine learning applications, there is a cost associated with sampling of different features. Budgeted learning can be used to select which feature-values to acquire from each instance in a dataset, such that the best model…

Machine Learning · Computer Science 2019-03-14 Eran Fainman , Bracha Shapira , Lior Rokach , Yisroel Mirsky

Task Grouping for Automated Multi-Task Machine Learning via Task Affinity Prediction

When a number of similar tasks have to be learned simultaneously, multi-task learning (MTL) models can attain significantly higher accuracy than single-task learning (STL) models. However, the advantage of MTL depends on various factors,…

Machine Learning · Computer Science 2023-10-26 Afiya Ayman , Ayan Mukhopadhyay , Aron Laszka

Multi-Task Policy Search

Learning policies that generalize across multiple tasks is an important and challenging research topic in reinforcement learning and robotics. Training individual policies for every single potential task is often impractical, especially for…

Machine Learning · Statistics 2014-02-13 Marc Peter Deisenroth , Peter Englert , Jan Peters , Dieter Fox

Guiding Policies with Language via Meta-Learning

Behavioral skills or policies for autonomous agents are conventionally learned from reward functions, via reinforcement learning, or from demonstrations, via imitation learning. However, both modes of task specification have their…

Machine Learning · Computer Science 2019-01-30 John D. Co-Reyes , Abhishek Gupta , Suvansh Sanjeev , Nick Altieri , Jacob Andreas , John DeNero , Pieter Abbeel , Sergey Levine

Self-Paced Multitask Learning with Shared Knowledge

This paper introduces self-paced task selection to multitask learning, where instances from more closely related tasks are selected in a progression of easier-to-harder tasks, to emulate an effective human education strategy, but applied to…

Machine Learning · Statistics 2017-06-20 Keerthiram Murugesan , Jaime Carbonell

Multi-task Representation Learning with Stochastic Linear Bandits

We study the problem of transfer-learning in the setting of stochastic linear bandit tasks. We consider that a low dimensional linear representation is shared across the tasks, and study the benefit of learning this representation in the…

Machine Learning · Statistics 2023-08-16 Leonardo Cella , Karim Lounici , Grégoire Pacreau , Massimiliano Pontil

Multi-Task Reinforcement Learning with Soft Modularization

Multi-task learning is a very challenging problem in reinforcement learning. While training multiple tasks jointly allow the policies to share parameters across different tasks, the optimization problem becomes non-trivial: It remains…

Machine Learning · Computer Science 2020-12-08 Ruihan Yang , Huazhe Xu , Yi Wu , Xiaolong Wang

Bandit Guided Submodular Curriculum for Adaptive Subset Selection

Traditional curriculum learning proceeds from easy to hard samples, yet defining a reliable notion of difficulty remains elusive. Prior work has used submodular functions to induce difficulty scores in curriculum learning. We reinterpret…

Machine Learning · Computer Science 2025-12-01 Prateek Chanda , Prayas Agrawal , Saral Sureka , Lokesh Reddy Polu , Atharv Kshirsagar , Ganesh Ramakrishnan

Optimal Options for Multi-Task Reinforcement Learning Under Time Constraints

Reinforcement learning can greatly benefit from the use of options as a way of encoding recurring behaviours and to foster exploration. An important open problem is how can an agent autonomously learn useful options when solving particular…

Machine Learning · Computer Science 2020-01-07 Manuel Del Verme , Bruno Castro da Silva , Gianluca Baldassarre

Task Selection for Bandit-Based Task Assignment in Heterogeneous Crowdsourcing

Task selection (picking an appropriate labeling task) and worker selection (assigning the labeling task to a suitable worker) are two major challenges in task assignment for crowdsourcing. Recently, worker selection has been successfully…

Machine Learning · Computer Science 2015-07-28 Hao Zhang , Masashi Sugiyama

Machine Translation System Selection from Bandit Feedback

Adapting machine translation systems in the real world is a difficult problem. In contrast to offline training, users cannot provide the type of fine-grained feedback (such as correct translations) typically used for improving the system.…

Computation and Language · Computer Science 2020-09-03 Jason Naradowsky , Xuan Zhang , Kevin Duh

Multi-Task Learning for Contextual Bandits

Contextual bandits are a form of multi-armed bandit in which the agent has access to predictive side information (known as the context) for each arm at each time step, and have been used to model personalized news recommendation, ad…

Machine Learning · Statistics 2017-05-25 Aniket Anand Deshmukh , Urun Dogan , Clayton Scott