Related papers: TaskWeb: Selecting Better Source Tasks for Multi-t…

Exploring and Predicting Transferability across NLP Tasks

Recent advances in NLP demonstrate the effectiveness of training large-scale language models and transferring them to downstream tasks. Can fine-tuning these models on tasks other than language modeling further improve performance? In this…

Computation and Language · Computer Science · 2020-10-08 · Tu Vu, Tong Wang, Tsendsuren Munkhdalai, Alessandro Sordoni, Adam Trischler, Andrew Mattarella-Micke, Subhransu Maji, Mohit Iyyer

A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks

Transfer and multi-task learning have traditionally focused on either a single source-target pair or very few, similar tasks. Ideally, the linguistic levels of morphology, syntax and semantics would benefit each other by being trained in a…

Computation and Language · Computer Science · 2017-07-25 · Kazuma Hashimoto, Caiming Xiong, Yoshimasa Tsuruoka, Richard Socher

A Survey of Multi-task Learning in Natural Language Processing: Regarding Task Relatedness and Training Methods

Multi-task learning (MTL) has become increasingly popular in natural language processing (NLP) because it improves the performance of related tasks by exploiting their commonalities and differences. Nevertheless, it is still not understood…

Computation and Language · Computer Science · 2023-02-16 · Zhihan Zhang, Wenhao Yu, Mengxia Yu, Zhichun Guo, Meng Jiang

MetaXT: Meta Cross-Task Transfer between Disparate Label Spaces

Albeit the universal representational power of pre-trained language models, adapting them onto a specific NLP task still requires a considerably large amount of labeled data. Effective task fine-tuning meets challenges when only a few…

Machine Learning · Computer Science · 2021-09-10 · Srinagesh Sharma, Guoqing Zheng, Ahmed Hassan Awadallah

Less is More: Parameter-Efficient Selection of Intermediate Tasks for Transfer Learning

Intermediate task transfer learning can greatly improve model performance. If, for example, one has little training data for emotion detection, first fine-tuning a language model on a sentiment classification dataset may improve performance…

Computation and Language · Computer Science · 2024-10-22 · David Schulte, Felix Hamborg, Alan Akbik

Efficient Task Grouping Through Samplewise Optimisation Landscape Analysis

Shared training approaches, such as multi-task learning (MTL) and gradient-based meta-learning, are widely used in various machine learning applications, but they often suffer from negative transfer, leading to performance degradation in…

Machine Learning · Computer Science · 2024-12-10 · Anshul Thakur, Yichen Huang, Soheila Molaei, Yujiang Wang, David A. Clifton

Multi-Task Learning for Sequence Tagging: An Empirical Study

We study three general multi-task learning (MTL) approaches on 11 sequence tagging tasks. Our extensive empirical results show that in about 50% of the cases, jointly learning all 11 tasks improves upon either independent or pairwise…

Computation and Language · Computer Science · 2018-08-14 · Soravit Changpinyo, Hexiang Hu, Fei Sha

Identification of Negative Transfers in Multitask Learning Using Surrogate Models

Multitask learning is widely used in practice to train a low-resource target task by augmenting it with multiple related source tasks. Yet, naively combining all the source tasks with a target task does not always improve the prediction…

Machine Learning · Computer Science · 2023-12-29 · Dongyue Li, Huy L. Nguyen, Hongyang R. Zhang

Exploring the Effectiveness and Consistency of Task Selection in Intermediate-Task Transfer Learning

Identifying beneficial tasks to transfer from is a critical step toward successful intermediate-task transfer learning. In this work, we experiment with 130 source-target task combinations and demonstrate that the transfer performance…

Computation and Language · Computer Science · 2024-07-24 · Pin-Jie Lin, Miaoran Zhang, Marius Mosbach, Dietrich Klakow

Diverse Distributions of Self-Supervised Tasks for Meta-Learning in NLP

Meta-learning considers the problem of learning an efficient learning process that can leverage its past experience to accurately solve new tasks. However, the efficacy of meta-learning crucially depends on the distribution of tasks…

Computation and Language · Computer Science · 2021-11-03 · Trapit Bansal, Karthick Gunasekaran, Tong Wang, Tsendsuren Munkhdalai, Andrew McCallum

Relational Multi-Task Learning: Modeling Relations between Data and Tasks

A key assumption in multi-task learning is that at the inference time the multi-task model only has access to a given data point but not to the data point's labels from other tasks. This presents an opportunity to extend multi-task learning…

Machine Learning · Computer Science · 2023-03-15 · Kaidi Cao, Jiaxuan You, Jure Leskovec

Identifying Task Groupings for Multi-Task Learning Using Pointwise V-Usable Information

The success of multi-task learning can depend heavily on which tasks are grouped together. Naively grouping all tasks or a random set of tasks can result in negative transfer, with the multi-task models performing worse than single-task…

Computation and Language · Computer Science · 2025-07-18 · Yingya Li, Timothy Miller, Steven Bethard, Guergana Savova

Efficiently Tuned Parameters are Task Embeddings

Intermediate-task transfer can benefit a wide range of NLP tasks with properly selected source datasets. However, it is computationally infeasible to experiment with all intermediate transfer combinations, making choosing a useful source…

Computation and Language · Computer Science · 2022-10-24 · Wangchunshu Zhou, Canwen Xu, Julian McAuley

XMixup: Efficient Transfer Learning with Auxiliary Samples by Cross-domain Mixup

Transferring knowledge from large source datasets is an effective way to fine-tune the deep neural networks of the target task with a small sample size. A great number of algorithms have been proposed to facilitate deep transfer learning,…

Machine Learning · Computer Science · 2020-07-21 · Xingjian Li, Haoyi Xiong, Haozhe An, Chengzhong Xu, Dejing Dou

Learning New Tasks from a Few Examples with Soft-Label Prototypes

Existing approaches to few-shot learning in NLP rely on large language models (LLMs) and/or fine-tuning of these to generalise on out-of-distribution data. In this work, we propose a novel few-shot learning approach based on soft-label…

Machine Learning · Computer Science · 2024-09-24 · Avyav Kumar Singh, Ekaterina Shutova, Helen Yannakoudakis

Using Task Descriptions in Lifelong Machine Learning for Improved Performance and Zero-Shot Transfer

Knowledge transfer between tasks can improve the performance of learned models, but requires an accurate estimate of the inter-task relationships to identify the relevant knowledge to transfer. These inter-task relationships are typically…

Machine Learning · Computer Science · 2017-10-12 · David Isele, Mohammad Rostami, Eric Eaton

Dynamic Task Vector Grouping for Efficient Multi-Task Prompt Tuning

Multi-task prompt tuning utilizes multiple high-resource source tasks to improve performance on low-source target tasks. Existing approaches transfer the soft prompt trained by combining all source tasks or a single "high-similar" source…

Computation and Language · Computer Science · 2025-03-25 · Pieyi Zhang, Richong Zhang, Zhijie Nie

Towards Principled Task Grouping for Multi-Task Learning

Multi-task learning (MTL) aims to leverage shared information among tasks to improve learning efficiency and accuracy. However, MTL often struggles to effectively manage positive and negative transfer between tasks, which can hinder…

Machine Learning · Computer Science · 2025-05-19 · Chenguang Wang, Xuanhao Pan, Tianshu Yu

Learning Task Relatedness in Multi-Task Learning for Images in Context

Multimedia applications often require concurrent solutions to multiple tasks. These tasks hold clues to each other's solutions; however, as these relations can be complex, this remains a rarely utilized property. When task relations are…

Computer Vision and Pattern Recognition · Computer Science · 2019-04-08 · Gjorgji Strezoski, Nanne van Noord, Marcel Worring

Multilinguality Does not Make Sense: Investigating Factors Behind Zero-Shot Transfer in Sense-Aware Tasks

Cross-lingual transfer is central to modern NLP, enabling models to perform tasks in languages different from those they were trained on. A common assumption is that training on more languages improves zero-shot transfer. We test this on…

Computation and Language · Computer Science · 2025-10-17 · Roksana Goworek, Haim Dubossarsky