English
Related papers

Related papers: TaskWeb: Selecting Better Source Tasks for Multi-t…

200 papers

Recent advances in NLP demonstrate the effectiveness of training large-scale language models and transferring them to downstream tasks. Can fine-tuning these models on tasks other than language modeling further improve performance? In this…

Computation and Language · Computer Science 2020-10-08 Tu Vu , Tong Wang , Tsendsuren Munkhdalai , Alessandro Sordoni , Adam Trischler , Andrew Mattarella-Micke , Subhransu Maji , Mohit Iyyer

Transfer and multi-task learning have traditionally focused on either a single source-target pair or very few, similar tasks. Ideally, the linguistic levels of morphology, syntax and semantics would benefit each other by being trained in a…

Computation and Language · Computer Science 2017-07-25 Kazuma Hashimoto , Caiming Xiong , Yoshimasa Tsuruoka , Richard Socher

Multi-task learning (MTL) has become increasingly popular in natural language processing (NLP) because it improves the performance of related tasks by exploiting their commonalities and differences. Nevertheless, it is still not understood…

Computation and Language · Computer Science 2023-02-16 Zhihan Zhang , Wenhao Yu , Mengxia Yu , Zhichun Guo , Meng Jiang

Albeit the universal representational power of pre-trained language models, adapting them onto a specific NLP task still requires a considerably large amount of labeled data. Effective task fine-tuning meets challenges when only a few…

Machine Learning · Computer Science 2021-09-10 Srinagesh Sharma , Guoqing Zheng , Ahmed Hassan Awadallah

Intermediate task transfer learning can greatly improve model performance. If, for example, one has little training data for emotion detection, first fine-tuning a language model on a sentiment classification dataset may improve performance…

Computation and Language · Computer Science 2024-10-22 David Schulte , Felix Hamborg , Alan Akbik

Shared training approaches, such as multi-task learning (MTL) and gradient-based meta-learning, are widely used in various machine learning applications, but they often suffer from negative transfer, leading to performance degradation in…

Machine Learning · Computer Science 2024-12-10 Anshul Thakur , Yichen Huang , Soheila Molaei , Yujiang Wang , David A. Clifton

We study three general multi-task learning (MTL) approaches on 11 sequence tagging tasks. Our extensive empirical results show that in about 50% of the cases, jointly learning all 11 tasks improves upon either independent or pairwise…

Computation and Language · Computer Science 2018-08-14 Soravit Changpinyo , Hexiang Hu , Fei Sha

Multitask learning is widely used in practice to train a low-resource target task by augmenting it with multiple related source tasks. Yet, naively combining all the source tasks with a target task does not always improve the prediction…

Machine Learning · Computer Science 2023-12-29 Dongyue Li , Huy L. Nguyen , Hongyang R. Zhang

Identifying beneficial tasks to transfer from is a critical step toward successful intermediate-task transfer learning. In this work, we experiment with 130 source-target task combinations and demonstrate that the transfer performance…

Computation and Language · Computer Science 2024-07-24 Pin-Jie Lin , Miaoran Zhang , Marius Mosbach , Dietrich Klakow

Meta-learning considers the problem of learning an efficient learning process that can leverage its past experience to accurately solve new tasks. However, the efficacy of meta-learning crucially depends on the distribution of tasks…

Computation and Language · Computer Science 2021-11-03 Trapit Bansal , Karthick Gunasekaran , Tong Wang , Tsendsuren Munkhdalai , Andrew McCallum

A key assumption in multi-task learning is that at the inference time the multi-task model only has access to a given data point but not to the data point's labels from other tasks. This presents an opportunity to extend multi-task learning…

Machine Learning · Computer Science 2023-03-15 Kaidi Cao , Jiaxuan You , Jure Leskovec

The success of multi-task learning can depend heavily on which tasks are grouped together. Naively grouping all tasks or a random set of tasks can result in negative transfer, with the multi-task models performing worse than single-task…

Computation and Language · Computer Science 2025-07-18 Yingya Li , Timothy Miller , Steven Bethard , Guergana Savova

Intermediate-task transfer can benefit a wide range of NLP tasks with properly selected source datasets. However, it is computationally infeasible to experiment with all intermediate transfer combinations, making choosing a useful source…

Computation and Language · Computer Science 2022-10-24 Wangchunshu Zhou , Canwen Xu , Julian McAuley

Transferring knowledge from large source datasets is an effective way to fine-tune the deep neural networks of the target task with a small sample size. A great number of algorithms have been proposed to facilitate deep transfer learning,…

Machine Learning · Computer Science 2020-07-21 Xingjian Li , Haoyi Xiong , Haozhe An , Chengzhong Xu , Dejing Dou

Existing approaches to few-shot learning in NLP rely on large language models (LLMs) and/or fine-tuning of these to generalise on out-of-distribution data. In this work, we propose a novel few-shot learning approach based on soft-label…

Machine Learning · Computer Science 2024-09-24 Avyav Kumar Singh , Ekaterina Shutova , Helen Yannakoudakis

Knowledge transfer between tasks can improve the performance of learned models, but requires an accurate estimate of the inter-task relationships to identify the relevant knowledge to transfer. These inter-task relationships are typically…

Machine Learning · Computer Science 2017-10-12 David Isele , Mohammad Rostami , Eric Eaton

Multi-task prompt tuning utilizes multiple high-resource source tasks to improve performance on low-source target tasks. Existing approaches transfer the soft prompt trained by combining all source tasks or a single ``high-similar'' source…

Computation and Language · Computer Science 2025-03-25 Pieyi Zhang , Richong Zhang , Zhijie Nie

Multi-task learning (MTL) aims to leverage shared information among tasks to improve learning efficiency and accuracy. However, MTL often struggles to effectively manage positive and negative transfer between tasks, which can hinder…

Machine Learning · Computer Science 2025-05-19 Chenguang Wang , Xuanhao Pan , Tianshu Yu

Multimedia applications often require concurrent solutions to multiple tasks. These tasks hold clues to each-others solutions, however as these relations can be complex this remains a rarely utilized property. When task relations are…

Computer Vision and Pattern Recognition · Computer Science 2019-04-08 Gjorgji Strezoski , Nanne van Noord , Marcel Worring

Cross-lingual transfer is central to modern NLP, enabling models to perform tasks in languages different from those they were trained on. A common assumption is that training on more languages improves zero-shot transfer. We test this on…

Computation and Language · Computer Science 2025-10-17 Roksana Goworek , Haim Dubossarsky
‹ Prev 1 2 3 10 Next ›