English
Related papers

Related papers: Task Compass: Scaling Multi-task Pre-training with…

200 papers

Conversational analysis systems are trained using noisy human labels and often require heavy preprocessing during multi-modal feature extraction. Using noisy labels in single-task learning increases the risk of over-fitting. Auxiliary tasks…

Computation and Language · Computer Science 2021-12-07 Joshua Yee Kim , Tongliang Liu , Kalina Yacef

Multi-task learning is to improve the performance of the model by transferring and exploiting common knowledge among tasks. Existing MTL works mainly focus on the scenario where label sets among multiple tasks (MTs) are usually the same,…

Machine Learning · Computer Science 2022-01-10 Quan Feng , Songcan Chen

Despite the recent advances in multi-task learning of dense prediction problems, most methods rely on expensive labelled datasets. In this paper, we present a label efficient approach and look at jointly learning of multiple dense…

Computer Vision and Pattern Recognition · Computer Science 2022-05-05 Wei-Hong Li , Xialei Liu , Hakan Bilen

Multi-Task Learning (MTL) is a framework, where multiple related tasks are learned jointly and benefit from a shared representation space, or parameter transfer. To provide sufficient learning support, modern MTL uses annotated data with…

Computer Vision and Pattern Recognition · Computer Science 2024-01-04 Dimitrios Kollias , Viktoriia Sharmanska , Stefanos Zafeiriou

We consider the problem of conversational question answering over a large-scale knowledge base. To handle huge entity vocabulary of a large-scale knowledge base, recent neural semantic parsing based approaches usually decompose the task…

Computation and Language · Computer Science 2019-10-14 Tao Shen , Xiubo Geng , Tao Qin , Daya Guo , Duyu Tang , Nan Duan , Guodong Long , Daxin Jiang

Many real-world applications require making multiple predictions from the same text. Fine-tuning a large pre-trained language model for each downstream task causes computational burdens in the inference time due to several times of forward…

Computation and Language · Computer Science 2023-10-17 Kuan-Hao Huang , Liang Tan , Rui Hou , Sinong Wang , Amjad Almahairi , Ruty Rinott

Multi-task learning and self-training are two common ways to improve a machine learning model's performance in settings with limited training data. Drawing heavily on ideas from those two approaches, we suggest transductive auxiliary task…

Computation and Language · Computer Science 2019-09-24 Johannes Bjerva , Katharina Kann , Isabelle Augenstein

In the paradigm of multi-task learning, mul- tiple related prediction tasks are learned jointly, sharing information across the tasks. We propose a framework for multi-task learn- ing that enables one to selectively share the information…

Machine Learning · Computer Science 2012-07-03 Abhishek Kumar , Hal Daume

We compare sequential fine-tuning with a model for multi-task learning in the context where we are interested in boosting performance on two tasks, one of which depends on the other. We test these models on the FigLang2022 shared task which…

Computation and Language · Computer Science 2022-11-01 Irina Bigoulaeva , Rachneet Sachdeva , Harish Tayyar Madabushi , Aline Villavicencio , Iryna Gurevych

Large Language Models (LLMs) have transformed NLP with their remarkable In-context Learning (ICL) capabilities. Automated assistants based on LLMs are gaining popularity; however, adapting them to novel tasks is still challenging. While…

Computation and Language · Computer Science 2024-06-13 Anwoy Chatterjee , Eshaan Tanwar , Subhabrata Dutta , Tanmoy Chakraborty

In multi-task learning, a learner is given a collection of prediction tasks and needs to solve all of them. In contrast to previous work, which required that annotated training data is available for all tasks, we consider a new setting, in…

Machine Learning · Statistics 2017-06-09 Anastasia Pentina , Christoph H. Lampert

We present two architectures for multi-task learning with neural sequence models. Our approach allows the relationships between different tasks to be learned dynamically, rather than using an ad-hoc pre-defined structure as in previous…

Computation and Language · Computer Science 2018-11-27 Pengfei Liu , Jie Fu , Yue Dong , Xipeng Qiu , Jackie Chi Kit Cheung

In this paper we consider a problem known as multi-task learning, consisting of fitting a set of classifier or regression functions intended for solving different tasks. In our novel formulation, we couple the parameters of these functions,…

Machine Learning · Computer Science 2021-05-28 Juan Cervino , Juan Andres Bazerque , Miguel Calvo-Fullana , Alejandro Ribeiro

Neural processes have recently emerged as a class of powerful neural latent variable models that combine the strengths of neural networks and stochastic processes. As they can encode contextual data in the network's function space, they…

Machine Learning · Computer Science 2021-12-03 Jiayi Shen , Xiantong Zhen , Marcel Worring , Ling Shao

Recent work has found that multi-task training with a large number of diverse tasks can uniformly improve downstream performance on unseen target tasks. In contrast, literature on task transferability has established that the choice of…

Computation and Language · Computer Science 2022-07-13 Vishakh Padmakumar , Leonard Lausen , Miguel Ballesteros , Sheng Zha , He He , George Karypis

Large language models are increasingly deployed across diverse applications. This often includes tasks LLMs have not encountered during training. This implies that enumerating and obtaining the high-quality training data for all tasks is…

Computation and Language · Computer Science 2025-11-11 Shambhavi Krishna , Atharva Naik , Chaitali Agarwal , Sudharshan Govindan , Taesung Lee , Haw-Shiuan Chang

Multi-task learning (MTL) has received considerable attention, and numerous deep learning applications benefit from MTL with multiple objectives. However, constructing multiple related tasks is difficult, and sometimes only a single task is…

Computer Vision and Pattern Recognition · Computer Science 2019-11-25 Tao Gui , Lizhi Qing , Qi Zhang , Jiacheng Ye , Hang Yan , Zichu Fei , Xuanjing Huang

Language models pretrained on text from a wide variety of sources form the foundation of today's NLP. In light of the success of these broad-coverage models, we investigate whether it is still helpful to tailor a pretrained model to the…

Computation and Language · Computer Science 2020-05-07 Suchin Gururangan , Ana Marasović , Swabha Swayamdipta , Kyle Lo , Iz Beltagy , Doug Downey , Noah A. Smith

In recent years, simultaneous learning of multiple dense prediction tasks with partially annotated label data has emerged as an important research area. Previous works primarily focus on leveraging cross-task relations or conducting…

Computer Vision and Pattern Recognition · Computer Science 2025-08-01 Jingdong Zhang , Hanrong Ye , Xin Li , Wenping Wang , Dan Xu

Fine-tuning pre-trained large language models (LLMs) on a diverse array of tasks has become a common approach for building models that can solve various natural language processing (NLP) tasks. However, where and to what extent these models…

Computation and Language · Computer Science 2024-10-29 Zheng Zhao , Yftah Ziser , Shay B. Cohen
‹ Prev 1 2 3 10 Next ›