Related papers: Gradient Coordination for Quantifying and Maximizi…

Fair Resource Allocation in Multi-Task Learning

By jointly learning multiple tasks, multi-task learning (MTL) can leverage the shared knowledge across tasks, resulting in improved data efficiency and generalization performance. However, a major challenge in MTL lies in the presence of…

Machine Learning · Computer Science 2024-07-03 Hao Ban, Kaiyi Ji

Conflict-Averse Gradient Descent for Multi-task Learning

The goal of multi-task learning is to enable more efficient learning than single task learning by sharing model structures for a diverse set of tasks. A standard multi-task learning objective is to minimize the average loss across all…

Machine Learning · Computer Science 2024-02-22 Bo Liu, Xingchao Liu, Xiaojie Jin, Peter Stone, Qiang Liu

RotoGrad: Gradient Homogenization in Multitask Learning

Multitask learning is being increasingly adopted in application domains like computer vision and reinforcement learning. However, optimally exploiting its advantages remains a major challenge due to the effect of negative transfer.…

Machine Learning · Computer Science 2022-02-17 Adrián Javaloy, Isabel Valera

GDOD: Effective Gradient Descent using Orthogonal Decomposition for Multi-Task Learning

Multi-task learning (MTL) aims at solving multiple related tasks simultaneously and has experienced rapid growth in recent years. However, MTL models often suffer from performance degeneration with negative transfer due to learning several…

Machine Learning · Computer Science 2023-02-01 Xin Dong, Ruize Wu, Chao Xiong, Hai Li, Lei Cheng, Yong He, Shiyou Qian, Jian Cao, Linjian Mo

Beyond Losses Reweighting: Empowering Multi-Task Learning via the Generalization Perspective

Multi-task learning (MTL) trains deep neural networks to optimize several objectives simultaneously using a shared backbone, which leads to reduced computational costs, improved data efficiency, and enhanced performance through cross-task…

Machine Learning · Computer Science 2025-09-30 Hoang Phan, Lam Tran, Quyen Tran, Ngoc N. Tran, Tuan Truong, Qi Lei, Nhat Ho, Dinh Phung, Trung Le

MultiBalance: Multi-Objective Gradient Balancing in Industrial-Scale Multi-Task Recommendation System

In industrial recommendation systems, multi-task learning (learning multiple tasks simultaneously on a single model) is a predominant approach to save training/serving resources and improve recommendation performance via knowledge transfer…

Information Retrieval · Computer Science 2024-11-20 Yun He, Xuxing Chen, Jiayi Xu, Renqin Cai, Yiling You, Jennifer Cao, Minhui Huang, Liu Yang, Yiqun Liu, Xiaoyi Liu, Rong Jin, Sem Park, Bo Long, Xue Feng

Fantastic Multi-Task Gradient Updates and How to Find Them In a Cone

Balancing competing objectives remains a fundamental challenge in multi-task learning (MTL), primarily due to conflicting gradients across individual tasks. A common solution relies on computing a dynamic gradient update vector that…

Machine Learning · Computer Science 2025-02-04 Negar Hassanpour, Muhammad Kamran Janjua, Kunlin Zhang, Sepehr Lavasani, Xiaowen Zhang, Chunhua Zhou, Chao Gao

Sequential Reptile: Inter-Task Gradient Alignment for Multilingual Learning

Multilingual models jointly pretrained on multiple languages have achieved remarkable performance on various multilingual downstream tasks. Moreover, models finetuned on a single monolingual downstream task have been shown to generalize to…

Computation and Language · Computer Science 2022-03-01 Seanie Lee, Hae Beom Lee, Juho Lee, Sung Ju Hwang

Multitask Learning with Single Gradient Step Update for Task Balancing

Multitask learning is a methodology to boost generalization performance and also reduce computational intensity and memory usage. However, learning multiple tasks simultaneously can be more difficult than learning a single task because it…

Machine Learning · Computer Science 2020-06-03 Sungjae Lee, Youngdoo Son

Asynchronous Multi-Task Learning

Many real-world machine learning applications involve several learning tasks which are inter-related. For example, in the healthcare domain, we need to learn a predictive model of a certain disease for many hospitals. The models for each…

Machine Learning · Computer Science 2016-10-03 Inci M. Baytas, Ming Yan, Anil K. Jain, Jiayu Zhou

Knowledge Distillation for Multi-task Learning

Multi-task learning (MTL) aims to learn a single model that performs multiple tasks, achieving good performance on all of them at a lower computational cost. Learning such a model requires jointly optimizing the losses of a set of tasks with…

Computer Vision and Pattern Recognition · Computer Science 2020-09-25 Wei-Hong Li, Hakan Bilen

MoKD: Multi-Task Optimization for Knowledge Distillation

Compact models can be effectively trained through Knowledge Distillation (KD), a technique that transfers knowledge from larger, high-performing teacher models. Two key challenges in Knowledge Distillation (KD) are: 1) balancing learning…

Computer Vision and Pattern Recognition · Computer Science 2025-08-05 Zeeshan Hayder, Ali Cheraghian, Lars Petersson, Mehrtash Harandi

Measuring and Harnessing Transference in Multi-Task Learning

Multi-task learning can leverage information learned by one task to benefit the training of other tasks. Despite this capacity, naive formulations often degrade performance and in particular, identifying the tasks that would benefit from…

Machine Learning · Computer Science 2021-09-13 Christopher Fifty, Ehsan Amid, Zhe Zhao, Tianhe Yu, Rohan Anil, Chelsea Finn

Cross-Task Knowledge Distillation in Multi-Task Recommendation

Multi-task learning (MTL) has been widely used in recommender systems, wherein predicting each type of user feedback on items (e.g., click, purchase) is treated as an individual task and jointly trained with a unified model. Our key…

Information Retrieval · Computer Science 2022-03-29 Chenxiao Yang, Junwei Pan, Xiaofeng Gao, Tingyu Jiang, Dapeng Liu, Guihai Chen

Injecting Imbalance Sensitivity for Multi-Task Learning

Multi-task learning (MTL) has emerged as a promising approach for deploying deep learning models in real-life applications. Recent studies have proposed optimization-based learning paradigms to establish task-shared representations in MTL.…

Machine Learning · Computer Science 2025-03-12 Zhipeng Zhou, Liu Liu, Peilin Zhao, Wei Gong

Leveraging convergence behavior to balance conflicting tasks in multi-task learning

Multi-Task Learning is a learning paradigm that uses correlated tasks to improve performance generalization. A common way to learn multiple tasks is through the hard parameter sharing approach, in which a single architecture is used to…

Machine Learning · Computer Science 2022-04-15 Angelica Tiemi Mizuno Nakamura, Denis Fernando Wolf, Valdir Grassi

Distribution Matching for Multi-Task Learning of Classification Tasks: a Large-Scale Study on Faces & Beyond

Multi-Task Learning (MTL) is a framework in which multiple related tasks are learned jointly and benefit from a shared representation space or parameter transfer. To provide sufficient learning support, modern MTL uses annotated data with…

Computer Vision and Pattern Recognition · Computer Science 2024-01-04 Dimitrios Kollias, Viktoriia Sharmanska, Stefanos Zafeiriou

Layerwise Optimization by Gradient Decomposition for Continual Learning

Deep neural networks achieve state-of-the-art and sometimes super-human performance across various domains. However, when learning tasks sequentially, the networks easily forget the knowledge of previous tasks, known as "catastrophic…

Computer Vision and Pattern Recognition · Computer Science 2021-05-18 Shixiang Tang, Dapeng Chen, Jinguo Zhu, Shijie Yu, Wanli Ouyang

Meta-learning the Learning Trends Shared Across Tasks

Meta-learning stands for 'learning to learn' such that generalization to new tasks is achieved. Among these methods, gradient-based meta-learning algorithms are a specific sub-class that excels at quick adaptation to new tasks with limited…

Machine Learning · Computer Science 2020-10-20 Jathushan Rajasegaran, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Mubarak Shah

Direct Routing Gradient (DRGrad): A Personalized Information Surgery for Multi-Task Learning (MTL) Recommendations

Multi-task learning (MTL) has emerged as a successful strategy in industrial-scale recommender systems, offering significant advantages such as capturing diverse users' interests and accurately detecting different behaviors like "click" or…

Machine Learning · Computer Science 2025-10-14 Yuguang Liu, Yiyun Miao, Luyao Xia