English

Dual-Balancing for Multi-Task Learning

Machine Learning 2025-11-27 v3 Artificial Intelligence

Abstract

Multi-task learning aims to learn multiple related tasks simultaneously and has achieved great success in various fields. However, the disparity in loss and gradient scales among tasks often leads to performance compromises, and the balancing of tasks remains a significant challenge. In this paper, we propose Dual-Balancing Multi-Task Learning (DB-MTL) to achieve task balancing from both the loss and gradient perspectives. Specifically, DB-MTL achieves loss-scale balancing by performing logarithm transformation on each task loss, and rescales gradient magnitudes by normalizing all task gradients to comparable magnitudes using the maximum gradient norm. Extensive experiments on a number of benchmark datasets demonstrate that DB-MTL consistently performs better than the current state-of-the-art.

Keywords

Cite

@article{arxiv.2308.12029,
  title  = {Dual-Balancing for Multi-Task Learning},
  author = {Baijiong Lin and Weisen Jiang and Feiyang Ye and Yu Zhang and Pengguang Chen and Ying-Cong Chen and Shu Liu and Ivor W. Tsang and James T. Kwok},
  journal= {arXiv preprint arXiv:2308.12029},
  year   = {2025}
}

Comments

Accepted by Neural Networks