Related papers: Representation Surgery for Multi-Task Model Mergin…

SurgeryV2: Bridging the Gap Between Model Merging and Multi-Task Learning with Deep Representation Surgery

Model merging-based multitask learning (MTL) offers a promising approach for performing MTL by merging multiple expert models without requiring access to raw training data. However, in this paper, we examine the merged model's…

Machine Learning · Computer Science · 2024-10-21 · Enneng Yang, Li Shen, Zhenyi Wang, Guibing Guo, Xingwei Wang, Xiaochun Cao, Jie Zhang, Dacheng Tao

Parameter-Efficient Interventions for Enhanced Model Merging

Model merging combines knowledge from task-specific models into a unified multi-task model to avoid joint training on all task data. However, current methods face challenges due to representation bias, which can interfere with tasks…

Computer Vision and Pattern Recognition · Computer Science · 2024-12-24 · Marcin Osial, Daniel Marczak, Bartosz Zieliński

Joint Registration and Segmentation via Multi-Task Learning for Adaptive Radiotherapy of Prostate Cancer

Medical image registration and segmentation are two of the most frequent tasks in medical image analysis. As these tasks are complementary and correlated, it would be beneficial to apply them simultaneously in a joint manner. In this paper,…

Image and Video Processing · Electrical Eng. & Systems · 2021-05-06 · Mohamed S. Elmahdy, Laurens Beljaards, Sahar Yousefi, Hessam Sokooti, Fons Verbeek, U. A. van der Heide, Marius Staring

AdaMerging: Adaptive Model Merging for Multi-Task Learning

Multi-task learning (MTL) aims to empower a model to tackle multiple tasks simultaneously. A recent development known as task arithmetic has revealed that several models, each fine-tuned for distinct tasks, can be directly merged into a…

Machine Learning · Computer Science · 2024-05-29 · Enneng Yang, Zhenyi Wang, Li Shen, Shiwei Liu, Guibing Guo, Xingwei Wang, Dacheng Tao

Task-Aware Asynchronous Multi-Task Model with Class Incremental Contrastive Learning for Surgical Scene Understanding

Purpose: Surgery scene understanding with tool-tissue interaction recognition and automatic report generation can play an important role in intra-operative guidance, decision-making and postoperative analysis in robotic surgery. However,…

Artificial Intelligence · Computer Science · 2022-11-29 · Lalithkumar Seenivasan, Mobarakol Islam, Mengya Xu, Chwee Ming Lim, Hongliang Ren

Localizing Task Information for Improved Model Merging and Compression

Model merging and task arithmetic have emerged as promising scalable approaches to merge multiple single-task checkpoints to one multi-task model, but their applicability is reduced by significant performance loss. Previous works have…

Machine Learning · Computer Science · 2024-05-14 · Ke Wang, Nikolaos Dimitriadis, Guillermo Ortiz-Jimenez, François Fleuret, Pascal Frossard

The multi-task learning (MTL) paradigm aims to simultaneously learn multiple tasks within a single model capturing higher-level, more general hidden patterns that are shared by the tasks. In deep learning, a significant challenge in the…

Machine Learning · Computer Science · 2025-06-09 · Thomas Borsani, Andrea Rosani, Giuseppe Nicosia, Giuseppe Di Fatta

Auxiliary Learning for Deep Multi-task Learning

Multi-task learning (MTL) is an efficient solution to solve multiple tasks simultaneously in order to get better speed and performance than handling each single task in turn. Most current methods can be categorized as either: (i) hard…

Computer Vision and Pattern Recognition · Computer Science · 2019-12-02 · Yifan Liu, Bohan Zhuang, Chunhua Shen, Hao Chen, Wei Yin

MedMerge: Merging Models for Effective Transfer Learning to Medical Imaging Tasks

Transfer learning has become a powerful tool to initialize deep learning models to achieve faster convergence and higher performance. This is especially useful in the medical imaging analysis domain, where data scarcity limits possible…

Computer Vision and Pattern Recognition · Computer Science · 2025-04-16 · Ibrahim Almakky, Santosh Sanjeev, Anees Ur Rehman Hashmi, Mohammad Areeb Qazi, Hu Wang, Mohammad Yaqub

Unified Representation Learning for Efficient Medical Image Analysis

Medical image analysis typically includes several tasks such as enhancement, segmentation, and classification. Traditionally, these tasks are implemented using separate deep learning models for separate tasks, which is not efficient because…

Computer Vision and Pattern Recognition · Computer Science · 2021-06-09 · Ghada Zamzmi, Sivaramakrishnan Rajaraman, Sameer Antani

DivMerge: A divergence-based model merging method for multi-tasking

Multi-task learning (MTL) is often achieved by merging datasets before fine-tuning, but the growing availability of fine-tuned models has led to new approaches such as model merging via task arithmetic. A major challenge in this setting is…

Machine Learning · Computer Science · 2025-09-15 · Brahim Touayouch, Loïc Fosse, Géraldine Damnati, Gwénolé Lecorvé

MT3DNet: Multi-Task learning Network for 3D Surgical Scene Reconstruction

In image-assisted minimally invasive surgeries (MIS), understanding surgical scenes is vital for real-time feedback to surgeons, skill evaluation, and improving outcomes through collaborative human-robot procedures. Within this context, the…

Computer Vision and Pattern Recognition · Computer Science · 2024-12-13 · Mithun Parab, Pranay Lendave, Jiyoung Kim, Thi Quynh Dan Nguyen, Palash Ingle

Rep-MTL: Unleashing the Power of Representation-level Task Saliency for Multi-Task Learning

Despite the promise of Multi-Task Learning in leveraging complementary knowledge across tasks, existing multi-task optimization (MTO) techniques remain fixated on resolving conflicts via optimizer-centric loss scaling and gradient…

Machine Learning · Computer Science · 2025-07-29 · Zedong Wang, Siyuan Li, Dan Xu

MetaGPT: Merging Large Language Models Using Model Exclusive Task Arithmetic

The advent of large language models (LLMs) like GPT-4 has catalyzed the exploration of multi-task learning (MTL), in which a single model demonstrates proficiency across diverse tasks. Task arithmetic has emerged as a cost-effective…

Computation and Language · Computer Science · 2024-06-28 · Yuyan Zhou, Liang Song, Bingning Wang, Weipeng Chen

Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts

Multi-Task Reinforcement Learning (MTRL) tackles the long-standing problem of endowing agents with skills that generalize across a variety of problems. To this end, sharing representations plays a fundamental role in capturing both unique…

Machine Learning · Computer Science · 2024-05-07 · Ahmed Hendawy, Jan Peters, Carlo D'Eramo

Multi-Task Cooperative Learning via Searching for Flat Minima

Multi-task learning (MTL) has shown great potential in medical image analysis, improving the generalizability of the learned features and the performance in individual tasks. However, most of the work on MTL focuses on either architecture…

Computer Vision and Pattern Recognition · Computer Science · 2023-09-22 · Fuping Wu, Le Zhang, Yang Sun, Yuanhan Mo, Thomas Nichols, Bartlomiej W. Papiez

Toward Enhancing Representation Learning in Federated Multi-Task Settings

Federated multi-task learning (FMTL) seeks to collaboratively train customized models for users with different tasks while preserving data privacy. Most existing approaches assume model congruity (i.e., the use of fully or partially…

Machine Learning · Computer Science · 2026-02-03 · Mehdi Setayesh, Mahdi Beitollahi, Yasser H. Khalil, Hongliang Li

Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent

Merging multiple expert models offers a promising approach for performing multi-task learning without accessing their original data. Existing methods attempt to alleviate task conflicts by sparsifying task vectors or promoting orthogonality…

Machine Learning · Computer Science · 2025-05-27 · Yongxian Wei, Anke Tang, Li Shen, Zixuan Hu, Chun Yuan, Xiaochun Cao

Multi-Task Learning for Dense Prediction Tasks: A Survey

With the advent of deep learning, many dense prediction tasks, i.e. tasks that produce pixel-level predictions, have seen significant performance improvements. The typical approach is to learn these tasks in isolation, that is, a separate…

Computer Vision and Pattern Recognition · Computer Science · 2021-01-26 · Simon Vandenhende, Stamatios Georgoulis, Wouter Van Gansbeke, Marc Proesmans, Dengxin Dai, Luc Van Gool

Leveraging Submodule Linearity Enhances Task Arithmetic Performance in LLMs

Task arithmetic is a straightforward yet highly effective strategy for model merging, enabling the resultant model to exhibit multi-task capabilities. Recent research indicates that models demonstrating linearity enhance the performance of…

Machine Learning · Computer Science · 2025-04-16 · Rui Dai, Sile Hu, Xu Shen, Yonggang Zhang, Xinmei Tian, Jieping Ye