Related papers: Auto-FlexSwitch: Efficient Dynamic Model Merging v…

Less is More: Efficient Model Merging with Binary Task Switch

As an effective approach to equip models with multi-task capabilities without additional training, model merging has garnered significant attention. However, existing methods face challenges of redundant parameter conflicts and the…

Machine Learning · Computer Science 2024-12-03 Biqing Qi , Fangyuan Li , Zhen Wang , Junqi Gao , Dong Li , Peng Ye , Bowen Zhou

Efficient Multi-Task Modeling through Automated Fusion of Trained Models

Although multi-task learning is widely applied in intelligent services, traditional multi-task modeling methods often require customized designs based on specific task combinations, resulting in a cumbersome modeling process. Inspired by…

Machine Learning · Computer Science 2025-04-15 Jingxuan Zhou , Weidong Bao , Ji Wang , Zhengyi Zhong , Dayu Zhang

Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging

Deep model merging represents an emerging research direction that combines multiple fine-tuned models to harness their specialized capabilities across different tasks and domains. Current model merging techniques focus on merging all…

Machine Learning · Computer Science 2025-01-17 Anke Tang , Enneng Yang , Li Shen , Yong Luo , Han Hu , Bo Du , Dacheng Tao

Multi-Task Model Merging via Adaptive Weight Disentanglement

Model merging has recently gained attention as an economical and scalable approach to incorporate task-specific weights from various tasks into a unified multi-task model. For example, in Task Arithmetic (TA), adding the fine-tuned weights…

Machine Learning · Computer Science 2025-01-10 Feng Xiong , Runxi Cheng , Wang Chen , Zhanqiu Zhang , Yiwen Guo , Chun Yuan , Ruifeng Xu

Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent

Merging multiple expert models offers a promising approach for performing multi-task learning without accessing their original data. Existing methods attempt to alleviate task conflicts by sparsifying task vectors or promoting orthogonality…

Machine Learning · Computer Science 2025-05-27 Yongxian Wei , Anke Tang , Li Shen , Zixuan Hu , Chun Yuan , Xiaochun Cao

Model Merging: Foundations and Algorithms

Modern deep learning usually treats models as separate artifacts: trained independently, specialized for particular purposes, and replaced when improved versions appear. This thesis studies model merging as an alternative paradigm:…

Machine Learning · Computer Science 2026-05-05 Donato Crisostomi

TwinMixing: A Shuffle-Aware Feature Interaction Model for Multi-Task Segmentation

Accurate and efficient perception is essential for autonomous driving, where segmentation tasks such as drivable-area and lane segmentation provide critical cues for motion planning and control. However, achieving high segmentation accuracy…

Computer Vision and Pattern Recognition · Computer Science 2026-03-31 Minh-Khoi Do , Huy Che , Dinh-Duy Phan , Duc-Khai Lam , Duc-Lung Vu

Localizing Task Information for Improved Model Merging and Compression

Model merging and task arithmetic have emerged as promising scalable approaches to merge multiple single-task checkpoints to one multi-task model, but their applicability is reduced by significant performance loss. Previous works have…

Machine Learning · Computer Science 2024-05-14 Ke Wang , Nikolaos Dimitriadis , Guillermo Ortiz-Jimenez , François Fleuret , Pascal Frossard

Towards Minimizing Feature Drift in Model Merging: Layer-wise Task Vector Fusion for Adaptive Knowledge Integration

Multi-task model merging aims to consolidate knowledge from multiple fine-tuned task-specific experts into a unified model while minimizing performance degradation. Existing methods primarily approach this by minimizing differences between…

Machine Learning · Computer Science 2025-10-28 Wenju Sun , Qingyong Li , Wen Wang , Yang Liu , Yangli-ao Geng , Boyang Li

Towards Reversible Model Merging For Low-rank Weights

Model merging aims to combine multiple fine-tuned models into a single set of weights that performs well across all source tasks. While prior work has shown that merging can approximate the performance of individual fine-tuned models for…

Machine Learning · Computer Science 2025-10-17 Mohammadsajad Alipour , Mohammad Mohammadi Amiri

Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging

In the era of large language models, model merging is a promising way to combine multiple task-specific models into a single multitask model without extra training. However, two challenges remain: (a) interference between different models…

Computation and Language · Computer Science 2024-10-15 Zhenyi Lu , Chenghao Fan , Wei Wei , Xiaoye Qu , Dangyang Chen , Yu Cheng

Dynamic Model Merging Made Slim

Model merging enables the reuse of fine-tuned models without joint training or access to original data. Dynamic merging further improves flexibility by selectively activating task-relevant parameters and efficiently composing experts across…

Machine Learning · Computer Science 2026-05-20 Guodong Du , Wanyu Lin

Navigating the Accuracy-Size Trade-Off with Flexible Model Merging

Model merging has emerged as an efficient method to combine multiple single-task fine-tuned models. The merged model can enjoy multi-task capabilities without expensive training. While promising, merging into a single model often suffers…

Computer Vision and Pattern Recognition · Computer Science 2026-04-15 Akash Dhasade , Divyansh Jhunjhunwala , Milos Vujasinovic , Gauri Joshi , Anne-Marie Kermarrec

AdaMerging: Adaptive Model Merging for Multi-Task Learning

Multi-task learning (MTL) aims to empower a model to tackle multiple tasks simultaneously. A recent development known as task arithmetic has revealed that several models, each fine-tuned for distinct tasks, can be directly merged into a…

Machine Learning · Computer Science 2024-05-29 Enneng Yang , Zhenyi Wang , Li Shen , Shiwei Liu , Guibing Guo , Xingwei Wang , Dacheng Tao

CAT Merging: A Training-Free Approach for Resolving Conflicts in Model Merging

Multi-task model merging offers a promising paradigm for integrating multiple expert models into a unified model without additional training. Existing state-of-the-art techniques, such as Task Arithmetic and its variants, merge models by…

Artificial Intelligence · Computer Science 2025-05-15 Wenju Sun , Qingyong Li , Yangli-ao Geng , Boyang Li

Wolf2Pack: The AutoFusion Framework for Dynamic Parameter Fusion

In the rapidly evolving field of deep learning, specialized models have driven significant advancements in tasks such as computer vision and natural language processing. However, this specialization leads to a fragmented ecosystem where…

Computer Vision and Pattern Recognition · Computer Science 2024-10-10 Bowen Tian , Songning Lai , Yutao Yue

TransformMix: Learning Transformation and Mixing Strategies from Data

Data augmentation improves the generalization power of deep learning models by synthesizing more training samples. Sample-mixing is a popular data augmentation approach that creates additional data by combining existing samples. Recent…

Computer Vision and Pattern Recognition · Computer Science 2024-03-20 Tsz-Him Cheung , Dit-Yan Yeung

Compressing Multi-Task Model for Autonomous Driving via Pruning and Knowledge Distillation

Autonomous driving systems rely on panoptic perception to jointly handle object detection, drivable area segmentation, and lane line segmentation. Although multi-task learning is an effective way to integrate these tasks, its increasing…

Computer Vision and Pattern Recognition · Computer Science 2025-11-11 Jiayuan Wang , Q. M. Jonathan Wu , Ning Zhang , Katsuya Suto , Lei Zhong

Fisher Mask Nodes for Language Model Merging

Fine-tuning pre-trained models provides significant advantages in downstream performance. The ubiquitous nature of pre-trained models such as BERT and its derivatives in natural language processing has also led to a proliferation of…

Computation and Language · Computer Science 2024-05-06 Thennal D K , Ganesh Nathan , Suchithra M S

MASS: MoErging through Adaptive Subspace Selection

Model merging has recently emerged as a lightweight alternative to ensembling, combining multiple fine-tuned models into a single set of parameters with no additional training overhead. Yet, existing merging methods fall short of matching…

Machine Learning · Computer Science 2026-03-18 Donato Crisostomi , Alessandro Zirilli , Antonio Andrea Gargiulo , Maria Sofia Bucarelli , Simone Scardapane , Fabrizio Silvestri , Iacopo Masi , Emanuele Rodolà