Related papers: Superpose Task-specific Features for Model Merging

SE-Merging: A Self-Enhanced Approach for Dynamic Model Merging

Model merging has gained increasing attention due to its intriguing property: interpolating the parameters of different task-specific fine-tuned models leads to multi-task abilities. However, despite its empirical success, the underlying…

Artificial Intelligence · Computer Science 2025-06-24 Zijun Chen , Zhanpeng Zhou , Bo Zhang , Weinan Zhang , Xi Sun , Junchi Yan

No Task Left Behind: Isotropic Model Merging with Common and Task-Specific Subspaces

Model merging integrates the weights of multiple task-specific models into a single multi-task model. Despite recent interest in the problem, a significant performance gap between the combined and single-task models remains. In this paper,…

Machine Learning · Computer Science 2025-06-12 Daniel Marczak , Simone Magistri , Sebastian Cygert , Bartłomiej Twardowski , Andrew D. Bagdanov , Joost van de Weijer

Merging by Matching Models in Task Parameter Subspaces

Model merging aims to cheaply combine individual task-specific models into a single multitask model. In this work, we view past merging methods as leveraging different notions of a ''task parameter subspace'' in which models are matched…

Machine Learning · Computer Science 2024-04-16 Derek Tam , Mohit Bansal , Colin Raffel

Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities

Model merging is an efficient empowerment technique in the machine learning community that does not require the collection of raw training data and does not require expensive computation. As model merging becomes increasingly prevalent…

Machine Learning · Computer Science 2026-01-01 Enneng Yang , Li Shen , Guibing Guo , Xingwei Wang , Xiaochun Cao , Jie Zhang , Dacheng Tao

To See a World in a Spark of Neuron: Disentangling Multi-task Interference for Training-free Model Merging

Fine-tuning pre-trained models on targeted datasets enhances task-specific performance but often comes at the expense of generalization. Model merging techniques, which integrate multiple fine-tuned models into a single multi-task model…

Machine Learning · Computer Science 2025-09-11 Zitao Fang , Guodong DU , Shuyang Yu , Yifei Guo , Yiwei Zhang , Yiyao Cao , Jing Li , Ho-Kin Tang , Sim Kuan Goh

Model Merging in the Essential Subspace

Model merging aims to integrate multiple task-specific fine-tuned models derived from a shared pre-trained checkpoint into a single multi-task model without additional training. Despite extensive research, task interference remains a major…

Machine Learning · Computer Science 2026-02-25 Longhua Li , Lei Qi , Qi Tian , Xin Geng

From Task-Specific Models to Unified Systems: A Review of Model Merging Approaches

Model merging has achieved significant success, with numerous innovative methods proposed to enhance capabilities by combining multiple models. However, challenges persist due to the lack of a unified framework for classification and…

Machine Learning · Computer Science 2025-03-13 Wei Ruan , Tianze Yang , Yifan Zhou , Tianming Liu , Jin Lu

SyMerge: From Non-Interference to Synergistic Merging via Single-Layer Adaptation

Model merging combines independently trained models into a single multi-task model. However, most existing approaches focus primarily on avoiding task interference. We argue that its greater potential lies in enabling task synergy, where…

Machine Learning · Computer Science 2026-05-25 Aecheon Jung , Seunghwan Lee , Dongyoon Han , Sungeun Hong

Model Merging in the Era of Large Language Models: Methods, Applications, and Future Directions

Model merging combines the parameters of multiple neural networks into a single model without additional training. As fine-tuned large language models (LLMs) proliferate, merging offers a computationally efficient alternative to ensembles…

Computation and Language · Computer Science 2026-03-31 Mingyang Song , Mao Zheng

In-Model Merging for Enhancing the Robustness of Medical Imaging Classification Models

Model merging is an effective strategy to merge multiple models for enhancing model performances, and more efficient than ensemble learning as it will not introduce extra computation into inference. However, limited research explores if the…

Computer Vision and Pattern Recognition · Computer Science 2025-05-19 Hu Wang , Ibrahim Almakky , Congbo Ma , Numan Saeed , Mohammad Yaqub

Training-free Heterogeneous Model Merging

Model merging has attracted significant attention as a powerful paradigm for model reuse, facilitating the integration of task-specific models into a singular, versatile framework endowed with multifarious capabilities. Previous studies,…

Machine Learning · Computer Science 2025-01-03 Zhengqi Xu , Han Zheng , Jie Song , Li Sun , Mingli Song

Model Merging: Foundations and Algorithms

Modern deep learning usually treats models as separate artifacts: trained independently, specialized for particular purposes, and replaced when improved versions appear. This thesis studies model merging as an alternative paradigm:…

Machine Learning · Computer Science 2026-05-05 Donato Crisostomi

Leveraging Submodule Linearity Enhances Task Arithmetic Performance in LLMs

Task arithmetic is a straightforward yet highly effective strategy for model merging, enabling the resultant model to exhibit multi-task capabilities. Recent research indicates that models demonstrating linearity enhance the performance of…

Machine Learning · Computer Science 2025-04-16 Rui Dai , Sile Hu , Xu Shen , Yonggang Zhang , Xinmei Tian , Jieping Ye

Forgetting of task-specific knowledge in model merging-based continual learning

This paper investigates the linear merging of models in the context of continual learning (CL). Using controlled visual cues in computer vision experiments, we demonstrate that merging largely preserves or enhances shared knowledge, while…

Computer Vision and Pattern Recognition · Computer Science 2025-08-01 Timm Hess , Gido M van de Ven , Tinne Tuytelaars

Non-Uniform Parameter-Wise Model Merging

Combining multiple machine learning models has long been a technique for enhancing performance, particularly in distributed settings. Traditional approaches, such as model ensembles, work well, but are expensive in terms of memory and…

Machine Learning · Computer Science 2024-12-23 Albert Manuel Orozco Camacho , Stefan Horoi , Guy Wolf , Eugene Belilovsky

Towards Minimizing Feature Drift in Model Merging: Layer-wise Task Vector Fusion for Adaptive Knowledge Integration

Multi-task model merging aims to consolidate knowledge from multiple fine-tuned task-specific experts into a unified model while minimizing performance degradation. Existing methods primarily approach this by minimizing differences between…

Machine Learning · Computer Science 2025-10-28 Wenju Sun , Qingyong Li , Wen Wang , Yang Liu , Yangli-ao Geng , Boyang Li

Collective Model Intelligence Requires Compatible Specialization

In this work, we explore the limitations of combining models by averaging intermediate features, referred to as model merging, and propose a new direction for achieving collective model intelligence through what we call compatible…

Machine Learning · Computer Science 2024-11-05 Jyothish Pari , Samy Jelassi , Pulkit Agrawal

Model Merging by Output-Space Projection

Model merging combines fine-tuned checkpoints into a single multi-task model without retraining. Existing methods - such as task arithmetic, model soups, TIES, and DARE - are computationally efficient and empirically successful, but rely on…

Machine Learning · Computer Science 2026-05-29 Bethan Evans , Benjamin Etheridge , Stephen Roberts , Jared Tanner

Subspace-Boosted Model Merging

Model merging enables the combination of multiple specialized expert models into a single model capable of performing multiple tasks. However, the benefits of merging an increasing amount of specialized experts generally lead to diminishing…

Machine Learning · Computer Science 2025-12-23 Ronald Skorobogat , Karsten Roth , Mariana-Iuliana Georgescu

Fine-Grained Model Merging via Modular Expert Recombination

Model merging constructs versatile models by integrating task-specific models without requiring labeled data or expensive joint retraining. Although recent methods improve adaptability to heterogeneous tasks by generating customized merged…

Machine Learning · Computer Science 2026-02-09 Haiyun Qiu , Xingyu Wu , Liang Feng , Kay Chen Tan