English

Parameter-Efficient Interventions for Enhanced Model Merging

Computer Vision and Pattern Recognition 2024-12-24 v1

Abstract

Model merging combines knowledge from task-specific models into a unified multi-task model to avoid joint training on all task data. However, current methods face challenges due to representation bias, which can interfere with tasks performance. As a remedy, we propose IntervMerge, a novel approach to multi-task model merging that effectively mitigates representation bias across the model using taskspecific interventions. To further enhance its efficiency, we introduce mini-interventions, which modify only part of the representation, thereby reducing the additional parameters without compromising performance. Experimental results demonstrate that IntervMerge consistently outperforms the state-of-the-art approaches using fewer parameters.

Keywords

Cite

@article{arxiv.2412.17023,
  title  = {Parameter-Efficient Interventions for Enhanced Model Merging},
  author = {Marcin Osial and Daniel Marczak and Bartosz Zieliński},
  journal= {arXiv preprint arXiv:2412.17023},
  year   = {2024}
}

Comments

10 pages, 6 figures, SIAM International Conference on Data Mining (SDM) 2025