Related papers: Editing Models with Task Arithmetic

When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers

Task arithmetic refers to editing the pre-trained model by adding a weighted sum of task vectors, each of which is the weight update from the pre-trained model to fine-tuned models for certain tasks. This approach recently gained attention…

Machine Learning · Computer Science 2025-05-27 Hongkang Li, Yihua Zhang, Shuai Zhang, Meng Wang, Sijia Liu, Pin-Yu Chen

Task Arithmetic in the Tangent Space: Improved Editing of Pre-Trained Models

Task arithmetic has recently emerged as a cost-effective and scalable approach to edit pre-trained models directly in weight space: By adding the fine-tuned weights of different tasks, the model's performance can be improved on these tasks,…

Machine Learning · Computer Science 2023-11-22 Guillermo Ortiz-Jimenez, Alessandro Favero, Pascal Frossard

Decomposing Task Vectors for Refined Model Editing

Large pre-trained models have transformed machine learning, yet adapting these models effectively to exhibit precise, concept-specific behaviors remains a significant challenge. Task vectors, defined as the difference between fine-tuned and…

Machine Learning · Computer Science 2025-12-30 Hamed Damirchi, Ehsan Abbasnejad, Zhen Zhang, Javen Shi

Distilling Linearized Behavior into Non-Linear Fine-Tuning for Effective Task Arithmetic

Task vector composition has emerged as a promising paradigm for editing pre-trained models, enabling model merging through addition and unlearning through subtraction. Fine-tuning in the tangent space of a pre-trained model (linear…

Machine Learning · Computer Science 2026-05-25 Thomas Sommariva, Francesca Morandi, Simone Calderara, Angelo Porrello

On Fairness of Task Arithmetic: The Role of Task Vectors

Model editing techniques, particularly task arithmetic with task vectors, offer an efficient alternative to full fine-tuning by enabling direct parameter updates through simple arithmetic operations. While this approach promises substantial…

Machine Learning · Computer Science 2026-02-13 Hiroki Naganuma, Kotaro Yoshida, Laura Gomezjurado Gonzalez, Takafumi Horie, Yuji Naraki, Ryotaro Shimizu

Revisiting Weight Averaging for Model Merging

Model merging aims to build a multi-task learner by combining the parameters of individually fine-tuned models without additional training. While a straightforward approach is to average model parameters across tasks, this often results in…

Machine Learning · Computer Science 2025-04-04 Jiho Choi, Donggyun Kim, Chanhyuk Lee, Seunghoon Hong

On Task Vectors and Gradients

Task arithmetic has emerged as a simple yet powerful technique for model merging, enabling the combination of multiple finetuned models into one. Despite its empirical success, a clear theoretical explanation of why and when it works is…

Machine Learning · Computer Science 2025-10-21 Luca Zhou, Daniele Solombrino, Donato Crisostomi, Maria Sofia Bucarelli, Giuseppe Alessio D'Inverno, Fabrizio Silvestri, Emanuele Rodolà

Cross-Model Transfer of Task Vectors via Few-Shot Orthogonal Alignment

Task arithmetic enables efficient model editing by representing task-specific changes as vectors in parameter space. Task arithmetic typically assumes that the source and target models are initialized from the same pre-trained parameters.…

Computer Vision and Pattern Recognition · Computer Science 2025-05-20 Kazuhiko Kawamoto, Atsuhiro Endo, Hiroshi Kera

Multi-Task Model Merging via Adaptive Weight Disentanglement

Model merging has recently gained attention as an economical and scalable approach to incorporate task-specific weights from various tasks into a unified multi-task model. For example, in Task Arithmetic (TA), adding the fine-tuned weights…

Machine Learning · Computer Science 2025-01-10 Feng Xiong, Runxi Cheng, Wang Chen, Zhanqiu Zhang, Yiwen Guo, Chun Yuan, Ruifeng Xu

Efficient Model Editing with Task-Localized Sparse Fine-tuning

Task arithmetic has emerged as a promising approach for editing models by representing task-specific knowledge as composable task vectors. However, existing methods rely on network linearization to derive task vectors, leading to…

Machine Learning · Computer Science 2025-04-04 Leonardo Iurada, Marco Ciccone, Tatiana Tommasi

Task Addition and Weight Disentanglement in Closed-Vocabulary Models

Task arithmetic has recently emerged as a promising method for editing pre-trained open-vocabulary models, offering a cost-effective alternative to standard multi-task fine-tuning. However, despite the abundance of…

Machine Learning · Computer Science 2025-11-19 Adam Hazimeh, Alessandro Favero, Pascal Frossard

Task Weighting in Meta-learning with Trajectory Optimisation

Developing meta-learning algorithms that are un-biased toward a subset of training tasks often requires hand-designed criteria to weight tasks, potentially resulting in sub-optimal solutions. In this paper, we introduce a new principled and…

Machine Learning · Computer Science 2023-01-05 Cuong Nguyen, Thanh-Toan Do, Gustavo Carneiro

Exploring and Leveraging Class Vectors for Classifier Editing

Image classifiers play a critical role in detecting diseases in medical imaging and identifying anomalies in manufacturing processes. However, their predefined behaviors after extensive training make post hoc model editing difficult,…

Computer Vision and Pattern Recognition · Computer Science 2025-10-20 Jaeik Kim, Jaeyoung Do

Adaptive Weight Assignment Scheme For Multi-task Learning

Deep learning based models are used regularly in every application nowadays. Generally, we train a single model on a single task. However, we can train multiple tasks on a single model under multi-task learning settings. This provides us…

Machine Learning · Computer Science 2023-03-14 Aminul Huq, Mst Tasnim Pervin

Network Model Selection for Task-Focused Attributed Network Inference

Networks are models representing relationships between entities. Often these relationships are explicitly given, or we must learn a representation which generalizes and predicts observed behavior in underlying individual data (e.g.…

Social and Information Networks · Computer Science 2017-09-19 Ivan Brugere, Chris Kanich, Tanya Y. Berger-Wolf

NegMerge: Sign-Consensual Weight Merging for Machine Unlearning

Machine unlearning aims to selectively remove specific knowledge from a trained model. Existing approaches, such as Task Arithmetic, fine-tune the model on the forget set to create a task vector (i.e., a direction in weight space) for…

Machine Learning · Computer Science 2025-07-03 Hyo Seo Kim, Dongyoon Han, Junsuk Choe

Task Vector Quantization for Memory-Efficient Model Merging

Model merging enables efficient multi-task models by combining task-specific fine-tuned checkpoints. However, storing multiple task-specific checkpoints requires significant memory, limiting scalability and restricting model merging to…

Machine Learning · Computer Science 2025-08-08 Youngeun Kim, Seunghwan Lee, Aecheon Jung, Bogon Ryu, Sungeun Hong

Task Arithmetic Through The Lens Of One-Shot Federated Learning

Task Arithmetic is a model merging technique that enables the combination of multiple models' capabilities into a single model through simple arithmetic in the weight space, without the need for additional fine-tuning or access to the…

Machine Learning · Computer Science 2025-07-14 Zhixu Silvia Tao, Ian Mason, Sanjeev Kulkarni, Xavier Boix

Task Vector Bases: A Unified and Scalable Framework for Compressed Task Arithmetic

Task arithmetic, representing downstream tasks through linear operations on task vectors, has emerged as a simple yet powerful paradigm for transferring knowledge across diverse settings. However, maintaining a large collection of task…

Machine Learning · Computer Science 2025-10-10 Siqi Zeng, Yifei He, Meitong Liu, Weiqiu You, Yifan Hao, Yao-Hung Hubert Tsai, Makoto Yamada, Han Zhao

Bilinear Coordinate Alignment for Training-Free Task-Vector Transfer

Fine-tuning large-scale pre-trained models is a recent prevalent paradigm for adapting general representations to specialized tasks. However, when a new version of a pre-trained model becomes available, expertise acquired through…

Machine Learning · Computer Science 2026-05-28 Jungyong Son, Jinwook Jung, Minhee Park, Sungyong Baik
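
Several of the abstracts above describe the same basic operation: a task vector is the weight update from the pre-trained model to a fine-tuned model, and task arithmetic edits the pre-trained model by adding a weighted sum of such vectors (addition for merging, subtraction for unlearning). The following is a minimal sketch of that idea, not the implementation from any of the listed papers. It assumes weights are stored as plain dictionaries of float lists; the function names `task_vector` and `apply_task_vectors`, the parameter name `layer.weight`, and the toy values are all hypothetical.

```python
def task_vector(pretrained, finetuned):
    """Per-parameter difference: fine-tuned weights minus pre-trained weights."""
    return {
        name: [f - p for f, p in zip(finetuned[name], pretrained[name])]
        for name in pretrained
    }


def apply_task_vectors(pretrained, vectors, coeffs):
    """Return the pre-trained weights plus a weighted sum of task vectors."""
    # Copy so the pre-trained weights are left untouched.
    edited = {name: list(values) for name, values in pretrained.items()}
    for vec, coeff in zip(vectors, coeffs):
        for name in edited:
            edited[name] = [w + coeff * d for w, d in zip(edited[name], vec[name])]
    return edited


# Toy weights (hypothetical values chosen only for illustration).
pretrained = {"layer.weight": [1.0, 2.0, 3.0]}
finetuned_a = {"layer.weight": [1.5, 2.0, 2.0]}
finetuned_b = {"layer.weight": [1.0, 3.0, 3.0]}

tau_a = task_vector(pretrained, finetuned_a)
tau_b = task_vector(pretrained, finetuned_b)

# Merging: add both task vectors with (assumed) coefficients 1.0 and 0.5.
merged = apply_task_vectors(pretrained, [tau_a, tau_b], [1.0, 0.5])

# Unlearning: subtract a task vector by using a negative coefficient.
forgot_a = apply_task_vectors(pretrained, [tau_a], [-1.0])
```

In practice the coefficients are typically tuned on held-out data, and the dictionaries would be framework state dicts of tensors rather than Python lists.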