Related papers: Parameter Efficient Multi-task Model Fusion with P…

200 papers

Pre-training Large Language Models (LLMs) on web-scale datasets has become fundamental for advancing general-purpose AI. In contrast, enhancing their predictive performance on downstream tasks typically involves adapting their knowledge…

Parameter efficient finetuning (PEFT) methods are widely used in LLMs and generative models in computer vision. In particular, several of them can be combined during inference to change the behavior of the base model. In this paper we…

Computer Vision and Pattern Recognition · Computer Science · 2024-11-22 · Ege Kesim, Selahattin Serdar Helli

Parameter-Efficient Fine-Tuning (PEFT) methods like LoRA have significantly improved the adaptation of LLMs to downstream tasks in a resource-efficient manner. However, in multi-task scenarios, challenges such as training imbalance and the…

Computation and Language · Computer Science · 2024-10-31 · Xujia Wang, Haiyan Zhao, Shuo Wang, Hanqing Wang, Zhiyuan Liu

Merging parameter-efficient task experts has recently gained growing attention as a way to build modular architectures that can be rapidly adapted on the fly for specific downstream tasks, without requiring additional fine-tuning.…

Despite the success, the process of fine-tuning large-scale PLMs brings prohibitive adaptation costs. In fact, fine-tuning all the parameters of a colossal model and retaining separate instances for different tasks are practically…

Conventional fine-tuning encounters increasing difficulties given the size of current Pre-trained Language Models, which has made parameter-efficient tuning the focal point of frontier research. Previous methods in this field add…

Computation and Language · Computer Science · 2022-12-12 · Wang Qi, Yu-Ping Ruan, Yuan Zuo, Taihao Li

Large-scale deep learning models with a pretraining-finetuning paradigm have led to a surge of numerous task-specific models fine-tuned from a common pre-trained model. Recently, several research efforts have been made on merging these…

Machine Learning · Computer Science · 2025-04-22 · Yeoreum Lee, Jinwook Jung, Sungyong Baik

Transfer learning via fine-tuning pre-trained transformer models has gained significant success in delivering state-of-the-art results across various NLP tasks. In the absence of centralized data, Federated Learning (FL) can benefit from…

This paper presents a novel fine-tuning methodology for large language models: dynamic LoRA. Building on the standard Low-Rank Adaptation framework, this methodology adds dynamic adaptation mechanisms to improve efficiency and…

Computation and Language · Computer Science · 2025-01-28 · Xiaoxuan Liao, Chihang Wang, Shicheng Zhou, Jiacheng Hu, Hongye Zheng, Jia Gao

In this paper, we introduce a method for fine-tuning Large Language Models (LLMs), inspired by Multi-Task learning in a federated manner. Our approach leverages the structure of each client's model and enables a learning scheme that…

Machine Learning · Computer Science · 2024-10-22 · Ahmed Elbakary, Chaouki Ben Issaid, Tamer ElBatt, Karim Seddik, Mehdi Bennis

Parameter-Efficient Fine-Tuning (PEFT) is essential for adapting Large Language Models (LLMs). In practice, LLMs are often required to handle a diverse set of tasks from multiple domains, a scenario naturally addressed by multi-task…

Computation and Language · Computer Science · 2025-08-08 · Jinda Liu, Bo Cheng, Yi Chang, Yuan Wu

Low-Rank Adaptation (LoRA) has become the leading Parameter-Efficient Fine-Tuning (PEFT) method for Large Language Models (LLMs), as it significantly reduces GPU memory usage while maintaining competitive fine-tuned model quality on…

Machine Learning · Computer Science · 2025-10-02 · Zhanda Zhu, Qidong Su, Yaoyao Ding, Kevin Song, Shang Wang, Gennady Pekhimenko

Recent advancements in Large Language Models (LLMs) have achieved robust performance across diverse tasks, but fine-tuning these models for specific domains remains resource-intensive. Parameter-Efficient Fine-Tuning (PEFT) methods like…

Computation and Language · Computer Science · 2025-02-19 · Yuxuan Zhang, Ruizhe Li

Parameter-efficient fine-tuning methods have emerged as a promising solution for adapting pre-trained models to various downstream tasks. While these methods perform well in single-task learning, extending them to multi-task learning…

Computer Vision and Pattern Recognition · Computer Science · 2026-04-28 · Neeraj Gangwar, Anshuka Rangi, Rishabh Deshmukh, Holakou Rahmanian, Yesh Dattatreya, Nickvash Kani

There has been a significant increase in the deployment of neural network models, presenting substantial challenges in model adaptation and fine-tuning. Efficient adaptation is crucial in maintaining model performance across diverse tasks…

Machine Learning · Computer Science · 2025-04-02 · Maolin Wang, Xiangyu Zhao

Adapter tuning, which updates only a few parameters, has become a mainstream method for fine-tuning pretrained language models to downstream tasks. However, it often yields subpar results in few-shot learning. AdapterFusion, which assembles…

Computation and Language · Computer Science · 2023-08-31 · Shwai He, Run-Ze Fan, Liang Ding, Li Shen, Tianyi Zhou, Dacheng Tao

Checkpoint merging is a technique for combining multiple model snapshots into a single superior model, potentially reducing training time for large language models. This paper explores checkpoint merging in the context of…

Machine Learning · Computer Science · 2025-04-29 · Shi Jie Yu, Sehyun Choi

Adapting models pre-trained on large-scale datasets to a variety of downstream tasks is a common strategy in deep learning. Consequently, parameter-efficient fine-tuning methods have emerged as a promising way to adapt pre-trained models to…

Computer Vision and Pattern Recognition · Computer Science · 2024-04-01 · Ahmed Agiza, Marina Neseem, Sherief Reda

Although multi-task learning is widely applied in intelligent services, traditional multi-task modeling methods often require customized designs based on specific task combinations, resulting in a cumbersome modeling process. Inspired by…

Machine Learning · Computer Science · 2025-04-15 · Jingxuan Zhou, Weidong Bao, Ji Wang, Zhengyi Zhong, Dayu Zhang

Human Activity Recognition is a foundational task in pervasive computing. While recent advances in self-supervised learning and transformer-based architectures have significantly improved HAR performance, adapting large pretrained models to…

Machine Learning · Computer Science · 2025-12-23 · Irina Seregina, Philippe Lalanda, German Vega