Related papers: Does Combining Parameter-efficient Modules Improve…

LoRA Learns Less and Forgets Less

Low-Rank Adaptation (LoRA) is a widely-used parameter-efficient finetuning method for large language models. LoRA saves memory by training only low rank perturbations to selected weight matrices. In this work, we compare the performance of…

Machine Learning · Computer Science 2024-09-24 Dan Biderman, Jacob Portes, Jose Javier Gonzalez Ortiz, Mansheej Paul, Philip Greengard, Connor Jennings, Daniel King, Sam Havens, Vitaliy Chiley, Jonathan Frankle, Cody Blakeney, John P. Cunningham

LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks

Low-Rank Adaptation (LoRA) is a popular technique for parameter-efficient fine-tuning of Large Language Models (LLMs). We study how different LoRA modules can be merged to achieve skill composition -- testing the performance of the merged…

Computation and Language · Computer Science 2024-12-03 Akshara Prabhakar, Yuanzhi Li, Karthik Narasimhan, Sham Kakade, Eran Malach, Samy Jelassi

PLoRA: Efficient LoRA Hyperparameter Tuning for Large Models

Low-rank Adaptation (LoRA) has gained popularity as a fine-tuning approach for Large Language Models (LLMs) due to its low resource requirements and good performance. While a plethora of work has investigated improving LoRA serving…

Machine Learning · Computer Science 2025-08-06 Minghao Yan, Zhuang Wang, Zhen Jia, Shivaram Venkataraman, Yida Wang

RandLoRA: Full-rank parameter-efficient fine-tuning of large models

Low-Rank Adaptation (LoRA) and its variants have shown impressive results in reducing the number of trainable parameters and memory requirements of large transformer networks while maintaining fine-tuning performance. The low-rank nature of…

Computation and Language · Computer Science 2025-03-13 Paul Albert, Frederic Z. Zhang, Hemanth Saratchandran, Cristian Rodriguez-Opazo, Anton van den Hengel, Ehsan Abbasnejad

LoRA vs. Full Fine-Tuning: A Theoretical Perspective

Fine-tuning adapts a pre-trained model to downstream tasks using a small amount of labeled data. Low-Rank Adaptation (LoRA) is an efficient fine-tuning method that reduces memory and computation costs while often achieving performance close…

Machine Learning · Computer Science 2026-05-20 Ali Zindari, Rotem Mulayoff, Sebastian U. Stich

One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning

We present Generalized LoRA (GLoRA), an advanced approach for universal parameter-efficient fine-tuning tasks. Enhancing Low-Rank Adaptation (LoRA), GLoRA employs a generalized prompt module to optimize pre-trained model weights and adjust…

Machine Learning · Computer Science 2023-10-17 Arnav Chavan, Zhuang Liu, Deepak Gupta, Eric Xing, Zhiqiang Shen

MALoRA: Mixture of Asymmetric Low-Rank Adaptation for Enhanced Multi-Task Learning

Parameter-Efficient Fine-Tuning (PEFT) methods like LoRA have significantly improved the adaptation of LLMs to downstream tasks in a resource-efficient manner. However, in multi-task scenarios, challenges such as training imbalance and the…

Computation and Language · Computer Science 2024-10-31 Xujia Wang, Haiyan Zhao, Shuo Wang, Hanqing Wang, Zhiyuan Liu

LoRA$^2$: Multi-Scale Low-Rank Approximations for Fine-Tuning Large Language Models

Fine-tuning large language models (LLMs) with high parameter efficiency for downstream tasks has become a new paradigm. Low-Rank Adaptation (LoRA) significantly reduces the number of trainable parameters for fine-tuning. Although it has…

Computation and Language · Computer Science 2024-08-14 Jia-Chen Zhang, Yu-Jie Xiong, He-Xi Qiu, Dong-Hai Zhu, Chun-Ming Xia

LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

Low-rank adaptations (LoRA) are often employed to fine-tune large language models (LLMs) for new tasks. This paper investigates LoRA composability for cross-task generalization and introduces LoraHub, a simple framework devised for the…

Computation and Language · Computer Science 2024-08-20 Chengsong Huang, Qian Liu, Bill Yuchen Lin, Tianyu Pang, Chao Du, Min Lin

MELoRA: Mini-Ensemble Low-Rank Adapters for Parameter-Efficient Fine-Tuning

Parameter-efficient fine-tuning (PEFT) is a popular method for tailoring pre-trained large language models (LLMs), especially as the models' scale and the diversity of tasks increase. Low-rank adaptation (LoRA) is based on the idea that the…

Computation and Language · Computer Science 2025-05-27 Pengjie Ren, Chengshun Shi, Shiguang Wu, Mengqi Zhang, Zhaochun Ren, Maarten de Rijke, Zhumin Chen, Jiahuan Pei

Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs

Large Language Models (LLMs) such as ChatGPT demonstrate strong few-shot adaptability without requiring fine-tuning, positioning them as ideal for data-limited and real-time applications. However, this adaptability has not yet been replicated…

Computer Vision and Pattern Recognition · Computer Science 2024-12-04 Zixuan Hu, Yongxian Wei, Li Shen, Chun Yuan, Dacheng Tao

Comparison between parameter-efficient techniques and full fine-tuning: A case study on multilingual news article classification

Adapters and Low-Rank Adaptation (LoRA) are parameter-efficient fine-tuning techniques designed to make the training of language models more efficient. Previous results demonstrated that these methods can even improve performance on some…

Computation and Language · Computer Science 2024-04-09 Olesya Razuvayevskaya, Ben Wu, Joao A. Leite, Freddy Heppell, Ivan Srba, Carolina Scarton, Kalina Bontcheva, Xingyi Song

$\textit{Trans-LoRA}$: towards data-free Transferable Parameter Efficient Finetuning

Low-rank adapters (LoRA) and their variants are popular parameter-efficient fine-tuning (PEFT) techniques that closely match full model fine-tune performance while requiring only a small number of additional parameters. These additional…

Machine Learning · Computer Science 2024-05-28 Runqian Wang, Soumya Ghosh, David Cox, Diego Antognini, Aude Oliva, Rogerio Feris, Leonid Karlinsky

Low-Rank Adaptation for Multilingual Summarization: An Empirical Study

Although the advancements of pre-trained Large Language Models have significantly accelerated recent progress in NLP, their ever-increasing size poses significant challenges for conventional fine-tuning, especially in memory-intensive…

Computation and Language · Computer Science 2024-04-02 Chenxi Whitehouse, Fantine Huot, Jasmijn Bastings, Mostafa Dehghani, Chu-Cheng Lin, Mirella Lapata

Exploring Sparse Adapters for Scalable Merging of Parameter Efficient Experts

Merging parameter-efficient task experts has recently gained growing attention as a way to build modular architectures that can be rapidly adapted on the fly for specific downstream tasks, without requiring additional fine-tuning.…

Machine Learning · Computer Science 2025-07-15 Samin Yeasar Arnob, Zhan Su, Minseon Kim, Oleksiy Ostapenko, Riyasat Ohib, Esra'a Saleh, Doina Precup, Lucas Caccia, Alessandro Sordoni

LoRA Is Slower Than You Think

Low-Rank Adaptation (LoRA) is one of the most widely used techniques for fine-tuning large language models (LLMs). By introducing a small number of trainable low-rank weight matrices, LoRA substantially reduces the number of parameters that…

Machine Learning · Computer Science 2025-07-15 Seokmin Ko

Layer-wise LoRA fine-tuning: a similarity metric approach

Pre-training Large Language Models (LLMs) on web-scale datasets becomes fundamental for advancing general-purpose AI. In contrast, enhancing their predictive performance on downstream tasks typically involves adapting their knowledge…

Machine Learning · Computer Science 2026-02-06 Keith Ando Ogawa, Bruno Lopes Yamamoto, Lucas Lauton de Alcantara, Lucas Pellicer, Rosimeire Pereira Costa, Edson Bollis, Anna Helena Reali Costa, Artur Jordao

Strategic Over-Parameterization for Generalizable Low-Rank Adaptation

Adapting large language models (LLMs) to downstream tasks via full fine-tuning is increasingly impractical due to its computational and memory demands. Parameter-efficient fine-tuning (PEFT) approaches such as Low-Rank Adaptation (LoRA)…

Machine Learning · Computer Science 2026-05-19 Jing Gao, Zhong-Yi Lu, Pan Zhang, Ze-Feng Gao

MerA: Merging Pretrained Adapters For Few-Shot Learning

Adapter tuning, which updates only a few parameters, has become a mainstream method for fine-tuning pretrained language models to downstream tasks. However, it often yields subpar results in few-shot learning. AdapterFusion, which assembles…

Computation and Language · Computer Science 2023-08-31 Shwai He, Run-Ze Fan, Liang Ding, Li Shen, Tianyi Zhou, Dacheng Tao

Composing Parameter-Efficient Modules with Arithmetic Operations

As an efficient alternative to conventional full finetuning, parameter-efficient finetuning (PEFT) is becoming the prevailing method to adapt pretrained language models. In PEFT, a lightweight module is learned on each dataset while the…

Computation and Language · Computer Science 2023-12-12 Jinghan Zhang, Shiqi Chen, Junteng Liu, Junxian He