English
Related papers

Related papers: IncreLoRA: Incremental Parameter Allocation Method…

200 papers

With the proliferation of large pre-trained language models (PLMs), fine-tuning all model parameters becomes increasingly inefficient, particularly when dealing with numerous downstream tasks that entail substantial training and storage…

Computation and Language · Computer Science 2024-01-23 Nadav Benedek , Lior Wolf

With the rapid scaling of large language models (LLMs), serving numerous low-rank adaptations (LoRAs) concurrently has become increasingly impractical, leading to unaffordable costs and necessitating more parameter-efficient finetuning…

Machine Learning · Computer Science 2024-05-28 Sheng Wang , Boyang Xue , Jiacheng Ye , Jiyue Jiang , Liheng Chen , Lingpeng Kong , Chuan Wu

LoRA-based large model parameter-efficient fine-tuning (PEFT) methods use low-rank de- composition to approximate updates to model parameters. However, compared to full- parameter fine-tuning, low-rank updates often lead to a performance…

Computation and Language · Computer Science 2025-08-26 Haojie Zhang

Low-Rank Adaptation (LoRA) has become a widely adopted technique for fine-tuning large-scale pre-trained models with minimal parameter updates. However, existing methods rely on fixed ranks or focus solely on either rank pruning or…

Machine Learning · Computer Science 2025-04-02 Huandong Chang , Zicheng Ma , Mingyuan Ma , Zhenting Qi , Andrew Sabot , Hong Jiang , H. T. Kung

Fine-tuning large language models (LLMs) aims to adapt pre-trained models to specific tasks using relatively small and domain-specific datasets. Among Parameter-Efficient Fine-Tuning (PEFT) methods, Low-Rank Adaptation (LoRA) stands out by…

Computation and Language · Computer Science 2026-04-16 Yarui Cao , Kai Liu

LoRA has become a universal Parameter-Efficient Fine-Tuning (PEFT) technique that equips Large Language Models (LLMs) to adapt quickly to new tasks. However, when these models are scaled up, even the latest LoRA variants still introduce…

Computation and Language · Computer Science 2026-02-25 Xindian Ma , Rundong Kong , Peng Zhang , Ruoxiang Huang , Yongyu Jiang

Parameter-efficient fine-tuning (PEFT) is widely studied for its effectiveness and efficiency in the era of large language models. Low-rank adaptation (LoRA) has demonstrated commendable performance as a popular and representative method.…

Computation and Language · Computer Science 2024-04-16 Zequan Liu , Jiawen Lyn , Wei Zhu , Xing Tian , Yvette Graham

Parameter-efficient fine-tuning (PEFT) is a popular method for tailoring pre-trained large language models (LLMs), especially as the models' scale and the diversity of tasks increase. Low-rank adaptation (LoRA) is based on the idea that the…

Computation and Language · Computer Science 2025-05-27 Pengjie Ren , Chengshun Shi , Shiguang Wu , Mengqi Zhang , Zhaochun Ren , Maarten de Rijke , Zhumin Chen , Jiahuan Pei

Fine-tuning large-scale pre-trained models is inherently a resource-intensive task. While it can enhance the capabilities of the model, it also incurs substantial computational costs, posing challenges to the practical application of…

Computation and Language · Computer Science 2024-06-27 Yulong Mao , Kaiyu Huang , Changhao Guan , Ganglin Bao , Fengran Mo , Jinan Xu

Low-Rank Adaptation (LoRA) is a widely adopted parameter-efficient fine-tuning (PEFT) method for Large Language Models (LLMs), but it still incurs notable overhead and suffers from parameter interference in complex datasets. While recent…

Computation and Language · Computer Science 2025-12-19 Chunlin Tian , Xuyang Wei , Huanrong Liu , Zhijiang Guo , Li Li

Supervised fine-tuning is the most common method to adapt large language models (LLMs) to downstream tasks, but full fine-tuning LLMs requires massive computational resources. Recently, parameter-efficient fine-tuning (PEFT) methods have…

Computation and Language · Computer Science 2024-02-27 Xiangdi Meng , Damai Dai , Weiyao Luo , Zhe Yang , Shaoxiang Wu , Xiaochen Wang , Peiyi Wang , Qingxiu Dong , Liang Chen , Zhifang Sui

Low-rank adaptation (LoRA) has become a widely used paradigm for parameter-efficient fine-tuning of large language models, yet its representational capacity often lags behind full fine-tuning. Within the context of LoRA, a key open question…

Machine Learning · Computer Science 2025-11-04 Xin Yu , Cong Xie , Ziyu Zhao , Tiantian Fan , Lingzhou Xue , Zhi Zhang

Low-rank Adaptation (LoRA) has gained popularity as a fine-tuning approach for Large Language Models (LLMs) due to its low resource requirements and good performance. While a plethora of work has investigated improving LoRA serving…

Machine Learning · Computer Science 2025-08-06 Minghao Yan , Zhuang Wang , Zhen Jia , Shivaram Venkataraman , Yida Wang

Due to the demand for efficient fine-tuning of large language models, Low-Rank Adaptation (LoRA) has been widely adopted as one of the most effective parameter-efficient fine-tuning methods. Nevertheless, while LoRA improves efficiency,…

Computation and Language · Computer Science 2025-06-13 Naibin Gu , Zhenyu Zhang , Xiyu Liu , Peng Fu , Zheng Lin , Shuohuan Wang , Yu Sun , Hua Wu , Weiping Wang , Haifeng Wang

Fine-tuning large language models (LLMs) with high parameter efficiency for downstream tasks has become a new paradigm. Low-Rank Adaptation (LoRA) significantly reduces the number of trainable parameters for fine-tuning. Although it has…

Computation and Language · Computer Science 2024-08-14 Jia-Chen Zhang , Yu-Jie Xiong , He-Xi Qiu , Dong-Hai Zhu , Chun-Ming Xia

Adapting large language models (LLMs) to downstream tasks via full fine-tuning is increasingly impractical due to its computational and memory demands. Parameter-efficient fine-tuning (PEFT) approaches such as Low-Rank Adaptation (LoRA)…

Machine Learning · Computer Science 2026-05-19 Jing Gao , Zhong-Yi Lu , Pan Zhang , Ze-Feng Gao

Low-Rank Adaptation (LoRA) is currently the most commonly used Parameter-efficient fine-tuning (PEFT) method, it introduces auxiliary parameters for each layer to fine-tune the pre-trained model under limited computing resources. However,…

Machine Learning · Computer Science 2024-06-19 Hongyun Zhou , Xiangyu Lu , Wang Xu , Conghui Zhu , Tiejun Zhao , Muyun Yang

The rapid development of parameter-efficient fine-tuning methods has noticeably improved the efficiency of adapting large language models. Among these, LoRA has gained widespread popularity due to its strong balance of effectiveness and…

Machine Learning · Computer Science 2026-01-15 Yongfu Xue

Low-rank adaptation is a popular parameter-efficient fine-tuning method for large language models. In this paper, we analyze the impact of low-rank updating, as implemented in LoRA. Our findings suggest that the low-rank updating mechanism…

Computation and Language · Computer Science 2024-05-21 Ting Jiang , Shaohan Huang , Shengyue Luo , Zihan Zhang , Haizhen Huang , Furu Wei , Weiwei Deng , Feng Sun , Qi Zhang , Deqing Wang , Fuzhen Zhuang

Fine-tuning large pre-trained models for downstream tasks has become a fundamental approach in natural language processing. Fully fine-tuning all model parameters is computationally expensive and memory-intensive, especially in…

Computation and Language · Computer Science 2025-09-24 Yu Chen , Yifei Han , Long Zhang , Yue Du , Bin Li
‹ Prev 1 2 3 10 Next ›