Related papers: Hyperparameter Optimization for Large Language Mod…

OLoRA: Orthonormal Low-Rank Adaptation of Large Language Models

The advent of large language models (LLMs) has revolutionized natural language processing, enabling unprecedented capabilities in understanding and generating human-like text. However, the computational cost and convergence times associated…

Computation and Language · Computer Science 2024-11-26 Kerim Büyükakyüz

CoRA: Optimizing Low-Rank Adaptation with Common Subspace of Large Language Models

In fine-tuning large language models (LLMs), conserving computational resources while maintaining effectiveness and improving outcomes within the same computational constraints is crucial. The Low-Rank Adaptation (LoRA) strategy balances…

Machine Learning · Computer Science 2024-09-05 Xiaojun Xiao , Sen Shen , Qiming Bao , Hongfei Rong , Kairui Liu , Zhongsheng Wang , Jiamou Liu

Sensitivity-LoRA: Low-Load Sensitivity-Based Fine-Tuning for Large Language Models

Large Language Models (LLMs) have transformed both everyday life and scientific research. However, adapting LLMs from general-purpose models to specialized tasks remains challenging, particularly in resource-constrained environments.…

Machine Learning · Computer Science 2025-09-12 Hao Zhang , Bo Huang , Zhenjia Li , Xi Xiao , Hui Yi Leong , Zumeng Zhang , Xinwei Long , Tianyang Wang , Hao Xu

LoRA Is Slower Than You Think

Low-Rank Adaptation (LoRA) is one of the most widely used techniques for fine-tuning large language models (LLMs). By introducing a small number of trainable low-rank weight matrices, LoRA substantially reduces the number of parameters that…

Machine Learning · Computer Science 2025-07-15 Seokmin Ko

RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation

Fine-tuning helps large language models (LLM) recover degraded information and enhance task performance. Although Low-Rank Adaptation (LoRA) is widely used and effective for fine-tuning, we have observed that its scaling factor can limit or…

Machine Learning · Computer Science 2025-01-14 Jun Liu , Zhenglun Kong , Peiyan Dong , Changdi Yang , Xuan Shen , Pu Zhao , Hao Tang , Geng Yuan , Wei Niu , Wenbin Zhang , Xue Lin , Dong Huang , Yanzhi Wang

A Language-Guided Bayesian Optimization for Efficient LoRA Hyperparameter Search

Fine-tuning Large Language Models (LLMs) with Low-Rank Adaptation (LoRA) offers a resource-efficient way to personalize or specialize. However, LoRA is highly sensitive to hyperparameter choices, and exhaustive hyperparameter search is…

Computation and Language · Computer Science 2026-05-29 Baek Seong-Eun , Lee Jung-Mok , Kim Sung-Bin , Tae-Hyun Oh

BiLoRA: A Bi-level Optimization Framework for Overfitting-Resilient Low-Rank Adaptation of Large Pre-trained Models

Low-rank adaptation (LoRA) is a popular method for fine-tuning large-scale pre-trained models in downstream tasks by learning low-rank incremental matrices. Though LoRA and its variants effectively reduce the number of trainable parameters…

Machine Learning · Computer Science 2024-03-21 Rushi Qiang , Ruiyi Zhang , Pengtao Xie

TLoRA+: A Low-Rank Parameter-Efficient Fine-Tuning Method for Large Language Models

Fine-tuning large language models (LLMs) aims to adapt pre-trained models to specific tasks using relatively small and domain-specific datasets. Among Parameter-Efficient Fine-Tuning (PEFT) methods, Low-Rank Adaptation (LoRA) stands out by…

Computation and Language · Computer Science 2026-04-16 Yarui Cao , Kai Liu

Enhancing Parameter Efficiency and Generalization in Large-Scale Models: A Regularized and Masked Low-Rank Adaptation Approach

Large pre-trained models, such as large language models (LLMs), present significant resource challenges for fine-tuning due to their extensive parameter sizes, especially for applications in mobile systems. To address this, Low-Rank…

Machine Learning · Computer Science 2024-07-18 Yuzhu Mao , Siqi Ping , Zihao Zhao , Yang Liu , Wenbo Ding

Learning Rate Matters: Vanilla LoRA May Suffice for LLM Fine-tuning

Low-Rank Adaptation (LoRA) is the prevailing approach for efficient large language model (LLM) fine-tuning. Building on this paradigm, recent studies have proposed alternative initialization strategies, architectural modifications, and…

Machine Learning · Computer Science 2026-05-20 Yu-Ang Lee , Ching-Yun Ko , Pin-Yu Chen , Mi-Yen Yeh

PLoRA: Efficient LoRA Hyperparameter Tuning for Large Models

Low-rank Adaptation (LoRA) has gained popularity as a fine-tuning approach for Large Language Models (LLMs) due to its low resource requirements and good performance. While a plethora of work has investigated improving LoRA serving…

Machine Learning · Computer Science 2025-08-06 Minghao Yan , Zhuang Wang , Zhen Jia , Shivaram Venkataraman , Yida Wang

LoRA-Mini : Adaptation Matrices Decomposition and Selective Training

The rapid advancements in large language models (LLMs) have revolutionized natural language processing, creating an increased need for efficient, task-specific fine-tuning methods. Traditional fine-tuning of LLMs involves updating a large…

Computation and Language · Computer Science 2024-11-26 Ayush Singh , Rajdeep Aher , Shivank Garg

ScaLoRA: Optimally Scaled Low-Rank Adaptation for Efficient High-Rank Fine-Tuning

As large language models (LLMs) continue to scale in size, the computational overhead has become a major bottleneck for task-specific fine-tuning. While low-rank adaptation (LoRA) effectively curtails this cost by confining the weight…

Machine Learning · Computer Science 2026-05-15 Yilang Zhang , Xiaodong Yang , Yiwei Cai , Georgios B. Giannakis

LaMDA: Large Model Fine-Tuning via Spectrally Decomposed Low-Dimensional Adaptation

Low-rank adaptation (LoRA) has become the default approach to fine-tune large language models (LLMs) due to its significant reduction in trainable parameters. However, trainable parameter demand for LoRA increases with increasing model…

Computation and Language · Computer Science 2024-06-19 Seyedarmin Azizi , Souvik Kundu , Massoud Pedram

Using Large Language Models for Hyperparameter Optimization

This paper explores the use of foundational large language models (LLMs) in hyperparameter optimization (HPO). Hyperparameters are critical in determining the effectiveness of machine learning models, yet their optimization often relies on…

Machine Learning · Computer Science 2024-11-12 Michael R. Zhang , Nishkrit Desai , Juhan Bae , Jonathan Lorraine , Jimmy Ba

RefLoRA: Refactored Low-Rank Adaptation for Efficient Fine-Tuning of Large Models

Low-Rank Adaptation (LoRA) lowers the computational and memory overhead of fine-tuning large models by updating a low-dimensional subspace of the pre-trained weight matrix. Albeit efficient, LoRA exhibits suboptimal convergence and…

Machine Learning · Computer Science 2026-02-25 Yilang Zhang , Bingcong Li , Georgios B. Giannakis

An investigation on the use of Large Language Models for hyperparameter tuning in Evolutionary Algorithms

Hyperparameter optimization is a crucial problem in Evolutionary Computation. In fact, the values of the hyperparameters directly impact the trajectory taken by the optimization process, and their choice requires extensive reasoning by…

Neural and Evolutionary Computing · Computer Science 2024-08-06 Leonardo Lucio Custode , Fabio Caraffini , Anil Yaman , Giovanni Iacca

Flexora: Flexible Low Rank Adaptation for Large Language Models

Large Language Models (LLMs) are driving advancements in artificial intelligence by increasing the scale of model parameters, which has significantly enhanced generalization ability and unlocked new capabilities in practice. However, their…

Artificial Intelligence · Computer Science 2025-10-20 Chenxing Wei , Yao Shu , Ying Tiffany He , Fei Richard Yu

Layer-wise LoRA fine-tuning: a similarity metric approach

Pre-training Large Language Models (LLMs) on web-scale datasets becomes fundamental for advancing general-purpose AI. In contrast, enhancing their predictive performance on downstream tasks typically involves adapting their knowledge…

Machine Learning · Computer Science 2026-02-06 Keith Ando Ogawa , Bruno Lopes Yamamoto , Lucas Lauton de Alcantara , Lucas Pellicer , Rosimeire Pereira Costa , Edson Bollis , Anna Helena Reali Costa , Artur Jordao

$\alpha$-LoRA: Effective Fine-Tuning via Base Model Rescaling

Fine-tuning has proven to be highly effective in adapting pre-trained models to perform better on new desired tasks with minimal data samples. Among the most widely used approaches are reparameterization methods, which update a target…

Machine Learning · Computer Science 2025-10-27 Aymane El Firdoussi , El Mahdi Chayti , Mohamed El Amine Seddik , Martin Jaggi