English
Related papers

Related papers: Hyperparameter Optimization for Large Language Mod…

200 papers

The advent of large language models (LLMs) has revolutionized natural language processing, enabling unprecedented capabilities in understanding and generating human-like text. However, the computational cost and convergence times associated…

Computation and Language · Computer Science 2024-11-26 Kerim Büyükakyüz

In fine-tuning large language models (LLMs), conserving computational resources while maintaining effectiveness and improving outcomes within the same computational constraints is crucial. The Low-Rank Adaptation (LoRA) strategy balances…

Machine Learning · Computer Science 2024-09-05 Xiaojun Xiao , Sen Shen , Qiming Bao , Hongfei Rong , Kairui Liu , Zhongsheng Wang , Jiamou Liu

Large Language Models (LLMs) have transformed both everyday life and scientific research. However, adapting LLMs from general-purpose models to specialized tasks remains challenging, particularly in resource-constrained environments.…

Machine Learning · Computer Science 2025-09-12 Hao Zhang , Bo Huang , Zhenjia Li , Xi Xiao , Hui Yi Leong , Zumeng Zhang , Xinwei Long , Tianyang Wang , Hao Xu

Low-Rank Adaptation (LoRA) is one of the most widely used techniques for fine-tuning large language models (LLMs). By introducing a small number of trainable low-rank weight matrices, LoRA substantially reduces the number of parameters that…

Machine Learning · Computer Science 2025-07-15 Seokmin Ko

Fine-tuning helps large language models (LLM) recover degraded information and enhance task performance. Although Low-Rank Adaptation (LoRA) is widely used and effective for fine-tuning, we have observed that its scaling factor can limit or…

Machine Learning · Computer Science 2025-01-14 Jun Liu , Zhenglun Kong , Peiyan Dong , Changdi Yang , Xuan Shen , Pu Zhao , Hao Tang , Geng Yuan , Wei Niu , Wenbin Zhang , Xue Lin , Dong Huang , Yanzhi Wang

Fine-tuning Large Language Models (LLMs) with Low-Rank Adaptation (LoRA) offers a resource-efficient way to personalize or specialize. However, LoRA is highly sensitive to hyperparameter choices, and exhaustive hyperparameter search is…

Computation and Language · Computer Science 2026-05-29 Baek Seong-Eun , Lee Jung-Mok , Kim Sung-Bin , Tae-Hyun Oh

Low-rank adaptation (LoRA) is a popular method for fine-tuning large-scale pre-trained models in downstream tasks by learning low-rank incremental matrices. Though LoRA and its variants effectively reduce the number of trainable parameters…

Machine Learning · Computer Science 2024-03-21 Rushi Qiang , Ruiyi Zhang , Pengtao Xie

Fine-tuning large language models (LLMs) aims to adapt pre-trained models to specific tasks using relatively small and domain-specific datasets. Among Parameter-Efficient Fine-Tuning (PEFT) methods, Low-Rank Adaptation (LoRA) stands out by…

Computation and Language · Computer Science 2026-04-16 Yarui Cao , Kai Liu

Large pre-trained models, such as large language models (LLMs), present significant resource challenges for fine-tuning due to their extensive parameter sizes, especially for applications in mobile systems. To address this, Low-Rank…

Machine Learning · Computer Science 2024-07-18 Yuzhu Mao , Siqi Ping , Zihao Zhao , Yang Liu , Wenbo Ding

Low-Rank Adaptation (LoRA) is the prevailing approach for efficient large language model (LLM) fine-tuning. Building on this paradigm, recent studies have proposed alternative initialization strategies, architectural modifications, and…

Machine Learning · Computer Science 2026-05-20 Yu-Ang Lee , Ching-Yun Ko , Pin-Yu Chen , Mi-Yen Yeh

Low-rank Adaptation (LoRA) has gained popularity as a fine-tuning approach for Large Language Models (LLMs) due to its low resource requirements and good performance. While a plethora of work has investigated improving LoRA serving…

Machine Learning · Computer Science 2025-08-06 Minghao Yan , Zhuang Wang , Zhen Jia , Shivaram Venkataraman , Yida Wang

The rapid advancements in large language models (LLMs) have revolutionized natural language processing, creating an increased need for efficient, task-specific fine-tuning methods. Traditional fine-tuning of LLMs involves updating a large…

Computation and Language · Computer Science 2024-11-26 Ayush Singh , Rajdeep Aher , Shivank Garg

As large language models (LLMs) continue to scale in size, the computational overhead has become a major bottleneck for task-specific fine-tuning. While low-rank adaptation (LoRA) effectively curtails this cost by confining the weight…

Machine Learning · Computer Science 2026-05-15 Yilang Zhang , Xiaodong Yang , Yiwei Cai , Georgios B. Giannakis

Low-rank adaptation (LoRA) has become the default approach to fine-tune large language models (LLMs) due to its significant reduction in trainable parameters. However, trainable parameter demand for LoRA increases with increasing model…

Computation and Language · Computer Science 2024-06-19 Seyedarmin Azizi , Souvik Kundu , Massoud Pedram

This paper explores the use of foundational large language models (LLMs) in hyperparameter optimization (HPO). Hyperparameters are critical in determining the effectiveness of machine learning models, yet their optimization often relies on…

Machine Learning · Computer Science 2024-11-12 Michael R. Zhang , Nishkrit Desai , Juhan Bae , Jonathan Lorraine , Jimmy Ba

Low-Rank Adaptation (LoRA) lowers the computational and memory overhead of fine-tuning large models by updating a low-dimensional subspace of the pre-trained weight matrix. Albeit efficient, LoRA exhibits suboptimal convergence and…

Machine Learning · Computer Science 2026-02-25 Yilang Zhang , Bingcong Li , Georgios B. Giannakis

Hyperparameter optimization is a crucial problem in Evolutionary Computation. In fact, the values of the hyperparameters directly impact the trajectory taken by the optimization process, and their choice requires extensive reasoning by…

Neural and Evolutionary Computing · Computer Science 2024-08-06 Leonardo Lucio Custode , Fabio Caraffini , Anil Yaman , Giovanni Iacca

Large Language Models (LLMs) are driving advancements in artificial intelligence by increasing the scale of model parameters, which has significantly enhanced generalization ability and unlocked new capabilities in practice. However, their…

Artificial Intelligence · Computer Science 2025-10-20 Chenxing Wei , Yao Shu , Ying Tiffany He , Fei Richard Yu

Pre-training Large Language Models (LLMs) on web-scale datasets becomes fundamental for advancing general-purpose AI. In contrast, enhancing their predictive performance on downstream tasks typically involves adapting their knowledge…

Fine-tuning has proven to be highly effective in adapting pre-trained models to perform better on new desired tasks with minimal data samples. Among the most widely used approaches are reparameterization methods, which update a target…

Machine Learning · Computer Science 2025-10-27 Aymane El Firdoussi , El Mahdi Chayti , Mohamed El Amine Seddik , Martin Jaggi
‹ Prev 1 2 3 10 Next ›