Related papers: Parameter-Efficient Finetuning of Transformers for…

Efficient Federated Finetuning of Tiny Transformers with Resource-Constrained Devices

In recent years, Large Language Models (LLMs) through Transformer structures have dominated many machine learning tasks, especially text processing. However, these models require massive amounts of data for training and induce high resource…

Machine Learning · Computer Science 2025-04-17 Kilian Pfeiffer , Mohamed Aboelenien Ahmed , Ramin Khalili , Jörg Henkel

SLoRA: Federated Parameter Efficient Fine-Tuning of Language Models

Transfer learning via fine-tuning pre-trained transformer models has gained significant success in delivering state-of-the-art results across various NLP tasks. In the absence of centralized data, Federated Learning (FL) can benefit from…

Machine Learning · Computer Science 2023-08-15 Sara Babakniya , Ahmed Roushdy Elkordy , Yahya H. Ezzeldin , Qingfeng Liu , Kee-Bong Song , Mostafa El-Khamy , Salman Avestimehr

Parameter-Efficient Multi-Task Fine-Tuning in Code-Related Tasks

Large Language Models (LLMs) have proven highly effective in automating software engineering tasks, bridging natural language and code semantics to achieve notable results in code generation and summarization. However, their scale incurs…

Software Engineering · Computer Science 2026-01-22 Md Zahidul Haque , Saima Afrin , Antonio Mastropaolo

Parameter-Efficient Fine-Tuning for HAR: Integrating LoRA and QLoRA into Transformer Models

Human Activity Recognition is a foundational task in pervasive computing. While recent advances in self-supervised learning and transformer-based architectures have significantly improved HAR performance, adapting large pretrained models to…

Machine Learning · Computer Science 2025-12-23 Irina Seregina , Philippe Lalanda , German Vega

Low-Rank Adaptation for Multilingual Summarization: An Empirical Study

Although the advancements of pre-trained Large Language Models have significantly accelerated recent progress in NLP, their ever-increasing size poses significant challenges for conventional fine-tuning, especially in memory-intensive…

Computation and Language · Computer Science 2024-04-02 Chenxi Whitehouse , Fantine Huot , Jasmijn Bastings , Mostafa Dehghani , Chu-Cheng Lin , Mirella Lapata

Progtuning: Progressive Fine-tuning Framework for Transformer-based Language Models

Fine-tuning is a promising technique for leveraging Transformer-based language models in downstream tasks. As model sizes continue to grow, updating all model parameters becomes increasingly costly. Parameter-efficient fine-tuning methods…

Computation and Language · Computer Science 2025-06-27 Xiaoshuang Ji , Zhendong Zhao , Xiaojun Chen , Xin Zhao , Zeyao Liu

PLoRA: Efficient LoRA Hyperparameter Tuning for Large Models

Low-rank Adaptation (LoRA) has gained popularity as a fine-tuning approach for Large Language Models (LLMs) due to its low resource requirements and good performance. While a plethora of work has investigated improving LoRA serving…

Machine Learning · Computer Science 2025-08-06 Minghao Yan , Zhuang Wang , Zhen Jia , Shivaram Venkataraman , Yida Wang

A Study to Evaluate the Impact of LoRA Fine-tuning on the Performance of Non-functional Requirements Classification

Classifying Non-Functional Requirements (NFRs) in software development life cycle is critical. Inspired by the theory of transfer learning, researchers apply powerful pre-trained models for NFR classification. However, full fine-tuning by…

Software Engineering · Computer Science 2025-03-12 Xia Li , Allen Kim

Towards Understanding Fine-Tuning Mechanisms of LLMs via Circuit Analysis

Fine-tuning significantly improves the performance of Large Language Models (LLMs), yet its underlying mechanisms remain poorly understood. This paper aims to provide an in-depth interpretation of the fine-tuning process through circuit…

Computation and Language · Computer Science 2025-06-16 Xu Wang , Yan Hu , Wenyu Du , Reynold Cheng , Benyou Wang , Difan Zou

Comparison between parameter-efficient techniques and full fine-tuning: A case study on multilingual news article classification

Adapters and Low-Rank Adaptation (LoRA) are parameter-efficient fine-tuning techniques designed to make the training of language models more efficient. Previous results demonstrated that these methods can even improve performance on some…

Computation and Language · Computer Science 2024-04-09 Olesya Razuvayevskaya , Ben Wu , Joao A. Leite , Freddy Heppell , Ivan Srba , Carolina Scarton , Kalina Bontcheva , Xingyi Song

Parameter-Efficient Tuning with Special Token Adaptation

Parameter-efficient tuning aims at updating only a small subset of parameters when adapting a pretrained model to downstream tasks. In this work, we introduce PASTA, in which we only modify the special token representations (e.g., [SEP] and…

Computation and Language · Computer Science 2023-02-15 Xiaocong Yang , James Y. Huang , Wenxuan Zhou , Muhao Chen

Parameter-Efficient Tuning on Layer Normalization for Pre-trained Language Models

Conventional fine-tuning encounters increasing difficulties given the size of current Pre-trained Language Models, which makes parameter-efficient tuning become the focal point of frontier research. Previous methods in this field add…

Computation and Language · Computer Science 2022-12-12 Wang Qi , Yu-Ping Ruan , Yuan Zuo , Taihao Li

FLoRA: Fused forward-backward adapters for parameter efficient fine-tuning and reducing inference-time latencies of LLMs

As the large language models (LLMs) grow in size each day, efficient training and fine-tuning has never been as important as nowadays. This resulted in the great interest in parameter efficient fine-tuning (PEFT), and effective methods…

Machine Learning · Computer Science 2025-11-04 Dhananjaya Gowda , Seoha Song , Junhyun Lee , Harshith Goka

Parameter-Efficient Transfer Learning for NLP

Fine-tuning large pre-trained models is an effective transfer mechanism in NLP. However, in the presence of many downstream tasks, fine-tuning is parameter inefficient: an entire new model is required for every task. As an alternative, we…

Machine Learning · Computer Science 2019-06-14 Neil Houlsby , Andrei Giurgiu , Stanislaw Jastrzebski , Bruna Morrone , Quentin de Laroussilhe , Andrea Gesmundo , Mona Attariyan , Sylvain Gelly

Empirical Study of Transformers for Source Code

Initially developed for natural language processing (NLP), Transformers are now widely used for source code processing, due to the format similarity between source code and text. In contrast to natural language, source code is strictly…

Machine Learning · Computer Science 2021-06-25 Nadezhda Chirkova , Sergey Troshin

Parameter-Efficient Fine-Tuning with Layer Pruning on Free-Text Sequence-to-Sequence Modeling

The increasing size of language models raises great research interests in parameter-efficient fine-tuning such as LoRA that freezes the pre-trained model, and injects small-scale trainable parameters for multiple downstream tasks (e.g.,…

Computation and Language · Computer Science 2023-05-22 Yunqi Zhu , Xuebing Yang , Yuanyuan Wu , Wensheng Zhang

Layer-wise LoRA fine-tuning: a similarity metric approach

Pre-training Large Language Models (LLMs) on web-scale datasets becomes fundamental for advancing general-purpose AI. In contrast, enhancing their predictive performance on downstream tasks typically involves adapting their knowledge…

Machine Learning · Computer Science 2026-02-06 Keith Ando Ogawa , Bruno Lopes Yamamoto , Lucas Lauton de Alcantara , Lucas Pellicer , Rosimeire Pereira Costa , Edson Bollis , Anna Helena Reali Costa , Artur Jordao

On the Effectiveness of Adapter-based Tuning for Pretrained Language Model Adaptation

Adapter-based tuning has recently arisen as an alternative to fine-tuning. It works by adding light-weight adapter modules to a pretrained language model (PrLM) and only updating the parameters of adapter modules when learning on a…

Computation and Language · Computer Science 2021-06-08 Ruidan He , Linlin Liu , Hai Ye , Qingyu Tan , Bosheng Ding , Liying Cheng , Jia-Wei Low , Lidong Bing , Luo Si

Exploring the Capabilities of LLMs for Code Change Related Tasks

Developers deal with code-change-related tasks daily, e.g., reviewing code. Pre-trained code and code-change-oriented models have been adapted to help developers with such tasks. Recently, large language models (LLMs) have shown their…

Software Engineering · Computer Science 2024-07-04 Lishui Fan , Jiakun Liu , Zhongxin Liu , David Lo , Xin Xia , Shanping Li

An empirical study of LoRA-based fine-tuning of large language models for automated test case generation

Automated test case generation from natural language requirements remains a challenging problem in software engineering due to the ambiguity of requirements and the need to produce structured, executable test artifacts. Recent advances in…

Software Engineering · Computer Science 2026-04-09 Milad Moradi , Ke Yan , David Colwell , Rhona Asgari