English
Related papers

Related papers: Adapting Large Language Models for Parameter-Effic…

200 papers

The success of large language models (LLMs), like GPT-4 and ChatGPT, has led to the development of numerous cost-effective and accessible alternatives that are created by finetuning open-access LLMs with task-specific data (e.g.,…

Computation and Language · Computer Science 2023-10-10 Zhiqiang Hu , Lei Wang , Yihuai Lan , Wanyu Xu , Ee-Peng Lim , Lidong Bing , Xing Xu , Soujanya Poria , Roy Ka-Wei Lee

The entry of large language models (LLMs) into research and commercial spaces has led to a trend of ever-larger models, with initial promises of generalisability, followed by a widespread desire to downsize and create specialised models…

Computation and Language · Computer Science 2024-02-19 Niall Taylor , Upamanyu Ghose , Omid Rohanian , Mohammadmahdi Nouriborji , Andrey Kormilitzin , David Clifton , Alejo Nevado-Holgado

The rise of Artificial Intelligence (AI)-and particularly Large Language Models (LLMs) for code-has reshaped Software Engineering (SE) by enabling the automation of tasks such as code generation, bug detection, and repair. However, these…

Software Engineering · Computer Science 2025-08-18 Saima Afrin , Md Zahidul Haque , Antonio Mastropaolo

With the continuous growth in the number of parameters of transformer-based pretrained language models (PLMs), particularly the emergence of large language models (LLMs) with billions of parameters, many natural language processing (NLP)…

Computation and Language · Computer Science 2023-12-20 Lingling Xu , Haoran Xie , Si-Zhao Joe Qin , Xiaohui Tao , Fu Lee Wang

Large language models (LLMs) demonstrate impressive capabilities to generate accurate code snippets given natural language intents in a zero-shot manner, i.e., without the need for specific fine-tuning. While prior studies have highlighted…

Software Engineering · Computer Science 2024-12-30 Martin Weyssow , Xin Zhou , Kisub Kim , David Lo , Houari Sahraoui

This review surveys the rapid evolution of Meta AI's LLaMA (Large Language Model Meta AI) series - from LLaMA 1 through LLaMA 4 and the specialized parameter-efficient fine-tuning (PEFT) methods developed for these models. We first describe…

Large models such as Large Language Models (LLMs) and Vision Language Models (VLMs) have transformed artificial intelligence, powering applications in natural language processing, computer vision, and multimodal learning. However, fully…

One-size-fits-all large language models (LLMs) are increasingly being used to help people with their writing. However, the style these models are trained to write in may not suit all users or use cases. LLMs would be more useful as writing…

Computation and Language · Computer Science 2024-09-10 Xinyue Liu , Harshita Diddee , Daphne Ippolito

As foundation models continue to exponentially scale in size, efficient methods of adaptation become increasingly critical. Parameter-efficient fine-tuning (PEFT), a recent class of techniques that require only modifying a small percentage…

Computation and Language · Computer Science 2023-05-01 George Pu , Anirudh Jain , Jihan Yin , Russell Kaplan

Fine-tuning large language models (LLMs) aims to adapt pre-trained models to specific tasks using relatively small and domain-specific datasets. Among Parameter-Efficient Fine-Tuning (PEFT) methods, Low-Rank Adaptation (LoRA) stands out by…

Computation and Language · Computer Science 2026-04-16 Yarui Cao , Kai Liu

This paper delves into the pressing need in Parameter-Efficient Fine-Tuning (PEFT) for Large Language Models (LLMs). While LLMs possess remarkable capabilities, their extensive parameter requirements and associated computational demands…

Computation and Language · Computer Science 2023-11-23 Chengyu Wang , Junbing Yan , Wei Zhang , Jun Huang

Multimodal large language models (MLLMs) fine-tuned with multimodal instruction datasets have demonstrated remarkable capabilities in multimodal tasks. However, fine-tuning all parameters of MLLMs has become challenging as they usually…

Computation and Language · Computer Science 2024-06-10 Xiongtao Zhou , Jie He , Yuhua Ke , Guangyao Zhu , Víctor Gutiérrez-Basulto , Jeff Z. Pan

Parameter-efficient fine-tuning (PEFT) methods, which fine-tune only a subset of model parameters, offer a promising solution by reducing the computational costs of tuning large language models (LLMs) while maintaining their performance.…

Software Engineering · Computer Science 2025-11-25 André Storhaug , Jingyue Li

Fine-tuning large language models (LLMs) on downstream tasks requires substantial computational resources. Selective PEFT, a class of parameter-efficient fine-tuning (PEFT) methodologies, aims to mitigate these computational challenges by…

Computation and Language · Computer Science 2025-06-24 Aradhye Agarwal , Suhas K Ramesh , Ayan Sengupta , Tanmoy Chakraborty

Parameter-Efficient Fine-Tuning (PEFT) is widely used for adapting Large Language Models (LLMs) for various tasks. Recently, there has been an increasing demand for fine-tuning a single LLM for multiple tasks because it requires overall…

Computation and Language · Computer Science 2026-05-15 Anjir Ahmed Chowdhury , Syed Zawad , Xiaolong Ma , Xu Dong , Feng Yan

Parameter-Efficient Fine-Tuning (PEFT) methods address the increasing size of Large Language Models (LLMs). Currently, many newly introduced PEFT methods are challenging to replicate, deploy, or compare with one another. To address this, we…

Computation and Language · Computer Science 2026-05-14 Robert Belanec , Ivan Srba , Maria Bielikova

Adapting pretrained language models to novel domains, such as clinical applications, traditionally involves retraining their entire set of parameters. Parameter-Efficient Fine-Tuning (PEFT) techniques for fine-tuning language models…

Computation and Language · Computer Science 2024-06-11 Aryo Pradipta Gema , Pasquale Minervini , Luke Daines , Tom Hope , Beatrice Alex

Parameter-efficient finetuning (PEFT) methods effectively adapt large language models (LLMs) to diverse downstream tasks, reducing storage and GPU memory demands. Despite these advantages, several applications pose new challenges to PEFT…

Machine Learning · Computer Science 2024-11-05 Baohao Liao , Christof Monz

Automated Program Repair (APR) aims to fix bugs by generating patches. And existing work has demonstrated that "pre-training and fine-tuning" paradigm enables Large Language Models (LLMs) improve fixing capabilities on APR. However,…

Software Engineering · Computer Science 2024-09-13 Guochang Li , Chen Zhi , Jialiang Chen , Junxiao Han , Shuiguang Deng

Software systems often record important runtime information in logs to help with troubleshooting. Log-based anomaly detection has become a key research area that aims to identify system issues through log data, ultimately enhancing the…

Software Engineering · Computer Science 2025-04-15 Wei Guan , Jian Cao , Shiyou Qian , Jianqi Gao , Chun Ouyang
‹ Prev 1 2 3 10 Next ›