Related papers: Astraios: Parameter-Efficient Instruction Tuning C…

200 papers

The entry of large language models (LLMs) into research and commercial spaces has led to a trend of ever-larger models, with initial promises of generalisability, followed by a widespread desire to downsize and create specialised models…

Computation and Language · Computer Science 2024-02-19 Niall Taylor , Upamanyu Ghose , Omid Rohanian , Mohammadmahdi Nouriborji , Andrey Kormilitzin , David Clifton , Alejo Nevado-Holgado

Instruction tuning has become an important step for finetuning pretrained language models to better follow human instructions and generalize across various tasks. Pretrained language models are becoming increasingly large, and full…

Computation and Language · Computer Science 2024-11-27 Pengfei He

This paper presents a systematic overview of parameter-efficient fine-tuning methods, covering over 50 papers published between early 2019 and mid-2024. These methods aim to address the challenges of fine-tuning large language models by…

Computation and Language · Computer Science 2024-11-25 Vladislav Lialin , Vijeta Deshpande , Xiaowei Yao , Anna Rumshisky

Parameter-efficient fine-tuning (PEFT) methods, which fine-tune only a subset of model parameters, offer a promising solution by reducing the computational costs of tuning large language models (LLMs) while maintaining their performance.…

Software Engineering · Computer Science 2025-11-25 André Storhaug , Jingyue Li

Large models such as Large Language Models (LLMs) and Vision Language Models (VLMs) have transformed artificial intelligence, powering applications in natural language processing, computer vision, and multimodal learning. However, fully…

Automated Program Repair (APR) aims to fix bugs by generating patches. Existing work has demonstrated that the "pre-training and fine-tuning" paradigm enables Large Language Models (LLMs) to improve their fixing capabilities on APR. However,…

Software Engineering · Computer Science 2024-09-13 Guochang Li , Chen Zhi , Jialiang Chen , Junxiao Han , Shuiguang Deng

Parameter Efficient Fine-Tuning (PEFT) methods are proposed as an alternative fine-tuning approach for Large Language Models (LLM) to minimize high training costs. While prior research demonstrates the effectiveness of PEFT methods in…

Software Engineering · Computer Science 2025-01-28 Amirreza Esmaeili , Iman Saberi , Fatemeh H. Fard

Pre-trained models (PTMs) have achieved great success in various Software Engineering (SE) downstream tasks following the "pre-train then fine-tune" paradigm. As fully fine-tuning all parameters of PTMs can be computationally expensive, a…

Software Engineering · Computer Science 2023-12-27 Wentao Zou , Qi Li , Jidong Ge , Chuanyi Li , Xiaoyu Shen , Liguo Huang , Bin Luo

Fine-tuning large language models (LLMs) aims to adapt pre-trained models to specific tasks using relatively small and domain-specific datasets. Among Parameter-Efficient Fine-Tuning (PEFT) methods, Low-Rank Adaptation (LoRA) stands out by…

Computation and Language · Computer Science 2026-04-16 Yarui Cao , Kai Liu

As foundation models continue to exponentially scale in size, efficient methods of adaptation become increasingly critical. Parameter-efficient fine-tuning (PEFT), a recent class of techniques that require only modifying a small percentage…

Computation and Language · Computer Science 2023-05-01 George Pu , Anirudh Jain , Jihan Yin , Russell Kaplan

Despite the state-of-the-art performance of Large Language Models (LLMs) achieved on many tasks, their massive scale often leads to high computational and environmental costs, limiting their accessibility. Parameter-Efficient Fine-Tuning…

Computation and Language · Computer Science 2026-05-14 Robert Belanec , Branislav Pecher , Ivan Srba , Maria Bielikova

Multimodal large language models (MLLMs) fine-tuned with multimodal instruction datasets have demonstrated remarkable capabilities in multimodal tasks. However, fine-tuning all parameters of MLLMs has become challenging as they usually…

Computation and Language · Computer Science 2024-06-10 Xiongtao Zhou , Jie He , Yuhua Ke , Guangyao Zhu , Víctor Gutiérrez-Basulto , Jeff Z. Pan

Large language models (LLMs) and vision language models (VLMs) demonstrate excellent performance on a wide range of tasks by scaling up parameter counts from O(10^9) to O(10^{12}) levels and further beyond. These large scales make it…

Computation and Language · Computer Science 2023-10-19 Yaqing Wang , Jialin Wu , Tanmaya Dabral , Jiageng Zhang , Geoff Brown , Chun-Ta Lu , Frederick Liu , Yi Liang , Bo Pang , Michael Bendersky , Radu Soricut

Large models, as predicted by scaling-law forecasts, have made groundbreaking progress in many fields, particularly in natural language generation tasks, where they have approached or even surpassed human levels. However, the…

Computation and Language · Computer Science 2025-04-25 Luping Wang , Sheng Chen , Linnan Jiang , Shu Pan , Runze Cai , Sen Yang , Fei Yang

Parameter-Efficient Fine-Tuning (PEFT) methods address the increasing size of Large Language Models (LLMs). Currently, many newly introduced PEFT methods are challenging to replicate, deploy, or compare with one another. To address this, we…

Computation and Language · Computer Science 2026-05-14 Robert Belanec , Ivan Srba , Maria Bielikova

The rise of Artificial Intelligence (AI), and particularly Large Language Models (LLMs) for code, has reshaped Software Engineering (SE) by enabling the automation of tasks such as code generation, bug detection, and repair. However, these…

Software Engineering · Computer Science 2025-08-18 Saima Afrin , Md Zahidul Haque , Antonio Mastropaolo

Standard fine-tuning of large pre-trained language models (PLMs) for downstream tasks requires updating hundreds of millions to billions of parameters, and storing a large copy of the PLM weights for every task resulting in increased cost…

Computation and Language · Computer Science 2022-11-03 Yaqing Wang , Sahaj Agarwal , Subhabrata Mukherjee , Xiaodong Liu , Jing Gao , Ahmed Hassan Awadallah , Jianfeng Gao

Standard fine-tuning of large pre-trained language models (PLMs) for downstream tasks requires updating hundreds of millions to billions of parameters, and storing a large copy of the PLM weights for every task resulting in increased cost…

Computation and Language · Computer Science 2022-11-07 Yaqing Wang , Sahaj Agarwal , Subhabrata Mukherjee , Xiaodong Liu , Jing Gao , Ahmed Hassan Awadallah , Jianfeng Gao

Parameter Efficient Finetuning (PEFT) has emerged as a viable solution for improving the performance of Large Language Models (LLMs) without requiring massive resources and compute. Prior work on multilingual evaluation has shown that there…

Computation and Language · Computer Science 2024-07-23 Divyanshu Aggarwal , Ashutosh Sathe , Ishaan Watts , Sunayana Sitaram

Large models represent a groundbreaking advancement in multiple application fields, enabling remarkable achievements across various tasks. However, their unprecedented scale comes with significant computational costs. These models, often…

Machine Learning · Computer Science 2024-09-17 Zeyu Han , Chao Gao , Jinyang Liu , Jeff Zhang , Sai Qian Zhang