English
Related papers

Related papers: Parameter-Efficient Finetuning for Robust Continua…

200 papers

Parameter Efficient Finetuning (PEFT) has emerged as a viable solution for improving the performance of Large Language Models (LLMs) without requiring massive resources and compute. Prior work on multilingual evaluation has shown that there…

Computation and Language · Computer Science 2024-07-23 Divyanshu Aggarwal , Ashutosh Sathe , Ishaan Watts , Sunayana Sitaram

A common challenge towards the adaptability of Large Language Models (LLMs) is their ability to learn new languages over time without hampering the model's performance on languages in which the model is already proficient (usually English).…

Computation and Language · Computer Science 2026-04-24 Divyanshu Aggarwal , Sankarshan Damle , Navin Goyal , Satya Lokam , Sunayana Sitaram

Current large language models (LLMs) often exhibit imbalanced multilingual capabilities due to their English-centric training corpora. To address this, existing fine-tuning approaches operating at the data-level (e.g., through data…

Computation and Language · Computer Science 2025-06-03 Yangfan Ye , Xiaocheng Feng , Zekun Yuan , Xiachong Feng , Libo Qin , Lei Huang , Weitao Ma , Yichong Huang , Zhirui Zhang , Yunfei Lu , Xiaohui Yan , Duyu Tang , Dandan Tu , Bing Qin

Large language models (LLMs) have revolutionized various domains but still struggle with non-Latin scripts and low-resource languages. This paper addresses the critical challenge of improving multilingual performance without extensive…

Computation and Language · Computer Science 2025-01-08 Somnath Kumar , Vaibhav Balloli , Mercy Ranjit , Kabir Ahuja , Sunayana Sitaram , Kalika Bali , Tanuja Ganu , Akshay Nambi

A recent family of techniques, dubbed lightweight fine-tuning methods, facilitates parameter-efficient transfer learning by updating only a small set of additional parameters while keeping the parameters of the pretrained language model…

Computation and Language · Computer Science 2022-12-09 Mozhdeh Gheini , Xuezhe Ma , Jonathan May

Fine-tuning a task-specific multilingual large language model (LLM) involves training the model on a multilingual dataset with examples in all the required languages. Updating one or more supported languages with additional data or adding…

Computation and Language · Computer Science 2026-01-26 Alphaeus Dmonte , Vidhi Gupta , Daniel J Perry , Mark Arehart

Recently, fine-tuning pre-trained language models (e.g., multilingual BERT) to downstream cross-lingual tasks has shown promising results. However, the fine-tuning process inevitably changes the parameters of the pre-trained model and…

Computation and Language · Computer Science 2020-10-06 Zihan Liu , Genta Indra Winata , Andrea Madotto , Pascale Fung

Despite their success in many natural language tasks, solving math problems remains a significant challenge for large language models (LLMs). A large gap exists between LLMs' pass-at-one and pass-at-N performance in solving math problems,…

Computation and Language · Computer Science 2023-10-17 Yixin Liu , Avi Singh , C. Daniel Freeman , John D. Co-Reyes , Peter J. Liu

Advancements in Large Language Models (LLMs) have significantly enhanced instruction-following capabilities. However, most Instruction Fine-Tuning (IFT) datasets are predominantly in English, limiting model performance in other languages.…

Computation and Language · Computer Science 2024-07-03 Sathish Reddy Indurthi , Wenxuan Zhou , Shamil Chollampatt , Ravi Agrawal , Kaiqiang Song , Lingxiao Zhao , Chenguang Zhu

Fine-tuning multilingual sequence-to-sequence large language models (msLLMs) has shown promise in developing neural machine translation (NMT) systems for low-resource languages (LRLs). However, conventional single-stage fine-tuning methods…

Computation and Language · Computer Science 2025-03-31 Sarubi Thillainathan , Songchen Yuan , En-Shiun Annie Lee , Sanath Jayasena , Surangika Ranathunga

Large Language Models (LLMs) exhibit significant disparities in performance across languages, primarily benefiting high-resource languages while marginalizing underrepresented ones. Continual Pretraining (CPT) has emerged as a promising…

Computation and Language · Computer Science 2025-10-09 Zihao Li , Shaoxiong Ji , Hengyu Luo , Jörg Tiedemann

Recent state-of-the-art language models utilize a two-phase training procedure comprised of (i) unsupervised pre-training on unlabeled text, and (ii) fine-tuning for a specific supervised task. More recently, many studies have been focused…

Computation and Language · Computer Science 2019-11-15 Itzik Malkiel , Lior Wolf

Pre-trained large language models (LLMs) have become a cornerstone of modern natural language processing, with their capabilities extending across a wide range of applications and languages. However, the fine-tuning of multilingual LLMs,…

Computation and Language · Computer Science 2025-07-08 Wanru Zhao , Yihong Chen , Royson Lee , Xinchi Qiu , Yan Gao , Hongxiang Fan , Nicholas D. Lane

Recent advancements in Large Language Models (LLMs) have showcased their remarkable capabilities in text understanding and generation. However, even stronger LLMs are susceptible to acquiring erroneous or obsolete information from the…

Computation and Language · Computer Science 2024-02-19 Shiwen Ni , Dingwei Chen , Chengming Li , Xiping Hu , Ruifeng Xu , Min Yang

Large language models (LLMs) often suffer from catastrophic forgetting in continual learning: after learning new tasks sequentially, they perform worse on earlier tasks. Existing methods mitigate catastrophic forgetting by data replay,…

Machine Learning · Computer Science 2026-05-08 Yazheng Liu , Yuxuan Wan , Rui Xu , Xi Zhang , Sihong Xie , Hui Xiong

Large language models (LLMs) still struggle across tasks outside of high-resource languages. In this work, we investigate cross-lingual transfer to lower-resource languages where task-specific post-training data is scarce. Building on prior…

Computation and Language · Computer Science 2025-10-09 Lucas Bandarkar , Nanyun Peng

Recent advancements in Large Language Models (LLMs) have emphasized the critical role of fine-tuning (FT) techniques in adapting LLMs to specific tasks, especially when retraining from scratch is computationally infeasible. Fine-tuning…

Artificial Intelligence · Computer Science 2025-10-23 Xiao Han , Zimo Zhao , Wanyu Wang , Maolin Wang , Zitao Liu , Yi Chang , Xiangyu Zhao

Continual instruction tuning enables large language models (LLMs) to learn incrementally while retaining past knowledge, whereas existing methods primarily focus on how to retain old knowledge rather than on selecting which new knowledge to…

Computation and Language · Computer Science 2025-03-21 Peiyi Lin , Fukai Zhang , Kai Niu , Hao Fu

Multilingual Large Language Models (LLMs) struggle with cross-lingual tasks due to data imbalances between high-resource and low-resource languages, as well as monolingual bias in pre-training. Existing methods, such as bilingual…

Computation and Language · Computer Science 2026-04-14 Weihua Zheng , Chang Liu , Zhengyuan Liu , Xin Huang , Kui Wu , Muhammad Huzaifah Md Shahrin , Aiti Aw , Roy Ka-Wei Lee

Large language models (LLMs) have revolutionized lots of fields of research. Although it is well-known that fine-tuning is essential for enhancing the capabilities of LLMs, existing research suggests that there is potential redundancy in…

Artificial Intelligence · Computer Science 2025-02-14 Haoling Li , Xin Zhang , Xiao Liu , Yeyun Gong , Yifan Wang , Qi Chen , Peng Cheng
‹ Prev 1 2 3 10 Next ›