Related papers: Parameter-Efficient Finetuning for Robust Continua…

MAPLE: Multilingual Evaluation of Parameter Efficient Finetuning of Large Language Models

Parameter Efficient Finetuning (PEFT) has emerged as a viable solution for improving the performance of Large Language Models (LLMs) without requiring massive resources and compute. Prior work on multilingual evaluation has shown that there…

Computation and Language · Computer Science 2024-07-23 Divyanshu Aggarwal , Ashutosh Sathe , Ishaan Watts , Sunayana Sitaram

Exploring Continual Fine-Tuning for Enhancing Language Ability in Large Language Model

A common challenge towards the adaptability of Large Language Models (LLMs) is their ability to learn new languages over time without hampering the model's performance on languages in which the model is already proficient (usually English).…

Computation and Language · Computer Science 2026-04-24 Divyanshu Aggarwal , Sankarshan Damle , Navin Goyal , Satya Lokam , Sunayana Sitaram

CC-Tuning: A Cross-Lingual Connection Mechanism for Improving Joint Multilingual Supervised Fine-Tuning

Current large language models (LLMs) often exhibit imbalanced multilingual capabilities due to their English-centric training corpora. To address this, existing fine-tuning approaches operating at the data-level (e.g., through data…

Computation and Language · Computer Science 2025-06-03 Yangfan Ye , Xiaocheng Feng , Zekun Yuan , Xiachong Feng , Libo Qin , Lei Huang , Weitao Ma , Yichong Huang , Zhirui Zhang , Yunfei Lu , Xiaohui Yan , Duyu Tang , Dandan Tu , Bing Qin

Bridging the Language Gap: Dynamic Learning Strategies for Improving Multilingual Performance in LLMs

Large language models (LLMs) have revolutionized various domains but still struggle with non-Latin scripts and low-resource languages. This paper addresses the critical challenge of improving multilingual performance without extensive…

Computation and Language · Computer Science 2025-01-08 Somnath Kumar , Vaibhav Balloli , Mercy Ranjit , Kabir Ahuja , Sunayana Sitaram , Kalika Bali , Tanuja Ganu , Akshay Nambi

Know Where You're Going: Meta-Learning for Parameter-Efficient Fine-Tuning

A recent family of techniques, dubbed lightweight fine-tuning methods, facilitates parameter-efficient transfer learning by updating only a small set of additional parameters while keeping the parameters of the pretrained language model…

Computation and Language · Computer Science 2022-12-09 Mozhdeh Gheini , Xuezhe Ma , Jonathan May

Improving Training Efficiency and Reducing Maintenance Costs via Language Specific Model Merging

Fine-tuning a task-specific multilingual large language model (LLM) involves training the model on a multilingual dataset with examples in all the required languages. Updating one or more supported languages with additional data or adding…

Computation and Language · Computer Science 2026-01-26 Alphaeus Dmonte , Vidhi Gupta , Daniel J Perry , Mark Arehart

Exploring Fine-tuning Techniques for Pre-trained Cross-lingual Models via Continual Learning

Recently, fine-tuning pre-trained language models (e.g., multilingual BERT) to downstream cross-lingual tasks has shown promising results. However, the fine-tuning process inevitably changes the parameters of the pre-trained model and…

Computation and Language · Computer Science 2020-10-06 Zihan Liu , Genta Indra Winata , Andrea Madotto , Pascale Fung

Improving Large Language Model Fine-tuning for Solving Math Problems

Despite their success in many natural language tasks, solving math problems remains a significant challenge for large language models (LLMs). A large gap exists between LLMs' pass-at-one and pass-at-N performance in solving math problems,…

Computation and Language · Computer Science 2023-10-17 Yixin Liu , Avi Singh , C. Daniel Freeman , John D. Co-Reyes , Peter J. Liu

Improving Multilingual Instruction Finetuning via Linguistically Natural and Diverse Datasets

Advancements in Large Language Models (LLMs) have significantly enhanced instruction-following capabilities. However, most Instruction Fine-Tuning (IFT) datasets are predominantly in English, limiting model performance in other languages.…

Computation and Language · Computer Science 2024-07-03 Sathish Reddy Indurthi , Wenxuan Zhou , Shamil Chollampatt , Ravi Agrawal , Kaiqiang Song , Lingxiao Zhao , Chenguang Zhu

Beyond Vanilla Fine-Tuning: Leveraging Multistage, Multilingual, and Domain-Specific Methods for Low-Resource Machine Translation

Fine-tuning multilingual sequence-to-sequence large language models (msLLMs) has shown promise in developing neural machine translation (NMT) systems for low-resource languages (LRLs). However, conventional single-stage fine-tuning methods…

Computation and Language · Computer Science 2025-03-31 Sarubi Thillainathan , Songchen Yuan , En-Shiun Annie Lee , Sanath Jayasena , Surangika Ranathunga

Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources

Large Language Models (LLMs) exhibit significant disparities in performance across languages, primarily benefiting high-resource languages while marginalizing underrepresented ones. Continual Pretraining (CPT) has emerged as a promising…

Computation and Language · Computer Science 2025-10-09 Zihao Li , Shaoxiong Ji , Hengyu Luo , Jörg Tiedemann

MML: Maximal Multiverse Learning for Robust Fine-Tuning of Language Models

Recent state-of-the-art language models utilize a two-phase training procedure comprised of (i) unsupervised pre-training on unlabeled text, and (ii) fine-tuning for a specific supervised task. More recently, many studies have been focused…

Computation and Language · Computer Science 2019-11-15 Itzik Malkiel , Lior Wolf

Breaking Physical and Linguistic Borders: Multilingual Federated Prompt Tuning for Low-Resource Languages

Pre-trained large language models (LLMs) have become a cornerstone of modern natural language processing, with their capabilities extending across a wide range of applications and languages. However, the fine-tuning of multilingual LLMs,…

Computation and Language · Computer Science 2025-07-08 Wanru Zhao , Yihong Chen , Royson Lee , Xinchi Qiu , Yan Gao , Hongxiang Fan , Nicholas D. Lane

Forgetting before Learning: Utilizing Parametric Arithmetic for Knowledge Updating in Large Language Models

Recent advancements in Large Language Models (LLMs) have showcased their remarkable capabilities in text understanding and generation. However, even stronger LLMs are susceptible to acquiring erroneous or obsolete information from the…

Computation and Language · Computer Science 2024-02-19 Shiwen Ni , Dingwei Chen , Chengming Li , Xiping Hu , Ruifeng Xu , Min Yang

Attribution-Guided Continual Learning for Large Language Models

Large language models (LLMs) often suffer from catastrophic forgetting in continual learning: after learning new tasks sequentially, they perform worse on earlier tasks. Existing methods mitigate catastrophic forgetting by data replay,…

Machine Learning · Computer Science 2026-05-08 Yazheng Liu , Yuxuan Wan , Rui Xu , Xi Zhang , Sihong Xie , Hui Xiong

The Unreasonable Effectiveness of Model Merging for Cross-Lingual Transfer in LLMs

Large language models (LLMs) still struggle across tasks outside of high-resource languages. In this work, we investigate cross-lingual transfer to lower-resource languages where task-specific post-training data is scarce. Building on prior…

Computation and Language · Computer Science 2025-10-09 Lucas Bandarkar , Nanyun Peng

Data Efficient Adaptation in Large Language Models via Continuous Low-Rank Fine-Tuning

Recent advancements in Large Language Models (LLMs) have emphasized the critical role of fine-tuning (FT) techniques in adapting LLMs to specific tasks, especially when retraining from scratch is computationally infeasible. Fine-tuning…

Artificial Intelligence · Computer Science 2025-10-23 Xiao Han , Zimo Zhao , Wanyu Wang , Maolin Wang , Zitao Liu , Yi Chang , Xiangyu Zhao

Towards Automatic Continual Learning: A Self-Adaptive Framework for Continual Instruction Tuning

Continual instruction tuning enables large language models (LLMs) to learn incrementally while retaining past knowledge, whereas existing methods primarily focus on how to retain old knowledge rather than on selecting which new knowledge to…

Computation and Language · Computer Science 2025-03-21 Peiyi Lin , Fukai Zhang , Kai Niu , Hao Fu

Bridging Linguistic Gaps: Cross-Lingual Mapping in Pre-Training and Dataset for Enhanced Multilingual LLM Performance

Multilingual Large Language Models (LLMs) struggle with cross-lingual tasks due to data imbalances between high-resource and low-resource languages, as well as monolingual bias in pre-training. Existing methods, such as bilingual…

Computation and Language · Computer Science 2026-04-14 Weihua Zheng , Chang Liu , Zhengyuan Liu , Xin Huang , Kui Wu , Muhammad Huzaifah Md Shahrin , Aiti Aw , Roy Ka-Wei Lee

Enhancing Large Language Model Performance with Gradient-Based Parameter Selection

Large language models (LLMs) have revolutionized lots of fields of research. Although it is well-known that fine-tuning is essential for enhancing the capabilities of LLMs, existing research suggests that there is potential redundancy in…

Artificial Intelligence · Computer Science 2025-02-14 Haoling Li , Xin Zhang , Xiao Liu , Yeyun Gong , Yifan Wang , Qi Chen , Peng Cheng