English

R-LoRA: Randomized Multi-Head LoRA for Efficient Multi-Task Learning

Machine Learning 2025-06-03 v2 Artificial Intelligence

Abstract

Fine-tuning large language models (LLMs) is computationally expensive, and Low-Rank Adaptation (LoRA) provides a cost-effective solution by approximating weight updates through low-rank matrices. In real-world scenarios, LLMs are fine-tuned on data from multiple domains to perform tasks across various fields, embodying multi-task learning (MTL). LoRA often underperforms in such complex scenarios. To enhance LoRA's capability in multi-task learning, we propose R-LoRA, which incorporates Multi-Head Randomization. Multi-Head Randomization diversifies the head matrices through Multi-Head Dropout and Multi-Head Random Initialization, enabling more efficient learning of task-specific features while maintaining shared knowledge representation. Our approach not only improves performance in MTL but also reduces GPU memory usage and training time. Experiments show that R-LoRA's gains stem from increased diversity in the head matrices, demonstrating its effectiveness for multi-task learning. The code is available at https://github.com/jinda-liu/R-LoRA

Keywords

Cite

@article{arxiv.2502.15455,
  title  = {R-LoRA: Randomized Multi-Head LoRA for Efficient Multi-Task Learning},
  author = {Jinda Liu and Yi Chang and Yuan Wu},
  journal= {arXiv preprint arXiv:2502.15455},
  year   = {2025}
}

Comments

8 pages, 7 figures

R2 v1 2026-06-28T21:52:44.708Z