NaRA: Noise-Aware LoRA for Parameter-Efficient Fine-Tuning of Diffusion LLMs

Authors: Shuaidi Wang, Zhan Zhuang, Ruping Huang, Yu Zhang

Artificial Intelligence2026-05v1license

Abstract

Diffusion Large Language Models (dLLMs) have emerged as a promising non-autoregressive generative paradigm. Given the prohibitive computational cost of full fine-tuning, Parameter-Efficient Fine-Tuning (PEFT) has become the standard approach. However, existing PEFT methods (e.g., LoRA), originally tailored for autoregressive models, rely on static parameters that are agnostic to the noise level. Consequently, they ignore the intrinsic dynamics of the diffusion process, where input distributions and generation difficulty shift significantly along the denoising trajectory, rendering them suboptimal for dLLMs. To address this, we propose Noise-aware Low-Rank Adaptation (NaRA), which introduces a low-rank core matrix generated by a lightweight, globally shared hypernetwork conditioned on the noise level. This design enables the update matrices to vary continuously along the diffusion process while keeping parameter and latency overhead negligible. We provide a theoretical justification for the proposed NaRA framework and empirically demonstrate consistent improvements over noise-agnostic baselines across commonsense reasoning, mathematical reasoning, and code generation benchmarks. Our code is available at https://github.com/generaldi/NaRA.

Cite

@article{arxiv.2605.29716,
  title  = {NaRA: Noise-Aware LoRA for Parameter-Efficient Fine-Tuning of Diffusion LLMs},
  author = {Shuaidi Wang and Zhan Zhuang and Ruping Huang and Yu Zhang},
  journal= {arXiv preprint arXiv:2605.29716},
  year   = {2026}
}

← Artificial Intelligence · Home