English

Reversible Diffusion Decoding for Diffusion Language Models

Computation and Language 2026-02-03 v1 Artificial Intelligence

Abstract

Diffusion language models enable parallel token generation through block-wise decoding, but their irreversible commitments can lead to stagnation, where the reverse diffusion process fails to make further progress under a suboptimal context.We propose Reversible Diffusion Decoding (RDD), a decoding framework that introduces reversibility into block-wise diffusion generation. RDD detects stagnation as a state-dependent failure of the reverse process and enables efficient backtracking to earlier blocks without recomputation via cached model states. To avoid repeated failure trajectories, RDD applies confidence-guided re-masking to selectively reinitialize uncertain tokens while preserving reliable context.This reversible formulation allows decoding to recover from early commitment errors while maintaining the parallel efficiency of diffusion-based generation. Experiments show that RDD improves generation robustness and quality over baselines with minimal computational overhead.

Keywords

Cite

@article{arxiv.2602.00150,
  title  = {Reversible Diffusion Decoding for Diffusion Language Models},
  author = {Xinyun Wang and Min Zhang and Sen Cui and Zhikang Chen and Bo Jiang and Kun Kuang and Mingbao Lin},
  journal= {arXiv preprint arXiv:2602.00150},
  year   = {2026}
}