Machine Learning · Computer Science
Diffusion Language Models are Super Data Learners
Jinjie Ni, Qian Liu, Longxu Dou, Chao Du +4
2025-11-06
Machine Learning · Computer Science
Diffusion Beats Autoregressive in Data-Constrained Settings
Mihir Prabhudesai, Mengning Wu, Amir Zadeh, Katerina Fragkiadaki +1
2025-10-28
Machine Learning · Computer Science
Theoretical Benefit and Limitation of Diffusion Language Model
Guhao Feng, Yihan Geng, Jian Guan, Wei Wu +2
2025-06-10
Computation and Language · Computer Science
Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning
Jiasheng Ye, Zaixiang Zheng, Yu Bao, Lihua Qian +1
2025-02-25
Computation and Language · Computer Science
Masked Diffusion Language Models with Frequency-Informed Training
Despoina Kosmopoulou, Efthymios Georgiou, Vaggelis Dorovatas, Georgios Paraskevopoulos +1
2025-09-08
Computation and Language · Computer Science
How Efficient Are Diffusion Language Models? A Critical Examination of Efficiency Evaluation Practices
Han Peng, Peiyu Liu, Zican Dong, Daixuan Cheng +4
2025-11-11
Computation and Language · Computer Science
On the Role of Discreteness in Diffusion LLMs
Ziqi Jin, Bin Wang, Xiang Lin, Lidong Bing +1
2025-12-30
Machine Learning · Computer Science
Understanding and Accelerating the Training of Masked Diffusion Language Models
Chunsan Hong, Sanghyun Lee, Chieh-Hsin Lai, Satoshi Hayakawa +4
2026-05-14
Computation and Language · Computer Science
A Survey on Diffusion Language Models
Tianyi Li, Mingda Chen, Bowei Guo, Zhiqiang Shen
2025-12-08
Computation and Language · Computer Science
Simple and Effective Masked Diffusion Language Models
Subham Sekhar Sahoo, Marianne Arriola, Yair Schiff, Aaron Gokaslan +4
2024-11-12
Machine Learning · Computer Science
Mask Is What DLLM Needs: A Masked Data Training Paradigm for Diffusion LLMs
Linrui Ma, Yufei Cui, Kai Han, Yunhe Wang
2026-03-18
Machine Learning · Computer Science
Diffusion Models With Learned Adaptive Noise
Subham Sekhar Sahoo, Aaron Gokaslan, Chris De Sa, Volodymyr Kuleshov
2024-11-12
Computation and Language · Computer Science
D4: Improving LLM Pretraining via Document De-Duplication and Diversification
Kushal Tirumala, Daniel Simig, Armen Aghajanyan, Ari S. Morcos
2023-08-24
Computation and Language · Computer Science
How Does Diffusion Influence Pretrained Language Models on Out-of-Distribution Data?
Huazheng Wang, Daixuan Cheng, Haifeng Sun, Jingyu Wang +4
2023-07-27
Machine Learning · Computer Science
Looped Diffusion Language Models
Sanghyun Lee, Chunsan Hong, Seungryong Kim, Jonghyun Lee +2
2026-05-26
Computation and Language · Computer Science
Latent Diffusion for Language Generation
Justin Lovelace, Varsha Kishore, Chao Wan, Eliot Shekhtman +1
2023-11-08
Machine Learning · Computer Science
Learning Unmasking Policies for Diffusion Language Models
Metod Jazbec, Theo X. Olausson, Louis Béthune, Pierre Ablin +5
2026-03-17
Computation and Language · Computer Science
A Cheaper and Better Diffusion Language Model with Soft-Masked Noise
Jiaao Chen, Aston Zhang, Mu Li, Alex Smola +1
2023-04-11
Machine Learning · Computer Science
Beyond Next-Token Prediction: A Performance Characterization of Diffusion versus Autoregressive Language Models
Minseo Kim, Coleman Hooper, Aditya Tomar, Chenfeng Xu +4
2025-12-16
Machine Learning · Computer Science
Soft-Masked Diffusion Language Models
Michael Hersche, Samuel Moor-Smith, Thomas Hofmann, Abbas Rahimi
2026-03-03
Computation and Language · Computer Science
Rethinking Token Prediction: Tree-Structured Diffusion Language Model
Zihao Wu, Haoming Yang, Juncheng Dong, Vahid Tarokh
2026-04-07
Computation and Language · Computer Science
Differences in Text Generated by Diffusion and Autoregressive Language Models
Zeyang Zhang, Chengwei Liang, Xingyan Chen, Meiqi Gu +3
2026-05-14