Related papers: Generalized Interpolating Discrete Diffusion

Generalized Discrete Diffusion from Snapshots

We introduce Generalized Discrete Diffusion from Snapshots (GDDS), a unified framework for discrete diffusion modeling that supports arbitrary noising processes over large discrete state spaces. Our formulation encompasses all existing…

Machine Learning · Statistics 2026-03-24 Oussama Zekri, Théo Uscidda, Nicolas Boullé, Anna Korba

Simplified and Generalized Masked Diffusion for Discrete Data

Masked (or absorbing) diffusion is actively explored as an alternative to autoregressive models for generative modeling of discrete data. However, existing work in this area has been hindered by unnecessarily complex model formulations and…

Machine Learning · Computer Science 2025-01-17 Jiaxin Shi, Kehang Han, Zhe Wang, Arnaud Doucet, Michalis K. Titsias

A Cheaper and Better Diffusion Language Model with Soft-Masked Noise

Diffusion models that are based on iterative denoising have been recently proposed and leveraged in various generation tasks like image generation. Whereas, as a way inherently built for continuous data, existing diffusion models still have…

Computation and Language · Computer Science 2023-04-11 Jiaao Chen, Aston Zhang, Mu Li, Alex Smola, Diyi Yang

Latent-Augmented Discrete Diffusion Models

Discrete diffusion models have emerged as a powerful class of models and a promising route to fast language generation, but practical implementations typically rely on factored reverse transitions ignoring cross-token dependencies and…

Machine Learning · Computer Science 2026-05-14 Dario Shariatian, Alain Durmus, Umut Simsekli, Stefano Peluchetti

Masked Diffusion for Generative Recommendation

Generative recommendation (GR) with semantic IDs (SIDs) has emerged as a promising alternative to traditional recommendation approaches due to its performance gains, capitalization on semantic information provided through language model…

Machine Learning · Computer Science 2025-12-19 Kulin Shah, Bhuvesh Kumar, Neil Shah, Liam Collins

Remasking Discrete Diffusion Models with Inference-Time Scaling

Part of the success of diffusion models stems from their ability to perform iterative refinement, i.e., repeatedly correcting outputs during generation. However, modern masked discrete diffusion lacks this capability: when a token is…

Machine Learning · Computer Science 2026-02-10 Guanghan Wang, Yair Schiff, Subham Sekhar Sahoo, Volodymyr Kuleshov

Simple and Effective Masked Diffusion Language Models

While diffusion models excel at generating high-quality images, prior work reports a significant performance gap between diffusion and autoregressive (AR) methods in language modeling. In this work, we show that simple masked discrete…

Computation and Language · Computer Science 2024-11-12 Subham Sekhar Sahoo, Marianne Arriola, Yair Schiff, Aaron Gokaslan, Edgar Marroquin, Justin T Chiu, Alexander Rush, Volodymyr Kuleshov

Generalized Discrete Diffusion with Self-Correction

Self-correction is an effective technique for maintaining parallel sampling in discrete diffusion models with minimal performance degradation. Prior work has explored self-correction at inference time or during post-training; however, such…

Machine Learning · Computer Science 2026-03-04 Linxuan Wang, Ziyi Wang, Yikun Bai, Wei Deng, Guang Lin, Qifan Song

Interleaved Gibbs Diffusion: Generating Discrete-Continuous Data with Implicit Constraints

We introduce Interleaved Gibbs Diffusion (IGD), a novel generative modeling framework for discrete-continuous data, focusing on problems with important, implicit and unspecified constraints in the data. Most prior works on discrete and…

Machine Learning · Computer Science 2025-07-04 Gautham Govind Anil, Sachin Yadav, Dheeraj Nagaraj, Karthikeyan Shanmugam, Prateek Jain

DiLaDiff: Distilled Latent-Augmented Diffusion for Language Modeling

Diffusion language models intrinsically fail to capture correlations between decoded tokens, which leads to a harsh trade-off between sampling quality and throughput. To solve this issue, we propose DiLaDiff, a variant of masked diffusion…

Machine Learning · Computer Science 2026-05-25 Jean-Marie Lemercier, Tomas Geffner, Karsten Kreis, Morteza Mardani, Arash Vahdat, Ante Jukić

Scaling Beyond Masked Diffusion Language Models

Diffusion language models are a promising alternative to autoregressive models due to their potential for faster generation. Among discrete diffusion approaches, masked diffusion currently dominates, largely driven by strong perplexity on…

Machine Learning · Computer Science 2026-02-17 Subham Sekhar Sahoo, Jean-Marie Lemercier, Zhihan Yang, Justin Deschenaux, Jingyu Liu, John Thickstun, Ante Jukic

From Scores to Gibbs Correctors: Accelerating Uniform-Rate Discrete Diffusion Models

Discrete diffusion models have achieved strong empirical performance in text and other symbolic domains, but, especially for uniform-rate models, they often require many steps to generate a single sample. Existing acceleration methods…

Machine Learning · Computer Science 2026-05-27 Yuchen Liang, Ness Shroff, Yingbin Liang

Beyond Masks: Efficient, Flexible Diffusion Language Models via Deletion-Insertion Processes

While Masked Diffusion Language Models (MDLMs) relying on token masking and unmasking have shown promise in language modeling, their computational efficiency and generation flexibility remain constrained by the masking paradigm. In this…

Computation and Language · Computer Science 2026-03-26 Fangyu Ding, Ding Ding, Sijin Chen, Kaibo Wang, Peng Xu, Zijin Feng, Haoli Bai, Kai Han, Youliang Yan, Binhang Yuan, Jiacheng Sun

GDA: Generalized Diffusion for Robust Test-time Adaptation

Machine learning models struggle with generalization when encountering out-of-distribution (OOD) samples with unexpected distribution shifts. For vision tasks, recent studies have shown that test-time adaptation employing diffusion models…

Computer Vision and Pattern Recognition · Computer Science 2024-04-03 Yun-Yun Tsai, Fu-Chen Chen, Albert Y. C. Chen, Junfeng Yang, Che-Chun Su, Min Sun, Cheng-Hao Kuo

MADI: Masking-Augmented Diffusion with Inference-Time Scaling for Visual Editing

Despite the remarkable success of diffusion models in text-to-image generation, their effectiveness in grounded visual editing and compositional control remains challenging. Motivated by advances in self-supervised learning and in-context…

Computer Vision and Pattern Recognition · Computer Science 2025-07-21 Shreya Kadambi, Risheek Garrepalli, Shubhankar Borse, Munawar Hyatt, Fatih Porikli

Interpolating Discrete Diffusion Models with Controllable Resampling

Discrete diffusion models form a powerful class of generative models across diverse domains, including text and graphs. However, existing approaches face fundamental limitations. Masked diffusion models suffer from irreversible errors due…

Machine Learning · Computer Science 2026-04-21 Marcel Kollovieh, Sirine Ayadi, Stephan Günnemann

Likelihood-Based Diffusion Language Models

Despite a growing interest in diffusion-based language models, existing work has not shown that these models can attain nontrivial likelihoods on standard language modeling benchmarks. In this work, we take the first steps towards closing…

Computation and Language · Computer Science 2023-05-31 Ishaan Gulrajani, Tatsunori B. Hashimoto

Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning

The recent surge of generative AI has been fueled by the generative power of diffusion probabilistic models and the scalable capabilities of large language models. Despite their potential, it remains elusive whether diffusion language…

Computation and Language · Computer Science 2025-02-25 Jiasheng Ye, Zaixiang Zheng, Yu Bao, Lihua Qian, Quanquan Gu

GUD: Generation with Unified Diffusion

Diffusion generative models transform noise into data by inverting a process that progressively adds noise to data samples. Inspired by concepts from the renormalization group in physics, which analyzes systems across different scales, we…

Machine Learning · Computer Science 2024-10-04 Mathis Gerdes, Max Welling, Miranda C. N. Cheng

Coevolutionary Continuous Discrete Diffusion: Make Your Diffusion Language Model a Latent Reasoner

Diffusion language models, especially masked discrete diffusion models, have achieved great success recently. While there are some theoretical and primary empirical results showing the advantages of latent reasoning with looped transformers…

Artificial Intelligence · Computer Science 2026-05-13 Cai Zhou, Chenxiao Yang, Yi Hu, Chenyu Wang, Chubin Zhang, Muhan Zhang, Lester Mackey, Tommi Jaakkola, Stephen Bates, Dinghuai Zhang