English
Related papers

Related papers: Path Planning for Masked Diffusion Model Sampling

200 papers

In recent years, masked diffusion models (MDMs) have emerged as a promising alternative approach for generative modeling over discrete domains. Compared to autoregressive models (ARMs), MDMs trade off complexity at training time with…

Machine Learning · Computer Science 2025-08-21 Jaeyeon Kim , Kulin Shah , Vasilis Kontonis , Sham Kakade , Sitan Chen

Masked diffusion models (MDMs) have recently emerged as a novel framework for language modeling. MDMs generate sentences by iteratively denoising masked sequences, filling in [MASK] tokens step by step. Although MDMs support any-order…

Machine Learning · Computer Science 2026-02-27 Chunsan Hong , Seonho An , Min-Soo Kim , Jong Chul Ye

Masked diffusion models (MDM) are powerful generative models for discrete data that generate samples by progressively unmasking tokens in a sequence. Each token can take one of two states: masked or unmasked. We observe that token sequences…

Machine Learning · Computer Science 2025-10-23 Chen-Hao Chao , Wei-Fang Sun , Hanwen Liang , Chun-Yi Lee , Rahul G. Krishnan

Masked diffusion models (MDMs) generate discrete sequences by iterative denoising under an absorbing masking process. In standard masked diffusion, if a token remains masked after a reverse update, the model discards its clean-state…

Machine Learning · Computer Science 2026-05-01 Michael Cardei , Huu Binh Ta , Ferdinando Fioretto

Masked Diffusion Models (MDMs) offer flexible, non-autoregressive generation, but this freedom introduces a challenge: final output quality is highly sensitive to the decoding order. We are the first to formalize this issue, attributing the…

Computation and Language · Computer Science 2025-12-25 Ziyu Chen , Xinbei Jiang , Peng Sun , Tao Lin

Denoising Diffusion Probabilistic Models (DDPMs) have emerged as a powerful family of generative models that can yield high-fidelity samples and competitive log-likelihoods across a range of domains, including image and speech synthesis.…

Machine Learning · Computer Science 2021-06-08 Daniel Watson , Jonathan Ho , Mohammad Norouzi , William Chan

Diffusion language models have emerged as a powerful alternative to autoregressive models, enabling fast inference through more flexible and parallel generation paths. This flexibility of sampling is unlocked by new engineered sampling…

Masked Diffusion Models (MDMs) have emerged as one of the most promising paradigms for generative modeling over discrete domains. It is known that MDMs effectively train to decode tokens in a random order, and that this ordering has…

Machine Learning · Computer Science 2025-11-25 Prateek Garg , Bhavya Kohli , Sunita Sarawagi

Masked Diffusion Models (MDMs) offer a promising alternative to autoregressive language models by enabling parallel token generation and bidirectional context modeling. However, their inference speed is significantly limited by the…

Machine Learning · Computer Science 2026-04-08 Satyam Goyal , Kushal Patel , Tanush Mittal , Arjun Laxman

Diffusion language models, as a promising alternative to traditional autoregressive (AR) models, enable faster generation and richer conditioning on bidirectional context. However, they suffer from a key discrepancy between training and…

Machine Learning · Computer Science 2025-09-26 Haoyu He , Katrin Renz , Yong Cao , Andreas Geiger

Recently proposed generative models for discrete data, such as Masked Diffusion Models (MDMs), exploit conditional independence approximations to reduce the computational cost of popular Auto-Regressive Models (ARMs), at the price of some…

Machine Learning · Statistics 2025-12-18 Hugo Lavenant , Giacomo Zanella

Masked discrete diffusion models (MDMs) are a promising new approach to generative modelling, offering the ability for parallel token generation and therefore greater efficiency than autoregressive counterparts. However, achieving an…

Machine Learning · Computer Science 2026-03-02 David Fox , Sam Bowyer , Song Liu , Laurence Aitchison , Raul Santos-Rodriguez , Mengyue Yang

Masked diffusion models (MDMs) have recently emerged as a promising alternative to autoregressive models over discrete domains. MDMs generate sequences in an any-order, parallel fashion, enabling fast inference and strong performance on…

Machine Learning · Computer Science 2025-09-09 Jaeyeon Kim , Lee Cheuk-Kit , Carles Domingo-Enrich , Yilun Du , Sham Kakade , Timothy Ngotiaoco , Sitan Chen , Michael Albergo

Masked Diffusion Models (MDMs) enable flexible decoding orders, yet existing samplers remain largely greedy, selecting locally certain tokens without accounting for their downstream effects. We show that this myopia can increase cumulative…

Computation and Language · Computer Science 2026-05-25 Kaisen Yang , Jayden Teoh , Kaicheng Yang , Yitong Zhang , Alex Lamb

A major bottleneck of standard auto-regressive large language models is that their inference process is inherently sequential, resulting in very long and costly inference times. To circumvent this, practitioners proposed a class of language…

Machine Learning · Computer Science 2025-11-11 Sitan Chen , Kevin Cong , Jerry Li

Masked diffusion models (MDMs) have emerged as a promising alternative to autoregressive models, enabling parallel token generation while achieving competitive performance. Despite these advantages, MDMs face a fundamental limitation: once…

Machine Learning · Computer Science 2026-03-06 Yair Schiff , Omer Belhasin , Roy Uziel , Guanghan Wang , Marianne Arriola , Gilad Turok , Michael Elad , Volodymyr Kuleshov

Discrete diffusion has achieved state-of-the-art performance, outperforming or approaching autoregressive models on standard benchmarks. In this work, we introduce Discrete Diffusion with Planned Denoising (DDPD), a novel framework that…

Machine Learning · Computer Science 2025-04-11 Sulin Liu , Juno Nam , Andrew Campbell , Hannes Stärk , Yilun Xu , Tommi Jaakkola , Rafael Gómez-Bombarelli

Masked diffusion models (MDMs) offer a compelling alternative to autoregressive models (ARMs) for discrete text generation because they enable parallel token sampling, rather than sequential, left-to-right generation. This means potentially…

Machine Learning · Computer Science 2025-10-28 Iskander Azangulov , Teodora Pandeva , Niranjani Prasad , Javier Zazo , Sushrut Karmalkar

While diffusion language models (DLMs) have achieved competitive performance in text generation, improving their reasoning ability with reinforcement learning remains an active research area. Here, we introduce d2, a reasoning framework…

Machine Learning · Computer Science 2026-02-10 Guanghan Wang , Gilad Turok , Yair Schiff , Marianne Arriola , Volodymyr Kuleshov

Recently, Masked Diffusion Models (MDMs) have shown promising potential across vision, language, and cross-modal generation. However, a notable discrepancy exists between their training and inference procedures. In particular, MDM inference…

Machine Learning · Computer Science 2025-12-30 Renping Zhou , Zanlin Ni , Tianyi Chen , Zeyu Liu , Yang Yue , Yulin Wang , Yuxuan Wang , Jingshu Liu , Gao Huang
‹ Prev 1 2 3 10 Next ›