Related papers: Continuous Diffusion Models Can Obey Formal Syntax

Continuous Diffusion Model for Language Modeling

Diffusion models have emerged as a promising alternative to autoregressive models in modeling discrete categorical data. However, diffusion models that directly work on discrete data space fail to fully exploit the power of iterative…

Machine Learning · Computer Science 2025-10-24 Jaehyeong Jo , Sung Ju Hwang

Conditional Diffusion as Latent Constraints for Controllable Symbolic Music Generation

Recent advances in latent diffusion models have demonstrated state-of-the-art performance in high-dimensional time-series data synthesis while providing flexible control through conditioning and guidance. However, existing methodologies…

Machine Learning · Computer Science 2025-11-11 Matteo Pettenó , Alessandro Ilic Mezza , Alberto Bernardini

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Diffusion language models offer unique benefits over autoregressive models due to their potential for parallelized generation and controllability, yet they lag in likelihood modeling and are limited to fixed-length generation. In this work,…

Machine Learning · Computer Science 2025-05-20 Marianne Arriola , Aaron Gokaslan , Justin T. Chiu , Zhihan Yang , Zhixuan Qi , Jiaqi Han , Subham Sekhar Sahoo , Volodymyr Kuleshov

Constrained Discrete Diffusion

Discrete diffusion models are a class of generative models that construct sequences by progressively denoising samples from a categorical noise distribution. Beyond their rapidly growing ability to generate coherent natural language, these…

Computation and Language · Computer Science 2025-12-11 Michael Cardei , Jacob K Christopher , Thomas Hartvigsen , Bhavya Kailkhura , Ferdinando Fioretto

DINGO: Constrained Inference for Diffusion LLMs

Diffusion LLMs have emerged as a promising alternative to conventional autoregressive LLMs, offering significant potential for improved runtime efficiency. However, existing diffusion models lack the ability to provably enforce…

Machine Learning · Computer Science 2025-05-30 Tarun Suresh , Debangshu Banerjee , Shubham Ugare , Sasa Misailovic , Gagandeep Singh

Latent Diffusion for Language Generation

Diffusion models have achieved great success in modeling continuous data modalities such as images, audio, and video, but have seen limited use in discrete domains such as language. Recent attempts to adapt diffusion to language have…

Computation and Language · Computer Science 2023-11-08 Justin Lovelace , Varsha Kishore , Chao Wan , Eliot Shekhtman , Kilian Q. Weinberger

FastDiSS: Few-step Match Many-step Diffusion Language Model on Sequence-to-Sequence Generation--Full Version

Self-conditioning has been central to the success of continuous diffusion language models, as it allows models to correct previous errors. Yet its ability degrades precisely in the regime where diffusion is most attractive for deployment:…

Computation and Language · Computer Science 2026-04-08 Dat Nguyen-Cong , Tung Kieu , Hoang Thanh-Tung

Flow Map Language Models: One-step Language Modeling via Continuous Denoising

Language models based on discrete diffusion have attracted widespread interest for their potential to provide faster generation than autoregressive models. Despite their promise, these models typically produce samples whose quality sharply…

Computation and Language · Computer Science 2026-05-21 Chanhyuk Lee , Jaehoon Yoo , Manan Agarwal , Sheel Shah , Jerry Huang , Aditi Raghunathan , Seunghoon Hong , Nicholas M. Boffi , Jinwoo Kim

Language Rectified Flow: Advancing Diffusion Language Generation with Probabilistic Flows

Recent works have demonstrated success in controlling sentence attributes ($e.g.$, sentiment) and structure ($e.g.$, syntactic structure) based on the diffusion language model. A key component that drives theimpressive performance for…

Computation and Language · Computer Science 2024-03-26 Shujian Zhang , Lemeng Wu , Chengyue Gong , Xingchao Liu

DiLaDiff: Distilled Latent-Augmented Diffusion for Language Modeling

Diffusion language models intrinsically fail to capture correlations between decoded tokens, which leads to a harsh trade-off between sampling quality and throughput. To solve this issue, we propose DiLaDiff, a variant of masked diffusion…

Machine Learning · Computer Science 2026-05-25 Jean-Marie Lemercier , Tomas Geffner , Karsten Kreis , Morteza Mardani , Arash Vahdat , Ante Jukić

Constrained Decoding of Diffusion LLMs with Context-Free Grammars

Large language models (LLMs) have shown promising performance across diverse domains. Many practical applications of LLMs, such as code completion and structured data extraction, require adherence to syntactic constraints specified by a…

Machine Learning · Computer Science 2025-08-18 Niels Mündler , Jasper Dekoninck , Martin Vechev

Softly Constrained Denoisers for Diffusion Models

Diffusion models struggle to produce samples that respect constraints, a common requirement in scientific applications. Recent approaches have introduced regularization terms in the loss or guidance methods during sampling to enforce such…

Machine Learning · Computer Science 2026-02-06 Victor M. Yeom-Song , Severi Rissanen , Arno Solin , Samuel Kaski , Mingfei Sun

TransFusion: Transcribing Speech with Multinomial Diffusion

Diffusion models have shown exceptional scaling properties in the image synthesis domain, and initial attempts have shown similar benefits for applying diffusion to unconditional text synthesis. Denoising diffusion models attempt to…

Audio and Speech Processing · Electrical Eng. & Systems 2022-10-17 Matthew Baas , Kevin Eloff , Herman Kamper

Discrete Diffusion Models for Language Generation

Diffusion models have emerged as a powerful class of generative models, achieving state-of-the-art results in continuous data domains such as image and video generation. Their core mechanism involves a forward diffusion process that…

Computation and Language · Computer Science 2025-07-10 Ashen Weligalle

DiffAR: Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation

Diffusion models have recently been shown to be relevant for high-quality speech generation. Most work has been focused on generating spectrograms, and as such, they further require a subsequent model to convert the spectrogram to a…

Sound · Computer Science 2024-03-12 Roi Benita , Michael Elad , Joseph Keshet

CtrlDiff: Boosting Large Diffusion Language Models with Dynamic Block Prediction and Controllable Generation

Although autoregressive models have dominated language modeling in recent years, there has been a growing interest in exploring alternative paradigms to the conventional next-token prediction framework. Diffusion-based language models have…

Computation and Language · Computer Science 2025-10-23 Chihan Huang , Hao Tang

Diffusion Guided Language Modeling

Current language models demonstrate remarkable proficiency in text generation. However, for many applications it is desirable to control attributes, such as sentiment, or toxicity, of the generated language -- ideally tailored towards each…

Computation and Language · Computer Science 2024-08-09 Justin Lovelace , Varsha Kishore , Yiwei Chen , Kilian Q. Weinberger

Boosting Fast and High-Quality Speech Synthesis with Linear Diffusion

Denoising Diffusion Probabilistic Models have shown extraordinary ability on various generative tasks. However, their slow inference speed renders them impractical in speech synthesis. This paper proposes a linear diffusion model (LinDiff)…

Sound · Computer Science 2023-06-13 Haogeng Liu , Tao Wang , Jie Cao , Ran He , Jianhua Tao

Latent Refinement Decoding: Enhancing Diffusion-Based Language Models by Refining Belief States

Autoregressive (AR) models remain the standard for natural language generation but still suffer from high latency due to strictly sequential decoding. Recent diffusion-inspired approaches, such as LlaDA and Dream, mitigate this by…

Computation and Language · Computer Science 2025-10-16 Qinglin Zhu , Yizhen Yao , Runcong Zhao , Yanzheng Xiang , Amrutha Saseendran , Chen Jin , Philip Teare , Bin Liang , Yulan He , Lin Gui

Is Your Diffusion Sampler Actually Correct? A Sampler-Centric Evaluation of Discrete Diffusion Language Models

Discrete diffusion language models (dLLMs) provide a fast and flexible alternative to autoregressive models (ARMs) via iterative denoising with parallel updates. However, their evaluation is challenging: existing metrics conflate denoiser…

Machine Learning · Computer Science 2026-05-29 Luhan Tang , Longxuan Yu , Shaorong Zhang , Greg Ver Steeg