中文
相关论文

相关论文: Consistent Diffusion Language Models

200 篇论文

Diffusion Language Models (DLMs) offer a promising parallel generation paradigm but suffer from slow inference due to numerous refinement steps and the inability to use standard KV caching. We introduce CDLM (Consistency Diffusion Language…

机器学习 · 计算机科学 2026-02-23 Minseo Kim , Chenfeng Xu , Coleman Hooper , Harman Singh , Ben Athiwaratkun , Ce Zhang , Kurt Keutzer , Amir Gholami

Diffusion models (DMs) have become the dominant paradigm of generative modeling in a variety of domains by learning stochastic processes from noise to data. Recently, diffusion denoising bridge models (DDBMs), a new formulation of…

机器学习 · 计算机科学 2024-11-01 Guande He , Kaiwen Zheng , Jianfei Chen , Fan Bao , Jun Zhu

Diffusion Language Models (DLMs) are rapidly emerging as a powerful and promising alternative to the dominant autoregressive (AR) paradigm. By generating tokens in parallel through an iterative denoising process, DLMs possess inherent…

计算与语言 · 计算机科学 2025-12-08 Tianyi Li , Mingda Chen , Bowei Guo , Zhiqiang Shen

Continuous diffusion has been the foundation of high-fidelity, controllable, and few-step generation of many data modalities such as images. However, in language modeling, prior continuous diffusion language models (DLMs) lag behind…

计算与语言 · 计算机科学 2026-04-16 Yuxin Chen , Chumeng Liang , Hangke Sui , Ruihan Guo , Chaoran Cheng , Jiaxuan You , Ge Liu

Diffusion Language Models (DLMs) have recently achieved strong results in text generation. However, their multi-step sampling leads to slow inference, limiting practical use. To address this, we extend Inverse Distillation, a technique…

Autoregressive large language models achieve strong results on many benchmarks, but decoding remains fundamentally latency-limited by sequential dependence on previously generated tokens. Diffusion language models (DLMs) promise parallel…

计算与语言 · 计算机科学 2026-01-06 Yihao Liang , Ze Wang , Hao Chen , Ximeng Sun , Jialian Wu , Xiaodong Yu , Jiang Liu , Emad Barsoum , Zicheng Liu , Niraj K. Jha

In this work, we provide a systematic survey of Discrete Diffusion Language Models (dLLMs) and Discrete Diffusion Multimodal Language Models (dMLLMs). Unlike autoregressive (AR) models, dLLMs and dMLLMs adopt a multi-token, parallel…

机器学习 · 计算机科学 2025-09-22 Runpeng Yu , Qi Li , Xinchao Wang

Language models based on discrete diffusion have attracted widespread interest for their potential to provide faster generation than autoregressive models. Despite their promise, these models typically produce samples whose quality sharply…

Diffusion language models promise parallel generation, yet still lag behind autoregressive (AR) models in quality. We stem this gap to a failure of introspective consistency: AR models agree with their own generations, while DLMs often do…

Discrete diffusion language models (dLLMs) provide a fast and flexible alternative to autoregressive models (ARMs) via iterative denoising with parallel updates. However, their evaluation is challenging: existing metrics conflate denoiser…

机器学习 · 计算机科学 2026-05-29 Luhan Tang , Longxuan Yu , Shaorong Zhang , Greg Ver Steeg

Discrete diffusion models are a class of generative models that construct sequences by progressively denoising samples from a categorical noise distribution. Beyond their rapidly growing ability to generate coherent natural language, these…

计算与语言 · 计算机科学 2025-12-11 Michael Cardei , Jacob K Christopher , Thomas Hartvigsen , Bhavya Kailkhura , Ferdinando Fioretto

Diffusion language models (DLMs) have emerged as a promising alternative to autoregressive models for faster inference via parallel token generation. We provide a rigorous foundation for this advantage by formalizing a model of parallel…

机器学习 · 计算机科学 2026-01-01 Haozhe Jiang , Nika Haghtalab , Lijie Chen

Diffusion Language models (DLMs) are a promising avenue for text generation due to their practical properties on tractable controllable generation. They also have the advantage of not having to predict text autoregressively. However,…

机器学习 · 计算机科学 2024-02-13 Sofia Maria Lo Cicero Vaina , Nikita Balagansky , Daniil Gavrilov

Diffusion Language Models (DLMs) have recently achieved significant success due to their any-order generation capabilities. However, existing inference methods typically rely on local, immediate-step metrics such as confidence or entropy…

计算与语言 · 计算机科学 2025-12-03 Kecheng Chen , Ziru Liu , Xijia Tao , Hui Liu , Xinyu Fu , Suiyun Zhang , Dandan Tu , Lingpeng Kong , Rui Liu , Haoliang Li

While Diffusion Language Models (DLMs) are theoretically well-suited for iterative refinement due to their non-causal structure, they often fail to reliably revise incorrect tokens in practice. The key challenge lies in the model's…

机器学习 · 计算机科学 2026-01-30 Shuibai Zhang , Fred Zhangzhi Peng , Yiheng Zhang , Jin Pan , Grigorios G. Chrysos

Consistency Models (CM) (Song et al., 2023) accelerate score-based diffusion model sampling at the cost of sample quality but lack a natural way to trade-off quality for speed. To address this limitation, we propose Consistency Trajectory…

Diffusion language models have recently emerged as a leading alternative to standard language models, due to their ability for bidirectional attention and parallel text generation. In this work, we explore variants for their use in speech…

计算与语言 · 计算机科学 2026-04-30 Davyd Naveriani , Albert Zeyer , Ralf Schlüter , Hermann Ney

Diffusion language models have emerged as a promising approach for text generation. One would naturally expect this method to be an efficient replacement for autoregressive models since multiple tokens can be sampled in parallel during each…

机器学习 · 计算机科学 2025-06-10 Guhao Feng , Yihan Geng , Jian Guan , Wei Wu , Liwei Wang , Di He

Latent diffusion models offer an attractive alternative to discrete diffusion for non-autoregressive text generation by operating on continuous text representations and denoising entire sequences in parallel. The major challenge in latent…

Denoising diffusion bridge models (DDBMs) are a powerful variant of diffusion models for interpolating between two arbitrary paired distributions given as endpoints. Despite their promising performance in tasks like image translation, DDBMs…

机器学习 · 计算机科学 2025-05-01 Kaiwen Zheng , Guande He , Jianfei Chen , Fan Bao , Jun Zhu
‹ 上一页 1 2 3 10 下一页 ›