Related papers: Autoregressive Diffusion Models

200 papers

Masked diffusion models (MDMs) have emerged as a promising approach for language modeling, yet they face a performance gap compared to autoregressive models (ARMs) and require more training iterations. In this work, we present the…

Machine Learning · Computer Science 2026-01-26 Mahdi Karami, Ali Ghodsi

Autoregressive models (ARMs) and diffusion models (DMs) represent two leading paradigms in generative modeling, each excelling in distinct areas: ARMs in global context modeling and long-sequence generation, and DMs in generating…

Machine Learning · Computer Science 2024-10-08 Hyungjin Chung, Dohun Lee, Jong Chul Ye

Post-training pretrained autoregressive models (ARMs) into masked diffusion models (MDMs) has emerged as a cost-effective way to overcome the limitations of sequential generation. Yet it remains unclear whether post-trained MDMs acquire…

Machine Learning · Computer Science 2026-05-29 Injin Kong, Hyoungjoon Lee, Yohan Jo

Autoregressive language models, despite their impressive capabilities, struggle with complex reasoning and long-term planning tasks. We introduce discrete diffusion models as a novel solution to these challenges. Through the lens of subgoal…

Computation and Language · Computer Science 2025-02-19 Jiacheng Ye, Jiahui Gao, Shansan Gong, Lin Zheng, Xin Jiang, Zhenguo Li, Lingpeng Kong

Despite advances in test-time scaling and diffusion finetuning, guidance for Auto-Regressive Diffusion Models (ARDMs) remains underexplored. We introduce an amortized framework that augments a pretrained ARDM with an offline-trained…

Machine Learning · Computer Science 2026-05-12 Prakhar Srivastava, Farrin Marouf Sofian, Francesco Immorlano, Kushagra Pandey, Stephan Mandt

Diffusion language models have recently emerged as a competitive alternative to autoregressive language models. Beyond next-token generation, they are more efficient and flexible by enabling parallel and any-order token generation. However,…

Machine Learning · Computer Science 2025-11-18 Chenxiao Yang, Cai Zhou, David Wipf, Zhiyuan Li

Diffusion language models enable any-order generation and bidirectional conditioning, offering appealing flexibility for tasks such as infilling, rewriting, and self-correction. However, their formulation, predicting one part of a sequence…

Computation and Language · Computer Science 2026-01-21 Tianqi Du, Lizhe Fang, Weijie Yang, Chenheng Zhang, Zeming Wei, Yifei Wang, Yisen Wang

Autoregressive (AR) models have long dominated the landscape of large language models, driving progress across a wide range of tasks. Recently, diffusion-based language models have emerged as a promising alternative, though their advantages…

Machine Learning · Computer Science 2025-10-28 Mihir Prabhudesai, Mengning Wu, Amir Zadeh, Katerina Fragkiadaki, Deepak Pathak

Autoregressive models (ARMs) are hindered by slow sequential inference. While masked diffusion models (MDMs) offer a parallel alternative, they suffer from critical drawbacks: high computational overhead from precluding Key-Value (KV)…

Computation and Language · Computer Science 2026-03-06 Jia-Nan Li, Jian Guan, Wei Wu, Chongxuan Li

Autoregressive models (ARMs) have become the workhorse for sequence generation tasks, since many problems can be modeled as next-token prediction. While there appears to be a natural ordering for text (i.e., left-to-right), for many data…

Machine Learning · Computer Science 2025-07-15 Zhe Wang, Jiaxin Shi, Nicolas Heess, Arthur Gretton, Michalis K. Titsias

Diffusion models have emerged as a powerful class of generative models, achieving state-of-the-art results in continuous data domains such as image and video generation. Their core mechanism involves a forward diffusion process that…

Computation and Language · Computer Science 2025-07-10 Ashen Weligalle

Autoregressive models (ARMs) currently hold state-of-the-art performance in likelihood-based modeling of image and audio data. Generally, neural network based ARMs are designed to allow fast inference, but sampling from these models is…

Machine Learning · Computer Science 2020-07-09 Auke Wiggers, Emiel Hoogeboom

The task of video generation requires synthesizing visually realistic and temporally coherent video frames. Existing methods primarily use asynchronous auto-regressive models or synchronous diffusion models to address this challenge.…

Computer Vision and Pattern Recognition · Computer Science 2025-03-11 Mingzhen Sun, Weining Wang, Gen Li, Jiawei Liu, Jiahui Sun, Wanquan Feng, Shanshan Lao, SiYu Zhou, Qian He, Jing Liu

In sequence-to-sequence Transformer ASR, autoregressive (AR) models achieve strong accuracy but suffer from slow decoding, while non-autoregressive (NAR) models enable parallel decoding at the cost of degraded performance. We propose a…

Audio and Speech Processing · Electrical Eng. & Systems 2026-02-26 Hao Yen, Pin-Jui Ku, Ante Jukić, Sabato Marco Siniscalchi

Diffusion models with transformer architectures have demonstrated promising capabilities in generating high-fidelity images and scalability for high resolution. However, the iterative sampling process required for synthesis is very…

Computer Vision and Pattern Recognition · Computer Science 2025-04-16 Yeongmin Kim, Sotiris Anagnostidis, Yuming Du, Edgar Schönfeld, Jonas Kohler, Markos Georgopoulos, Albert Pumarola, Ali Thabet, Artsiom Sanakoyeu

Time series forecasting (TSF) is essential in various domains, and recent advancements in diffusion-based TSF models have shown considerable promise. However, these models typically adopt traditional diffusion patterns, treating TSF as a…

Machine Learning · Computer Science 2024-12-13 Jiaxin Gao, Qinglong Cao, Yuntian Chen

Diffusion models have shown remarkable performance on many generative tasks. Despite recent success, most diffusion models are restricted in that they only allow linear transformation of the data distribution. In contrast, a broader family of…

Machine Learning · Computer Science 2024-06-04 Grigory Bartosh, Dmitry Vetrov, Christian A. Naesseth

Deep generative models have emerged as promising tools for detecting arbitrary anomalies in data, dispensing with the necessity for manual labelling. Recently, autoregressive transformers have achieved state-of-the-art performance for…

Masked Diffusion Models (MDMs) have emerged as one of the most promising paradigms for generative modeling over discrete domains. It is known that MDMs effectively train to decode tokens in a random order, and that this ordering has…

Machine Learning · Computer Science 2025-11-25 Prateek Garg, Bhavya Kohli, Sunita Sarawagi

Diffusion-based graph generative models have recently obtained promising results for graph generation. However, existing diffusion-based graph generative models are mostly one-shot generative models that apply Gaussian diffusion in the…

Artificial Intelligence · Computer Science 2023-07-19 Lingkai Kong, Jiaming Cui, Haotian Sun, Yuchen Zhuang, B. Aditya Prakash, Chao Zhang