Related papers: Autoregressive Image Generation without Vector Qua…

D-AR: Diffusion via Autoregressive Models

This paper presents Diffusion via Autoregressive models (D-AR), a new paradigm recasting the image diffusion process as a vanilla autoregressive procedure in the standard next-token-prediction fashion. We start by designing the tokenizer…

Computer Vision and Pattern Recognition · Computer Science 2025-05-30 Ziteng Gao, Mike Zheng Shou

Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation

Autoregressive visual generation models typically rely on tokenizers to compress images into tokens that can be predicted sequentially. A fundamental dilemma exists in token representation: discrete tokens enable straightforward modeling…

Computer Vision and Pattern Recognition · Computer Science 2025-09-01 Yuqing Wang, Zhijie Lin, Yao Teng, Yuanzhi Zhu, Shuhuai Ren, Jiashi Feng, Xihui Liu

D2C: Unlocking the Potential of Continuous Autoregressive Image Generation with Discrete Tokens

In the domain of image generation, latent-based generative models occupy a dominant status; however, these models rely heavily on image tokenizers. To meet modeling requirements, autoregressive models possessing the characteristics of…

Computer Vision and Pattern Recognition · Computer Science 2025-03-24 Panpan Wang, Liqiang Niu, Fandong Meng, Jinan Xu, Yufeng Chen, Jie Zhou

Continuous Visual Autoregressive Generation via Score Maximization

Conventional wisdom suggests that autoregressive models are used to process discrete data. When applied to continuous modalities such as visual data, Visual AutoRegressive modeling (VAR) typically resorts to quantization-based approaches to…

Computer Vision and Pattern Recognition · Computer Science 2025-05-13 Chenze Shao, Fandong Meng, Jie Zhou

Fast Autoregressive Models for Continuous Latent Generation

Autoregressive models have demonstrated remarkable success in sequential data generation, particularly in NLP, but their extension to continuous-domain image generation presents significant challenges. Recent work, the masked autoregressive…

Computer Vision and Pattern Recognition · Computer Science 2025-04-28 Tiankai Hang, Jianmin Bao, Fangyun Wei, Dong Chen

Frequency Autoregressive Image Generation with Continuous Tokens

Autoregressive (AR) models for image generation typically adopt a two-stage paradigm of vector quantization and raster-scan "next-token prediction", inspired by its great success in language modeling. However, due to the huge modality gap,…

Computer Vision and Pattern Recognition · Computer Science 2026-03-19 Hu Yu, Hao Luo, Hangjie Yuan, Yu Rong, Jie Huang, Feng Zhao

Visual Autoregressive Models Beat Diffusion Models on Inference Time Scaling

While inference-time scaling through search has revolutionized Large Language Models, translating these gains to image generation has proven difficult. Recent attempts to apply search strategies to continuous diffusion models show limited…

Computer Vision and Pattern Recognition · Computer Science 2025-10-28 Erik Riise, Mehmet Onurcan Kaya, Dim P. Papadopoulos

Emage: Non-Autoregressive Text-to-Image Generation

Autoregressive and diffusion models drive the recent breakthroughs in text-to-image generation. Despite their huge success in generating highly realistic images, a common shortcoming of these models is their high inference latency -…

Computer Vision and Pattern Recognition · Computer Science 2023-12-27 Zhangyin Feng, Runyi Hu, Liangxin Liu, Fan Zhang, Duyu Tang, Yong Dai, Xiaocheng Feng, Jiwei Li, Bing Qin, Shuming Shi

From Prediction to Perfection: Introducing Refinement to Autoregressive Image Generation

Autoregressive (AR) image generators offer a language-model-friendly approach to image generation by predicting discrete image tokens in a causal sequence. However, unlike diffusion models, AR models lack a mechanism to refine previous…

Computer Vision and Pattern Recognition · Computer Science 2026-01-29 Cheng Cheng, Lin Song, Di An, Yicheng Xiao, Xuchong Zhang, Hongbin Sun, Ying Shan

Parallelized Autoregressive Visual Generation

Autoregressive models have emerged as a powerful approach for visual generation but suffer from slow inference speed due to their sequential token-by-token prediction process. In this paper, we propose a simple yet effective approach for…

Computer Vision and Pattern Recognition · Computer Science 2025-04-04 Yuqing Wang, Shuhuai Ren, Zhijie Lin, Yujin Han, Haoyuan Guo, Zhenheng Yang, Difan Zou, Jiashi Feng, Xihui Liu

Revisiting Autoregressive Models for Generative Image Classification

Class-conditional generative models have emerged as accurate and robust classifiers, with diffusion models demonstrating clear advantages over other visual generative paradigms, including autoregressive (AR) models. In this work, we revisit…

Computer Vision and Pattern Recognition · Computer Science 2026-03-20 Ilia Sudakov, Artem Babenko, Dmitry Baranchuk

Autoregressive Video Generation without Vector Quantization

This paper presents a novel approach that enables autoregressive video generation with high efficiency. We propose to reformulate the video generation problem as a non-quantized autoregressive modeling of temporal frame-by-frame prediction…

Computer Vision and Pattern Recognition · Computer Science 2025-03-04 Haoge Deng, Ting Pan, Haiwen Diao, Zhengxiong Luo, Yufeng Cui, Huchuan Lu, Shiguang Shan, Yonggang Qi, Xinlong Wang

Towards Better & Faster Autoregressive Image Generation: From the Perspective of Entropy

In this work, we first revisit the sampling issues in current autoregressive (AR) image generation models and identify that image tokens, unlike text tokens, exhibit lower information density and non-uniform spatial distribution.…

Computer Vision and Pattern Recognition · Computer Science 2025-10-21 Xiaoxiao Ma, Feng Zhao, Pengyang Ling, Haibo Qiu, Zhixiang Wei, Hu Yu, Jie Huang, Zhixiong Zeng, Lin Ma

Diffuse and Disperse: Image Generation with Representation Regularization

The development of diffusion-based generative models over the past decade has largely proceeded independently of progress in representation learning. These diffusion models typically rely on regression-based objectives and generally lack…

Computer Vision and Pattern Recognition · Computer Science 2025-07-25 Runqian Wang, Kaiming He

Diffusion-Free Graph Generation with Next-Scale Prediction

Autoregressive models excel in efficiency and plug directly into the transformer ecosystem, delivering robust generalization, predictable scalability, and seamless workflows such as fine-tuning and parallelized training. However, they…

Machine Learning · Computer Science 2025-06-13 Samuel Belkadi, Steve Hong, Marian Chen, Miruna Cretu, Charles Harris, Pietro Lio

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

Prevailing autoregressive (AR) models for text-to-image generation either rely on heavy, computationally-intensive diffusion models to process continuous image tokens, or employ vector quantization (VQ) to obtain discrete tokens with…

Computer Vision and Pattern Recognition · Computer Science 2025-08-19 NextStep Team, Chunrui Han, Guopeng Li, Jingwei Wu, Quan Sun, Yan Cai, Yuang Peng, Zheng Ge, Deyu Zhou, Haomiao Tang, Hongyu Zhou, Kenkun Liu, Ailin Huang, Bin Wang, Changxin Miao, Deshan Sun, En Yu, Fukun Yin, Gang Yu, Hao Nie, Haoran Lv, Hanpeng Hu, Jia Wang, Jian Zhou, Jianjian Sun, Kaijun Tan, Kang An, Kangheng Lin, Liang Zhao, Mei Chen, Peng Xing, Rui Wang, Shiyu Liu, Shutao Xia, Tianhao You, Wei Ji, Xianfang Zeng, Xin Han, Xuelin Zhang, Yana Wei, Yanming Xu, Yimin Jiang, Yingming Wang, Yu Zhou, Yucheng Han, Ziyang Meng, Binxing Jiao, Daxin Jiang, Xiangyu Zhang, Yibo Zhu

E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling

Recent advances in autoregressive (AR) models with continuous tokens for image generation show promising results by eliminating the need for discrete tokenization. However, these models face efficiency challenges due to their sequential…

Computer Vision and Pattern Recognition · Computer Science 2024-12-20 Zhihang Yuan, Yuzhang Shang, Hanling Zhang, Tongcheng Fang, Rui Xie, Bingxin Xu, Yan Yan, Shengen Yan, Guohao Dai, Yu Wang

Symbolic-Diffusion: Deep Learning Based Symbolic Regression with D3PM Discrete Token Diffusion

Symbolic regression refers to the task of finding a closed-form mathematical expression to fit a set of data points. Genetic programming based techniques are the most common algorithms used to tackle this problem, but recently,…

Machine Learning · Computer Science 2025-10-10 Ryan T. Tymkow, Benjamin D. Schnapp, Mojtaba Valipour, Ali Ghodshi

AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation

Diffusion models have gained significant attention in the realm of image generation due to their exceptional performance. Their success has been recently expanded to text generation via generating all tokens within a sequence concurrently.…

Computation and Language · Computer Science 2023-12-14 Tong Wu, Zhihao Fan, Xiao Liu, Yeyun Gong, Yelong Shen, Jian Jiao, Hai-Tao Zheng, Juntao Li, Zhongyu Wei, Jian Guo, Nan Duan, Weizhu Chen

Global Context with Discrete Diffusion in Vector Quantised Modelling for Image Generation

The integration of the Vector Quantised Variational AutoEncoder (VQ-VAE) with autoregressive models as the generation part has yielded high-quality results on image generation. However, the autoregressive models will strictly follow the progressive…

Computer Vision and Pattern Recognition · Computer Science 2024-03-01 Minghui Hu, Yujie Wang, Tat-Jen Cham, Jianfei Yang, P. N. Suganthan