Related papers: Autoregressive Image Generation with Randomized Pa…

Parallelized Autoregressive Visual Generation

Autoregressive models have emerged as a powerful approach for visual generation but suffer from slow inference speed due to their sequential token-by-token prediction process. In this paper, we propose a simple yet effective approach for…

Computer Vision and Pattern Recognition · Computer Science 2025-04-04 Yuqing Wang , Shuhuai Ren , Zhijie Lin , Yujin Han , Haoyuan Guo , Zhenheng Yang , Difan Zou , Jiashi Feng , Xihui Liu

Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation

We present Locality-aware Parallel Decoding (LPD) to accelerate autoregressive image generation. Traditional autoregressive image generation relies on next-patch prediction, a memory-bound process that leads to high latency. Existing works…

Computer Vision and Pattern Recognition · Computer Science 2026-03-12 Zhuoyang Zhang , Luke J. Huang , Chengyue Wu , Shang Yang , Kelly Peng , Yao Lu , Song Han

From Sequential to Spatial: Reordering Autoregression for Efficient Visual Generation

Inspired by the remarkable success of autoregressive models in language modeling, this paradigm has been widely adopted in visual generation. However, the sequential token-by-token decoding mechanism inherent in traditional autoregressive…

Computer Vision and Pattern Recognition · Computer Science 2026-01-01 Siyang Wang , Hanting Li , Wei Li , Jie Hu , Xinghao Chen , Feng Zhao

Next Patch Prediction for Autoregressive Visual Generation

Autoregressive models, built based on the Next Token Prediction (NTP) paradigm, show great potential in developing a unified framework that integrates both language and vision tasks. Pioneering works introduce NTP to autoregressive visual…

Computer Vision and Pattern Recognition · Computer Science 2025-03-20 Yatian Pang , Peng Jin , Shuo Yang , Bin Lin , Bin Zhu , Zhenyu Tang , Liuhan Chen , Francis E. H. Tay , Ser-Nam Lim , Harry Yang , Li Yuan

Randomized Autoregressive Visual Generation

This paper presents Randomized AutoRegressive modeling (RAR) for visual generation, which sets a new state-of-the-art performance on the image generation task while maintaining full compatibility with language modeling frameworks. The…

Computer Vision and Pattern Recognition · Computer Science 2024-11-04 Qihang Yu , Ju He , Xueqing Deng , Xiaohui Shen , Liang-Chieh Chen

AR-RAG: Autoregressive Retrieval Augmentation for Image Generation

We introduce Autoregressive Retrieval Augmentation (AR-RAG), a novel paradigm that enhances image generation by autoregressively incorporating knearest neighbor retrievals at the patch level. Unlike prior methods that perform a single,…

Computer Vision and Pattern Recognition · Computer Science 2025-06-17 Jingyuan Qi , Zhiyang Xu , Qifan Wang , Lifu Huang

Towards Better & Faster Autoregressive Image Generation: From the Perspective of Entropy

In this work, we first revisit the sampling issues in current autoregressive (AR) image generation models and identify that image tokens, unlike text tokens, exhibit lower information density and non-uniform spatial distribution.…

Computer Vision and Pattern Recognition · Computer Science 2025-10-21 Xiaoxiao Ma , Feng Zhao , Pengyang Ling , Haibo Qiu , Zhixiang Wei , Hu Yu , Jie Huang , Zhixiong Zeng , Lin Ma

Learning to Expand Images for Efficient Visual Autoregressive Modeling

Autoregressive models have recently shown great promise in visual generation by leveraging discrete token sequences akin to language modeling. However, existing approaches often suffer from inefficiency, either due to token-by-token…

Computer Vision and Pattern Recognition · Computer Science 2025-11-20 Ruiqing Yang , Kaixin Zhang , Zheng Zhang , Shan You , Tao Huang

Neighboring Autoregressive Modeling for Efficient Visual Generation

Visual autoregressive models typically adhere to a raster-order ``next-token prediction" paradigm, which overlooks the spatial and temporal locality inherent in visual content. Specifically, visual tokens exhibit significantly stronger…

Computer Vision and Pattern Recognition · Computer Science 2025-03-17 Yefei He , Yuanyu He , Shaoxuan He , Feng Chen , Hong Zhou , Kaipeng Zhang , Bohan Zhuang

Fast Autoregressive Video Generation with Diagonal Decoding

Autoregressive Transformer models have demonstrated impressive performance in video generation, but their sequential token-by-token decoding process poses a major bottleneck, particularly for long videos represented by tens of thousands of…

Computer Vision and Pattern Recognition · Computer Science 2025-03-19 Yang Ye , Junliang Guo , Haoyu Wu , Tianyu He , Tim Pearce , Tabish Rashid , Katja Hofmann , Jiang Bian

ARGenSeg: Image Segmentation with Autoregressive Image Generation Model

We propose a novel AutoRegressive Generation-based paradigm for image Segmentation (ARGenSeg), achieving multimodal understanding and pixel-level perception within a unified framework. Prior works integrating image segmentation into…

Computer Vision and Pattern Recognition · Computer Science 2025-10-24 Xiaolong Wang , Lixiang Ru , Ziyuan Huang , Kaixiang Ji , Dandan Zheng , Jingdong Chen , Jun Zhou

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

Prevailing autoregressive (AR) models for text-to-image generation either rely on heavy, computationally-intensive diffusion models to process continuous image tokens, or employ vector quantization (VQ) to obtain discrete tokens with…

Computer Vision and Pattern Recognition · Computer Science 2025-08-19 NextStep Team , Chunrui Han , Guopeng Li , Jingwei Wu , Quan Sun , Yan Cai , Yuang Peng , Zheng Ge , Deyu Zhou , Haomiao Tang , Hongyu Zhou , Kenkun Liu , Ailin Huang , Bin Wang , Changxin Miao , Deshan Sun , En Yu , Fukun Yin , Gang Yu , Hao Nie , Haoran Lv , Hanpeng Hu , Jia Wang , Jian Zhou , Jianjian Sun , Kaijun Tan , Kang An , Kangheng Lin , Liang Zhao , Mei Chen , Peng Xing , Rui Wang , Shiyu Liu , Shutao Xia , Tianhao You , Wei Ji , Xianfang Zeng , Xin Han , Xuelin Zhang , Yana Wei , Yanming Xu , Yimin Jiang , Yingming Wang , Yu Zhou , Yucheng Han , Ziyang Meng , Binxing Jiao , Daxin Jiang , Xiangyu Zhang , Yibo Zhu

Rethinking Autoregressive Models for Lossless Image Compression via Hierarchical Parallelism and Progressive Adaptation

Autoregressive (AR) models, the theoretical performance benchmark for learned lossless image compression, are often dismissed as impractical due to prohibitive computational cost. This work re-thinks this paradigm, introducing a framework…

Computer Vision and Pattern Recognition · Computer Science 2025-11-17 Daxin Li , Yuanchao Bai , Kai Wang , Wenbo Zhao , Junjun Jiang , Xianming Liu

Frequency Autoregressive Image Generation with Continuous Tokens

Autoregressive (AR) models for image generation typically adopt a two-stage paradigm of vector quantization and raster-scan ``next-token prediction", inspired by its great success in language modeling. However, due to the huge modality gap,…

Computer Vision and Pattern Recognition · Computer Science 2026-03-19 Hu Yu , Hao Luo , Hangjie Yuan , Yu Rong , Jie Huang , Feng Zhao

RandAR: Decoder-only Autoregressive Visual Generation in Random Orders

We introduce RandAR, a decoder-only visual autoregressive (AR) model capable of generating images in arbitrary token orders. Unlike previous decoder-only AR models that rely on a predefined generation order, RandAR removes this inductive…

Computer Vision and Pattern Recognition · Computer Science 2025-07-09 Ziqi Pang , Tianyuan Zhang , Fujun Luan , Yunze Man , Hao Tan , Kai Zhang , William T. Freeman , Yu-Xiong Wang

ZipAR: Parallel Auto-regressive Image Generation through Spatial Locality

In this paper, we propose ZipAR, a training-free, plug-and-play parallel decoding framework for accelerating auto-regressive (AR) visual generation. The motivation stems from the observation that images exhibit local structures, and…

Computer Vision and Pattern Recognition · Computer Science 2025-07-01 Yefei He , Feng Chen , Yuanyu He , Shaoxuan He , Hong Zhou , Kaipeng Zhang , Bohan Zhuang

Direction-Aware Diagonal Autoregressive Image Generation

The raster-ordered image token sequence exhibits a significant Euclidean distance between index-adjacent tokens at line breaks, making it unsuitable for autoregressive generation. To address this issue, this paper proposes Direction-Aware…

Computer Vision and Pattern Recognition · Computer Science 2025-04-17 Yijia Xu , Jianzhong Ju , Jian Luan , Jinshi Cui

VPG: Visual Prefix Guidance for Autoregressive Image and Video Generation

Autoregressive image and video generators are trained with teacher-forced histories but must sample from their own generated prefixes at inference time, making them vulnerable to exposure bias and prefix drift. Existing remedies either…

Computer Vision and Pattern Recognition · Computer Science 2026-05-29 Xinyao Liao , Qiyuan He , Yicong Li , Jiayin Zhu , Xiaoye Qu , Wei Wei , Angela Yao

Progressive Checkerboards for Autoregressive Multiscale Image Generation

A key challenge in autoregressive image generation is to efficiently sample independent locations in parallel, while still modeling mutual dependencies with serial conditioning. Some recent works have addressed this by conditioning between…

Computer Vision and Pattern Recognition · Computer Science 2026-02-26 David Eigen

Visual Autoregressive Modeling for Image Super-Resolution

Image Super-Resolution (ISR) has seen significant progress with the introduction of remarkable generative models. However, challenges such as the trade-off issues between fidelity and realism, as well as computational complexity, have also…

Computer Vision and Pattern Recognition · Computer Science 2025-02-03 Yunpeng Qu , Kun Yuan , Jinhua Hao , Kai Zhao , Qizhi Xie , Ming Sun , Chao Zhou