English
Related papers

Related papers: Autoregressive Image Generation without Vector Qua…

200 papers

This paper presents Diffusion via Autoregressive models (D-AR), a new paradigm recasting the image diffusion process as a vanilla autoregressive procedure in the standard next-token-prediction fashion. We start by designing the tokenizer…

Computer Vision and Pattern Recognition · Computer Science 2025-05-30 Ziteng Gao , Mike Zheng Shou

Autoregressive visual generation models typically rely on tokenizers to compress images into tokens that can be predicted sequentially. A fundamental dilemma exists in token representation: discrete tokens enable straightforward modeling…

Computer Vision and Pattern Recognition · Computer Science 2025-09-01 Yuqing Wang , Zhijie Lin , Yao Teng , Yuanzhi Zhu , Shuhuai Ren , Jiashi Feng , Xihui Liu

In the domain of image generation, latent-based generative models occupy a dominant status; however, these models rely heavily on image tokenizer. To meet modeling requirements, autoregressive models possessing the characteristics of…

Computer Vision and Pattern Recognition · Computer Science 2025-03-24 Panpan Wang , Liqiang Niu , Fandong Meng , Jinan Xu , Yufeng Chen , Jie Zhou

Conventional wisdom suggests that autoregressive models are used to process discrete data. When applied to continuous modalities such as visual data, Visual AutoRegressive modeling (VAR) typically resorts to quantization-based approaches to…

Computer Vision and Pattern Recognition · Computer Science 2025-05-13 Chenze Shao , Fandong Meng , Jie Zhou

Autoregressive models have demonstrated remarkable success in sequential data generation, particularly in NLP, but their extension to continuous-domain image generation presents significant challenges. Recent work, the masked autoregressive…

Computer Vision and Pattern Recognition · Computer Science 2025-04-28 Tiankai Hang , Jianmin Bao , Fangyun Wei , Dong Chen

Autoregressive (AR) models for image generation typically adopt a two-stage paradigm of vector quantization and raster-scan ``next-token prediction", inspired by its great success in language modeling. However, due to the huge modality gap,…

Computer Vision and Pattern Recognition · Computer Science 2026-03-19 Hu Yu , Hao Luo , Hangjie Yuan , Yu Rong , Jie Huang , Feng Zhao

While inference-time scaling through search has revolutionized Large Language Models, translating these gains to image generation has proven difficult. Recent attempts to apply search strategies to continuous diffusion models show limited…

Computer Vision and Pattern Recognition · Computer Science 2025-10-28 Erik Riise , Mehmet Onurcan Kaya , Dim P. Papadopoulos

Autoregressive and diffusion models drive the recent breakthroughs on text-to-image generation. Despite their huge success of generating high-realistic images, a common shortcoming of these models is their high inference latency -…

Computer Vision and Pattern Recognition · Computer Science 2023-12-27 Zhangyin Feng , Runyi Hu , Liangxin Liu , Fan Zhang , Duyu Tang , Yong Dai , Xiaocheng Feng , Jiwei Li , Bing Qin , Shuming Shi

Autoregressive (AR) image generators offer a language-model-friendly approach to image generation by predicting discrete image tokens in a causal sequence. However, unlike diffusion models, AR models lack a mechanism to refine previous…

Computer Vision and Pattern Recognition · Computer Science 2026-01-29 Cheng Cheng , Lin Song , Di An , Yicheng Xiao , Xuchong Zhang , Hongbin Sun , Ying Shan

Autoregressive models have emerged as a powerful approach for visual generation but suffer from slow inference speed due to their sequential token-by-token prediction process. In this paper, we propose a simple yet effective approach for…

Computer Vision and Pattern Recognition · Computer Science 2025-04-04 Yuqing Wang , Shuhuai Ren , Zhijie Lin , Yujin Han , Haoyuan Guo , Zhenheng Yang , Difan Zou , Jiashi Feng , Xihui Liu

Class-conditional generative models have emerged as accurate and robust classifiers, with diffusion models demonstrating clear advantages over other visual generative paradigms, including autoregressive (AR) models. In this work, we revisit…

Computer Vision and Pattern Recognition · Computer Science 2026-03-20 Ilia Sudakov , Artem Babenko , Dmitry Baranchuk

This paper presents a novel approach that enables autoregressive video generation with high efficiency. We propose to reformulate the video generation problem as a non-quantized autoregressive modeling of temporal frame-by-frame prediction…

Computer Vision and Pattern Recognition · Computer Science 2025-03-04 Haoge Deng , Ting Pan , Haiwen Diao , Zhengxiong Luo , Yufeng Cui , Huchuan Lu , Shiguang Shan , Yonggang Qi , Xinlong Wang

In this work, we first revisit the sampling issues in current autoregressive (AR) image generation models and identify that image tokens, unlike text tokens, exhibit lower information density and non-uniform spatial distribution.…

Computer Vision and Pattern Recognition · Computer Science 2025-10-21 Xiaoxiao Ma , Feng Zhao , Pengyang Ling , Haibo Qiu , Zhixiang Wei , Hu Yu , Jie Huang , Zhixiong Zeng , Lin Ma

The development of diffusion-based generative models over the past decade has largely proceeded independently of progress in representation learning. These diffusion models typically rely on regression-based objectives and generally lack…

Computer Vision and Pattern Recognition · Computer Science 2025-07-25 Runqian Wang , Kaiming He

Autoregressive models excel in efficiency and plug directly into the transformer ecosystem, delivering robust generalization, predictable scalability, and seamless workflows such as fine-tuning and parallelized training. However, they…

Machine Learning · Computer Science 2025-06-13 Samuel Belkadi , Steve Hong , Marian Chen , Miruna Cretu , Charles Harris , Pietro Lio

Prevailing autoregressive (AR) models for text-to-image generation either rely on heavy, computationally-intensive diffusion models to process continuous image tokens, or employ vector quantization (VQ) to obtain discrete tokens with…

Recent advances in autoregressive (AR) models with continuous tokens for image generation show promising results by eliminating the need for discrete tokenization. However, these models face efficiency challenges due to their sequential…

Computer Vision and Pattern Recognition · Computer Science 2024-12-20 Zhihang Yuan , Yuzhang Shang , Hanling Zhang , Tongcheng Fang , Rui Xie , Bingxin Xu , Yan Yan , Shengen Yan , Guohao Dai , Yu Wang

Symbolic regression refers to the task of finding a closed-form mathematical expression to fit a set of data points. Genetic programming based techniques are the most common algorithms used to tackle this problem, but recently,…

Machine Learning · Computer Science 2025-10-10 Ryan T. Tymkow , Benjamin D. Schnapp , Mojtaba Valipour , Ali Ghodshi

Diffusion models have gained significant attention in the realm of image generation due to their exceptional performance. Their success has been recently expanded to text generation via generating all tokens within a sequence concurrently.…

Computation and Language · Computer Science 2023-12-14 Tong Wu , Zhihao Fan , Xiao Liu , Yeyun Gong , Yelong Shen , Jian Jiao , Hai-Tao Zheng , Juntao Li , Zhongyu Wei , Jian Guo , Nan Duan , Weizhu Chen

The integration of Vector Quantised Variational AutoEncoder (VQ-VAE) with autoregressive models as generation part has yielded high-quality results on image generation. However, the autoregressive models will strictly follow the progressive…

Computer Vision and Pattern Recognition · Computer Science 2024-03-01 Minghui Hu , Yujie Wang , Tat-Jen Cham , Jianfei Yang , P. N. Suganthan
‹ Prev 1 2 3 10 Next ›