Related papers: FutureFill: Fast Generation from Convolutional Seq…

Fast Generation for Convolutional Autoregressive Models

Convolutional autoregressive models have recently demonstrated state-of-the-art performance on a number of generation tasks. While fast, parallel training methods have been crucial for their success, generation is typically implemented in a…

Machine Learning · Computer Science 2017-04-21 Prajit Ramachandran , Tom Le Paine , Pooya Khorrami , Mohammad Babaeizadeh , Shiyu Chang , Yang Zhang , Mark A. Hasegawa-Johnson , Roy H. Campbell , Thomas S. Huang

Predicting Through Generation: Why Generation Is Better for Prediction

This paper argues that generating output tokens is more effective than using pooled representations for prediction tasks because token-level generation retains more mutual information. Since LLMs are trained on massive text corpora using…

Computation and Language · Computer Science 2025-05-28 Md Kowsher , Nusrat Jahan Prottasha , Prakash Bhat , Chun-Nam Yu , Mojtaba Soltanalian , Ivan Garibay , Ozlem Garibay , Chen Chen , Niloofar Yousefi

Efficient Generative Modeling with Residual Vector Quantization-Based Tokens

We introduce ResGen, an efficient Residual Vector Quantization (RVQ)-based generative model for high-fidelity generation with fast sampling. RVQ improves data fidelity by increasing the number of quantization steps, referred to as depth,…

Machine Learning · Computer Science 2025-06-03 Jaehyeon Kim , Taehong Moon , Keon Lee , Jaewoong Cho

FastSeq: Make Sequence Generation Faster

Transformer-based models have made tremendous impacts in natural language generation. However the inference speed is a bottleneck due to large model size and intensive computing involved in auto-regressive decoding process. We develop…

Computation and Language · Computer Science 2021-07-14 Yu Yan , Fei Hu , Jiusheng Chen , Nikhil Bhendawade , Ting Ye , Yeyun Gong , Nan Duan , Desheng Cui , Bingyu Chi , Ruofei Zhang

FlashPrefill: Instantaneous Pattern Discovery and Thresholding for Ultra-Fast Long-Context Prefilling

Long-context modeling is a pivotal capability for Large Language Models, yet the quadratic complexity of attention remains a critical bottleneck, particularly during the compute-intensive prefilling phase. While various sparse attention…

Computation and Language · Computer Science 2026-03-09 Qihang Fan , Huaibo Huang , Zhiying Wu , Juqiu Wang , Bingning Wang , Ran He

Fast and Flexible Indoor Scene Synthesis via Deep Convolutional Generative Models

We present a new, fast and flexible pipeline for indoor scene synthesis that is based on deep convolutional generative models. Our method operates on a top-down image-based representation, and inserts objects iteratively into the scene by…

Computer Vision and Pattern Recognition · Computer Science 2018-12-03 Daniel Ritchie , Kai Wang , Yu-an Lin

Unifying Autoregressive and Diffusion-Based Sequence Generation

We present significant extensions to diffusion-based sequence generation models, blurring the line with autoregressive language models. We introduce hyperschedules, which assign distinct noise schedules to individual token positions,…

Machine Learning · Computer Science 2025-10-08 Nima Fathi , Torsten Scholak , Pierre-André Noël

Characterizing and Efficiently Accelerating Multimodal Generation Model Inference

Generative artificial intelligence (AI) technology is revolutionizing the computing industry. Not only its applications have broadened to various sectors but also poses new system design and optimization opportunities. The technology is…

Machine Learning · Computer Science 2025-05-13 Yejin Lee , Anna Sun , Basil Hosmer , Bilge Acun , Can Balioglu , Changhan Wang , Charles David Hernandez , Christian Puhrsch , Daniel Haziza , Driss Guessous , Francisco Massa , Jacob Kahn , Jeffrey Wan , Jeremy Reizenstein , Jiaqi Zhai , Joe Isaacson , Joel Schlosser , Juan Pino , Kaushik Ram Sadagopan , Leonid Shamis , Linjian Ma , Min-Jae Hwang , Mingda Chen , Mostafa Elhoushi , Pedro Rodriguez , Ram Pasunuru , Scott Yih , Sravya Popuri , Xing Liu , Carole-Jean Wu

PixelSNAIL: An Improved Autoregressive Generative Model

Autoregressive generative models consistently achieve the best results in density estimation tasks involving high dimensional data, such as images or audio. They pose density estimation as a sequence modeling task, where a recurrent neural…

Machine Learning · Computer Science 2017-12-29 Xi Chen , Nikhil Mishra , Mostafa Rohaninejad , Pieter Abbeel

Recursive Flow Matching

Generative models have emerged as a powerful paradigm for solving physics systems and modeling complex spatiotemporal dynamics. However, achieving high physical accuracy without incurring high computational cost remains a fundamental…

Machine Learning · Computer Science 2026-05-27 Jiahe Huang , Sihan Xu , Sharvaree Vadgama , Rose Yu

DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction

This paper presents DetailFlow, a coarse-to-fine 1D autoregressive (AR) image generation method that models images through a novel next-detail prediction strategy. By learning a resolution-aware token sequence supervised with progressively…

Computer Vision and Pattern Recognition · Computer Science 2025-11-12 Yiheng Liu , Liao Qu , Huichao Zhang , Xu Wang , Yi Jiang , Yiming Gao , Hu Ye , Xian Li , Shuai Wang , Daniel K. Du , Fangmin Chen , Zehuan Yuan , Xinglong Wu

Future Sight: Dynamic Story Generation with Large Pretrained Language Models

Recent advances in deep learning research, such as transformers, have bolstered the ability for automated agents to generate creative texts similar to those that a human would write. By default, transformer decoders can only generate new…

Computation and Language · Computer Science 2022-12-21 Brian D. Zimmerman , Gaurav Sahu , Olga Vechtomova

FastMesh: Efficient Artistic Mesh Generation via Component Decoupling

Recent mesh generation approaches typically tokenize triangle meshes into sequences of tokens and train autoregressive models to generate these tokens sequentially. Despite substantial progress, such token sequences inevitably reuse…

Computer Vision and Pattern Recognition · Computer Science 2026-01-16 Jeonghwan Kim , Yushi Lan , Armando Fortes , Yongwei Chen , Xingang Pan

XSpecMesh: Quality-Preserving Auto-Regressive Mesh Generation Acceleration via Multi-Head Speculative Decoding

Current auto-regressive models can generate high-quality, topologically precise meshes; however, they necessitate thousands-or even tens of thousands-of next-token predictions during inference, resulting in substantial latency. We introduce…

Graphics · Computer Science 2025-08-07 Dian Chen , Yansong Qu , Xinyang Li , Ming Li , Shengchuan Zhang

Temporal FiLM: Capturing Long-Range Sequence Dependencies with Feature-Wise Modulations

Learning representations that accurately capture long-range dependencies in sequential inputs -- including text, audio, and genomic data -- is a key problem in deep learning. Feed-forward convolutional models capture only feature…

Machine Learning · Computer Science 2021-04-23 Sawyer Birnbaum , Volodymyr Kuleshov , Zayd Enam , Pang Wei Koh , Stefano Ermon

FuncGenFoil: Airfoil Generation and Editing Model in Function Space

Aircraft manufacturing is the jewel in the crown of industry, in which generating high-fidelity airfoil geometries with controllable and editable representations remains a fundamental challenge. Existing deep learning methods, which…

Machine Learning · Computer Science 2025-12-15 Jinouwen Zhang , Junjie Ren , Qianhong Ma , Jianyu Wu , Aobo Yang , Yan Lu , Lu Chen , Hairun Xie , Jing Wang , Miao Zhang , Wanli Ouyang , Shixiang Tang

TempoMaster: Efficient Long Video Generation via Next-Frame-Rate Prediction

We present TempoMaster, a novel framework that formulates long video generation as next-frame-rate prediction. Specifically, we first generate a low-frame-rate clip that serves as a coarse blueprint of the entire video sequence, and then…

Computer Vision and Pattern Recognition · Computer Science 2025-12-03 Yukuo Ma , Cong Liu , Junke Wang , Junqi Liu , Haibin Huang , Zuxuan Wu , Chi Zhang , Xuelong Li

Neural Approximation of an Auto-Regressive Process through Confidence Guided Sampling

We propose a generic confidence-based approximation that can be plugged in and simplify the auto-regressive generation process with a proved convergence. We first assume that the priors of future samples can be generated in an independently…

Machine Learning · Computer Science 2019-10-16 YoungJoon Yoo , Sanghyuk Chun , Sangdoo Yun , Jung-Woo Ha , Jaejun Yoo

BIGFix: Bidirectional Image Generation with Token Fixing

Recent advances in image and video generation have raised significant interest from both academia and industry. A key challenge in this field is improving inference efficiency, as model size and the number of inference steps directly impact…

Computer Vision and Pattern Recognition · Computer Science 2025-10-15 Victor Besnier , David Hurych , Andrei Bursuc , Eduardo Valle

Your LLM Knows the Future: Uncovering Its Multi-Token Prediction Potential

Autoregressive language models are constrained by their inherently sequential nature, generating one token at a time. This paradigm limits inference speed and parallelism, especially during later stages of generation when the direction and…

Computation and Language · Computer Science 2025-07-17 Mohammad Samragh , Arnav Kundu , David Harrison , Kumari Nishu , Devang Naik , Minsik Cho , Mehrdad Farajtabar