English
Related papers

Related papers: FlowSeq: Non-Autoregressive Conditional Sequence G…

200 papers

Reaction prediction is a fundamental problem in computational chemistry. Existing approaches typically generate a chemical reaction by sampling tokens or graph edits sequentially, conditioning on previously generated outputs. These…

Chemical Physics · Physics 2021-02-08 Hangrui Bi , Hengyi Wang , Chence Shi , Jian Tang

Stochastic processes generated by non-stationary distributions are difficult to represent with conventional models such as Gaussian processes. This work presents Recurrent Autoregressive Flows as a method toward general stochastic process…

Machine Learning · Computer Science 2020-06-20 John Mern , Peter Morales , Mykel J. Kochenderfer

Normalizing flows are a powerful class of generative models for continuous random variables, showing both strong model flexibility and the potential for non-autoregressive generation. These benefits are also desired when modeling discrete…

Machine Learning · Statistics 2019-06-06 Zachary M. Ziegler , Alexander M. Rush

Transformer-based models have made tremendous impacts in natural language generation. However the inference speed is a bottleneck due to large model size and intensive computing involved in auto-regressive decoding process. We develop…

Computation and Language · Computer Science 2021-07-14 Yu Yan , Fei Hu , Jiusheng Chen , Nikhil Bhendawade , Ting Ye , Yeyun Gong , Nan Duan , Desheng Cui , Bingyu Chi , Ruofei Zhang

Autoregressive and diffusion models drive the recent breakthroughs on text-to-image generation. Despite their huge success of generating high-realistic images, a common shortcoming of these models is their high inference latency -…

Computer Vision and Pattern Recognition · Computer Science 2023-12-27 Zhangyin Feng , Runyi Hu , Liangxin Liu , Fan Zhang , Duyu Tang , Yong Dai , Xiaocheng Feng , Jiwei Li , Bing Qin , Shuming Shi

Sequence-to-sequence vision-language models are showing promise, but their applicability is limited by their inference latency due to their autoregressive way of generating predictions. We propose a parallel decoding sequence-to-sequence…

Computer Vision and Pattern Recognition · Computer Science 2025-03-14 Kunyu Shi , Qi Dong , Luis Goncalves , Zhuowen Tu , Stefano Soatto

Autoregressive models, often built on Transformer architectures, represent a powerful paradigm for generating ultra-long videos by synthesizing content in sequential chunks. However, this sequential generation process is notoriously slow.…

Computer Vision and Pattern Recognition · Computer Science 2026-02-12 Yuexiao Ma , Xuzhe Zheng , Jing Xu , Xiwei Xu , Feng Ling , Xiawu Zheng , Huafeng Kuang , Huixia Li , Xing Wang , Xuefeng Xiao , Fei Chao , Rongrong Ji

Existing large language models have to run K times to generate a sequence of K tokens. In this paper, we present RecycleGPT, a generative language model with fast decoding speed by recycling pre-generated model states without running the…

Computation and Language · Computer Science 2024-05-24 Yufan Jiang , Qiaozhi He , Xiaomin Zhuang , Zhihua Wu , Kunpeng Wang , Wenlai Zhao , Guangwen Yang

Autoregressive sequence models achieve state-of-the-art performance in domains like machine translation. However, due to the autoregressive factorization nature, these models suffer from heavy latency during inference. Recently,…

Machine Learning · Computer Science 2020-01-10 Zhiqing Sun , Zhuohan Li , Haoqing Wang , Zi Lin , Di He , Zhi-Hong Deng

Recently, diffusion models have emerged as a new paradigm for generative models. Despite the success in domains using continuous signals such as vision and audio, adapting diffusion models to natural language is under-explored due to the…

Computation and Language · Computer Science 2023-02-15 Shansan Gong , Mukai Li , Jiangtao Feng , Zhiyong Wu , Lingpeng Kong

Developing a generative model of realistic order flow in financial markets is a challenging open problem, with numerous applications for market participants. Addressing this, we propose the first end-to-end autoregressive generative model…

Trading and Market Microstructure · Quantitative Finance 2023-09-06 Peer Nagy , Sascha Frey , Silvia Sapora , Kang Li , Anisoara Calinescu , Stefan Zohren , Jakob Foerster

We present significant extensions to diffusion-based sequence generation models, blurring the line with autoregressive language models. We introduce hyperschedules, which assign distinct noise schedules to individual token positions,…

Machine Learning · Computer Science 2025-10-08 Nima Fathi , Torsten Scholak , Pierre-André Noël

Generative models using neural network have opened a door to large-scale studies for various application domains, especially for studies that suffer from lack of real samples to obtain statistically robust inference. Typically, these…

Computer Vision and Pattern Recognition · Computer Science 2018-12-12 Seong Jae Hwang , Zirui Tao , Won Hwa Kim , Vikas Singh

Standard autoregressive seq2seq models are easily trained by max-likelihood, but tend to show poor results under small-data conditions. We introduce a class of seq2seq models, GAMs (Global Autoregressive Models), which combine an…

Machine Learning · Computer Science 2019-09-23 Tetiana Parshakova , Jean-Marc Andreoli , Marc Dymetman

Autoregressive models have emerged as a powerful approach for visual generation but suffer from slow inference speed due to their sequential token-by-token prediction process. In this paper, we propose a simple yet effective approach for…

Computer Vision and Pattern Recognition · Computer Science 2025-04-04 Yuqing Wang , Shuhuai Ren , Zhijie Lin , Yujin Han , Haoyuan Guo , Zhenheng Yang , Difan Zou , Jiashi Feng , Xihui Liu

Flow-based generative models are powerful exact likelihood models with efficient sampling and inference. Despite their computational efficiency, flow-based models generally have much worse density modeling performance compared to…

Machine Learning · Computer Science 2019-05-17 Jonathan Ho , Xi Chen , Aravind Srinivas , Yan Duan , Pieter Abbeel

We propose an approach for improving sequence modeling based on autoregressive normalizing flows. Each autoregressive transform, acting across time, serves as a moving frame of reference, removing temporal correlations, and simplifying the…

Machine Learning · Computer Science 2022-03-09 Joseph Marino , Lei Chen , Jiawei He , Stephan Mandt

Autoregressive (AR) models have been the dominating approach to conditional sequence generation, but are suffering from the issue of high inference latency. Non-autoregressive (NAR) models have been recently proposed to reduce the latency…

Machine Learning · Computer Science 2020-07-01 Zhiqing Sun , Yiming Yang

We propose a conditional non-autoregressive neural sequence model based on iterative refinement. The proposed model is designed based on the principles of latent variable models and denoising autoencoders, and is generally applicable to any…

Machine Learning · Computer Science 2018-08-29 Jason Lee , Elman Mansimov , Kyunghyun Cho

Semantic parsing using sequence-to-sequence models allows parsing of deeper representations compared to traditional word tagging based models. In spite of these advantages, widespread adoption of these models for real-time conversational…

Computation and Language · Computer Science 2021-04-13 Arun Babu , Akshat Shrivastava , Armen Aghajanyan , Ahmed Aly , Angela Fan , Marjan Ghazvininejad
‹ Prev 1 2 3 10 Next ›