Related papers: Conditional Autoregressors are Interpretable Class…

Hierarchical Autoregressive Image Models with Auxiliary Decoders

Autoregressive generative models of images tend to be biased towards capturing local structure, and as a result they often produce samples which are lacking in terms of large-scale coherence. To address this, we propose two methods to learn…

Computer Vision and Pattern Recognition · Computer Science 2019-10-09 Jeffrey De Fauw, Sander Dieleman, Karen Simonyan

Autoregressive Score Matching

Autoregressive models use the chain rule to define a joint probability distribution as a product of conditionals. These conditionals need to be normalized, imposing constraints on the functional families that can be used. To increase…

Machine Learning · Computer Science 2020-10-27 Chenlin Meng, Lantao Yu, Yang Song, Jiaming Song, Stefano Ermon

Hidden Biases in Conditioning Autoregressive Models

Large language and music models are increasingly used for constrained generation: rhyming lines, fixed meter, inpainting or infilling, positional endings, and other global form requirements. These systems often perform strikingly well, but…

Artificial Intelligence · Computer Science 2026-04-10 Francois Pachet, Pierre Roy

Autoregressive Conditional Neural Processes

Conditional neural processes (CNPs; Garnelo et al., 2018a) are attractive meta-learning models which produce well-calibrated predictions and are trainable via a simple maximum likelihood procedure. Although CNPs have many advantages, they…

Machine Learning · Statistics 2023-03-28 Wessel P. Bruinsma, Stratis Markou, James Requeima, Andrew Y. K. Foong, Tom R. Andersson, Anna Vaughan, Anthony Buonomo, J. Scott Hosking, Richard E. Turner

Revisiting Autoregressive Models for Generative Image Classification

Class-conditional generative models have emerged as accurate and robust classifiers, with diffusion models demonstrating clear advantages over other visual generative paradigms, including autoregressive (AR) models. In this work, we revisit…

Computer Vision and Pattern Recognition · Computer Science 2026-03-20 Ilia Sudakov, Artem Babenko, Dmitry Baranchuk

Teaching Categories to Human Learners with Visual Explanations

We study the problem of computer-assisted teaching with explanations. Conventional approaches for machine teaching typically only provide feedback at the instance level, e.g., the category or label of the instance. However, it is intuitive…

Computer Vision and Pattern Recognition · Computer Science 2018-02-21 Oisin Mac Aodha, Shihan Su, Yuxin Chen, Pietro Perona, Yisong Yue

Context-Aware Autoregressive Models for Multi-Conditional Image Generation

Autoregressive transformers have recently shown impressive image generation quality and efficiency on par with state-of-the-art diffusion models. Unlike diffusion architectures, autoregressive models can naturally incorporate arbitrary…

Computer Vision and Pattern Recognition · Computer Science 2025-05-20 Yixiao Chen, Zhiyuan Ma, Guoli Jia, Che Jiang, Jianjun Li, Bowen Zhou

Surprisal-Triggered Conditional Computation with Neural Networks

Autoregressive neural network models have been used successfully for sequence generation, feature extraction, and hypothesis scoring. This paper presents yet another use for these models: allocating more computation to more difficult…

Machine Learning · Computer Science 2020-06-03 Loren Lugosch, Derek Nowrouzezahrai, Brett H. Meyer

Semantic Context Matters: Improving Conditioning for Autoregressive Models

Recently, autoregressive (AR) models have shown strong potential in image generation, offering better scalability and easier integration with unified multi-modal systems compared to diffusion-based methods. However, extending AR models to…

Computer Vision and Pattern Recognition · Computer Science 2026-03-17 Dongyang Jin, Ryan Xu, Jianhao Zeng, Rui Lan, Yancheng Bai, Lei Sun, Xiangxiang Chu

Non-Autoregressive Translation by Learning Target Categorical Codes

Non-autoregressive Transformer is a promising text generation model. However, current non-autoregressive models still fall behind their autoregressive counterparts in translation quality. We attribute this accuracy gap to the lack of…

Computation and Language · Computer Science 2021-03-23 Yu Bao, Shujian Huang, Tong Xiao, Dongqi Wang, Xinyu Dai, Jiajun Chen

Learning to Learn in Interactive Constraint Acquisition

Constraint Programming (CP) has been successfully used to model and solve complex combinatorial problems. However, modeling is often not trivial and requires expertise, which is a bottleneck to wider adoption. In Constraint Acquisition…

Artificial Intelligence · Computer Science 2023-12-19 Dimos Tsouros, Senne Berden, Tias Guns

Text-to-Image Generation with Attention Based Recurrent Neural Networks

Conditional image modeling based on textual descriptions is a relatively new domain in unsupervised learning. Previous approaches use a latent variable model and generative adversarial networks. While the former are approximated by using…

Computer Vision and Pattern Recognition · Computer Science 2020-01-22 Tehseen Zia, Shahan Arif, Shakeeb Murtaza, Mirza Ahsan Ullah

An Additive Instance-Wise Approach to Multi-class Model Interpretation

Interpretable machine learning offers insights into what factors drive a certain prediction of a black-box system. A large number of interpreting methods focus on identifying explanatory input features, which generally fall into two main…

Machine Learning · Computer Science 2023-06-02 Vy Vo, Van Nguyen, Trung Le, Quan Hung Tran, Gholamreza Haffari, Seyit Camtepe, Dinh Phung

Compositional Capabilities of Autoregressive Transformers: A Study on Synthetic, Interpretable Tasks

Transformers trained on huge text corpora exhibit a remarkable set of capabilities, e.g., performing basic arithmetic. Given the inherent compositional nature of language, one can expect the model to learn to compose these capabilities,…

Machine Learning · Computer Science 2024-02-07 Rahul Ramesh, Ekdeep Singh Lubana, Mikail Khona, Robert P. Dick, Hidenori Tanaka

Auxiliary Guided Autoregressive Variational Autoencoders

Generative modeling of high-dimensional data is a key problem in machine learning. Successful approaches include latent variable models and autoregressive models. The complementary strengths of these approaches, to model global and local…

Computer Vision and Pattern Recognition · Computer Science 2019-04-19 Thomas Lucas, Jakob Verbeek

Image Decomposition and Classification through a Generative Model

We demonstrate in this paper that a generative model can be designed to perform classification tasks under challenging settings, including adversarial attacks and input distribution shifts. Specifically, we propose a conditional variational…

Computer Vision and Pattern Recognition · Computer Science 2019-02-12 Houpu Yao, Malcolm Regan, Yezhou Yang, Yi Ren

Autoregressive Knowledge Distillation through Imitation Learning

The performance of autoregressive models on natural language generation tasks has dramatically improved due to the adoption of deep, self-attentive architectures. However, these gains have come at the cost of hindering inference speed,…

Computation and Language · Computer Science 2020-10-30 Alexander Lin, Jeremy Wohlwend, Howard Chen, Tao Lei

Probability Series Expansion Classifier that is Interpretable by Design

This work presents a new classifier that is specifically designed to be fully interpretable. This technique determines the probability of a class outcome, based directly on probability assignments measured from the training data. The…

Machine Learning · Statistics 2017-10-31 Sapan Agarwal, Corey M. Hudson

Deterministic Non-Autoregressive Neural Sequence Modeling by Iterative Refinement

We propose a conditional non-autoregressive neural sequence model based on iterative refinement. The proposed model is designed based on the principles of latent variable models and denoising autoencoders, and is generally applicable to any…

Machine Learning · Computer Science 2018-08-29 Jason Lee, Elman Mansimov, Kyunghyun Cho

Is In-Context Learning Learning?

In-context learning (ICL) allows some autoregressive models to solve tasks via next-token prediction and without needing further training. This has led to claims about these models' ability to solve (learn) unseen tasks with only a few…

Computation and Language · Computer Science 2026-02-12 Adrian de Wynter