English
Related papers

Related papers: CAFLOW: Conditional Autoregressive Flows

200 papers

Flow-based generative models show great potential in image synthesis due to its reversible pipeline and exact log-likelihood target, yet it suffers from weak ability for conditional image synthesis, especially for multi-label or unaware…

Computer Vision and Pattern Recognition · Computer Science 2019-04-04 Rui Liu , Yu Liu , Xinyu Gong , Xiaogang Wang , Hongsheng Li

Flow models are effective at progressively generating realistic images, but they generally struggle to capture long-range dependencies during the generation process as they compress all the information from previous time steps into a single…

Computer Vision and Pattern Recognition · Computer Science 2025-06-17 Mude Hui , Rui-Jie Zhu , Songlin Yang , Yu Zhang , Zirui Wang , Yuyin Zhou , Jason Eshraghian , Cihang Xie

In digital pathology, whole-slide images routinely exceed gigapixel resolution, making computationally intensive generative super-resolution (SR) impractical for routine deployment. We introduce CAFlow, an adaptive-depth single-step…

Computer Vision and Pattern Recognition · Computer Science 2026-03-20 Elad Yoshai , Ariel D. Yoshai , Natan T. Shaked

Image-to-image (I2I) translation is a challenging topic in computer vision. We divide this problem into three tasks: strongly constrained translation, normally constrained translation, and weakly constrained translation. The constraint here…

Computer Vision and Pattern Recognition · Computer Science 2022-07-06 Weichen Fan , Jinghuan Chen , Jiabin Ma , Jun Hou , Shuai Yi

The framework of normalizing flows provides a general strategy for flexible variational inference of posteriors over latent variables. We propose a new type of normalizing flow, inverse autoregressive flow (IAF), that, in contrast to…

Machine Learning · Computer Science 2017-02-01 Diederik P. Kingma , Tim Salimans , Rafal Jozefowicz , Xi Chen , Ilya Sutskever , Max Welling

Many image-to-image translation problems are ambiguous, as a single input image may correspond to multiple possible outputs. In this work, we aim to model a \emph{distribution} of possible outputs in a conditional generative modeling…

Computer Vision and Pattern Recognition · Computer Science 2018-10-25 Jun-Yan Zhu , Richard Zhang , Deepak Pathak , Trevor Darrell , Alexei A. Efros , Oliver Wang , Eli Shechtman

Normalizing Flows (NFs) are likelihood-based models for continuous inputs. They have demonstrated promising results on both density estimation and generative modeling tasks, but have received relatively little attention in recent years. In…

Computer Vision and Pattern Recognition · Computer Science 2025-06-09 Shuangfei Zhai , Ruixiang Zhang , Preetum Nakkiran , David Berthelot , Jiatao Gu , Huangjie Zheng , Tianrong Chen , Miguel Angel Bautista , Navdeep Jaitly , Josh Susskind

Super-resolution is an ill-posed problem, since it allows for multiple predictions for a given low-resolution image. This fundamental fact is largely ignored by state-of-the-art deep learning based approaches. These methods instead train a…

Computer Vision and Pattern Recognition · Computer Science 2020-08-03 Andreas Lugmayr , Martin Danelljan , Luc Van Gool , Radu Timofte

To enhance low-light images to normally-exposed ones is highly ill-posed, namely that the mapping relationship between them is one-to-many. Previous works based on the pixel-wise reconstruction losses and deterministic processes fail to…

Image and Video Processing · Electrical Eng. & Systems 2021-09-14 Yufei Wang , Renjie Wan , Wenhan Yang , Haoliang Li , Lap-Pui Chau , Alex C. Kot

We present STARFlow, a scalable generative model based on normalizing flows that achieves strong performance in high-resolution image synthesis. The core of STARFlow is Transformer Autoregressive Flow (TARFlow), which combines the…

Computer Vision and Pattern Recognition · Computer Science 2025-06-10 Jiatao Gu , Tianrong Chen , David Berthelot , Huangjie Zheng , Yuyang Wang , Ruixiang Zhang , Laurent Dinh , Miguel Angel Bautista , Josh Susskind , Shuangfei Zhai

Diffusion models have emerged as a powerful tool for generating high-quality images from textual descriptions. Despite their successes, these models often exhibit limited diversity in the sampled images, particularly when sampling with a…

Computer Vision and Pattern Recognition · Computer Science 2024-06-03 Jiatao Gu , Ying Shen , Shuangfei Zhai , Yizhe Zhang , Navdeep Jaitly , Joshua M. Susskind

Image reconstruction from computed tomography (CT) measurement is a challenging statistical inverse problem since a high-dimensional conditional distribution needs to be estimated. Based on training data obtained from high-quality…

Image and Video Processing · Electrical Eng. & Systems 2020-06-12 Alexander Denker , Maximilian Schmidt , Johannes Leuschner , Peter Maass , Jens Behrmann

Flow-based generative models have highly desirable properties like exact log-likelihood evaluation and exact latent-variable inference, however they are still in their infancy and have not received as much attention as alternative…

Computer Vision and Pattern Recognition · Computer Science 2020-04-06 Albert Pumarola , Stefan Popov , Francesc Moreno-Noguer , Vittorio Ferrari

Autoregressive sequence models achieve state-of-the-art performance in domains like machine translation. However, due to the autoregressive factorization nature, these models suffer from heavy latency during inference. Recently,…

Machine Learning · Computer Science 2020-01-10 Zhiqing Sun , Zhuohan Li , Haoqing Wang , Zi Lin , Di He , Zhi-Hong Deng

Normalizing Flows (NFs) are a class of generative models distinguished by a mathematically invertible architecture, where the forward pass transforms data into a latent space for density estimation, and the reverse pass generates new…

Computer Vision and Pattern Recognition · Computer Science 2025-12-05 Yang Chen , Xiaowei Xu , Shuai Wang , Chenhui Zhu , Ruxue Wen , Xubin Li , Tiezheng Ge , Limin Wang

Normalizing flows have recently demonstrated promising results for low-level vision tasks. For image super-resolution (SR), it learns to predict diverse photo-realistic high-resolution (HR) images from the low-resolution (LR) image rather…

Image and Video Processing · Electrical Eng. & Systems 2021-08-29 Jingyun Liang , Andreas Lugmayr , Kai Zhang , Martin Danelljan , Luc Van Gool , Radu Timofte

Autoregressive models have driven remarkable progress in language modeling. Their foundational reliance on discrete tokens, unidirectional context, and single-pass decoding, while central to their success, also inspires the exploration of a…

Rectified flow and reflow procedures have significantly advanced fast generation by progressively straightening ordinary differential equation (ODE) flows. They operate under the assumption that image and noise pairs, known as couplings,…

Machine Learning · Computer Science 2024-11-04 Dogyun Park , Sojin Lee , Sihyeon Kim , Taehoon Lee , Youngjoon Hong , Hyunwoo J. Kim

Flow matching models have emerged as a powerful framework for realistic image generation by learning to reverse a corruption process that progressively adds Gaussian noise. However, because noise is injected in the latent domain, its impact…

Computer Vision and Pattern Recognition · Computer Science 2026-04-20 Sucheng Ren , Qihang Yu , Ju He , Xiaohui Shen , Alan Yuille , Liang-Chieh Chen

Autoregressive transformers have recently shown impressive image generation quality and efficiency on par with state-of-the-art diffusion models. Unlike diffusion architectures, autoregressive models can naturally incorporate arbitrary…

Computer Vision and Pattern Recognition · Computer Science 2025-05-20 Yixiao Chen , Zhiyuan Ma , Guoli Jia , Che Jiang , Jianjun Li , Bowen Zhou
‹ Prev 1 2 3 10 Next ›