Related papers: Analog Bits: Generating Discrete Data using Diffus…

Discrete Modeling via Boundary Conditional Diffusion Processes

We present an novel framework for efficiently and effectively extending the powerful continuous diffusion processes to discrete modeling. Previous approaches have suffered from the discrepancy between discrete data and continuous modeling.…

Machine Learning · Computer Science 2024-10-31 Yuxuan Gu , Xiaocheng Feng , Lei Huang , Yingsheng Wu , Zekun Zhou , Weihong Zhong , Kun Zhu , Bing Qin

DiffCap: Exploring Continuous Diffusion on Image Captioning

Current image captioning works usually focus on generating descriptions in an autoregressive manner. However, there are limited works that focus on generating descriptions non-autoregressively, which brings more decoding diversity. Inspired…

Computer Vision and Pattern Recognition · Computer Science 2023-05-23 Yufeng He , Zefan Cai , Xu Gan , Baobao Chang

DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents

Diffusion models (DMs) have revolutionized generative learning. They utilize a diffusion process to encode data into a simple Gaussian distribution. However, encoding a complex, potentially multimodal data distribution into a single…

Machine Learning · Computer Science 2024-07-04 Yilun Xu , Gabriele Corso , Tommi Jaakkola , Arash Vahdat , Karsten Kreis

Self-conditioned Embedding Diffusion for Text Generation

Can continuous diffusion models bring the same performance breakthrough on natural language they did for image generation? To circumvent the discrete nature of text data, we can simply project tokens in a continuous space of embeddings, as…

Computation and Language · Computer Science 2022-11-09 Robin Strudel , Corentin Tallec , Florent Altché , Yilun Du , Yaroslav Ganin , Arthur Mensch , Will Grathwohl , Nikolay Savinov , Sander Dieleman , Laurent Sifre , Rémi Leblond

Discrete Diffusion Models for Language Generation

Diffusion models have emerged as a powerful class of generative models, achieving state-of-the-art results in continuous data domains such as image and video generation. Their core mechanism involves a forward diffusion process that…

Computation and Language · Computer Science 2025-07-10 Ashen Weligalle

Diffuse Everything: Multimodal Diffusion Models on Arbitrary State Spaces

Diffusion models have demonstrated remarkable performance in generating unimodal data across various tasks, including image, video, and text generation. On the contrary, the joint generation of multimodal data through diffusion models is…

Machine Learning · Computer Science 2025-06-16 Kevin Rojas , Yuchen Zhu , Sichen Zhu , Felix X. -F. Ye , Molei Tao

LayoutDiffusion: Improving Graphic Layout Generation by Discrete Diffusion Probabilistic Models

Creating graphic layouts is a fundamental step in graphic designs. In this work, we present a novel generative model named LayoutDiffusion for automatic layout generation. As layout is typically represented as a sequence of discrete tokens,…

Computer Vision and Pattern Recognition · Computer Science 2023-08-16 Junyi Zhang , Jiaqi Guo , Shizhao Sun , Jian-Guang Lou , Dongmei Zhang

D2C: Unlocking the Potential of Continuous Autoregressive Image Generation with Discrete Tokens

In the domain of image generation, latent-based generative models occupy a dominant status; however, these models rely heavily on image tokenizer. To meet modeling requirements, autoregressive models possessing the characteristics of…

Computer Vision and Pattern Recognition · Computer Science 2025-03-24 Panpan Wang , Liqiang Niu , Fandong Meng , Jinan Xu , Yufeng Chen , Jie Zhou

Progressive Deblurring of Diffusion Models for Coarse-to-Fine Image Synthesis

Recently, diffusion models have shown remarkable results in image synthesis by gradually removing noise and amplifying signals. Although the simple generative process surprisingly works well, is this the best way to generate image data? For…

Computer Vision and Pattern Recognition · Computer Science 2022-11-22 Sangyun Lee , Hyungjin Chung , Jaehyeon Kim , Jong Chul Ye

Simple Guidance Mechanisms for Discrete Diffusion Models

Diffusion models for continuous data gained widespread adoption owing to their high quality generation and control mechanisms. However, controllable diffusion on discrete data faces challenges given that continuous guidance methods do not…

Machine Learning · Computer Science 2025-05-29 Yair Schiff , Subham Sekhar Sahoo , Hao Phung , Guanghan Wang , Sam Boshar , Hugo Dalla-torre , Bernardo P. de Almeida , Alexander Rush , Thomas Pierrot , Volodymyr Kuleshov

Diffusion Self-Distillation for Zero-Shot Customized Image Generation

Text-to-image diffusion models produce impressive results but are frustrating tools for artists who desire fine-grained control. For example, a common use case is to create images of a specific instance in novel contexts, i.e.,…

Computer Vision and Pattern Recognition · Computer Science 2024-11-28 Shengqu Cai , Eric Chan , Yunzhi Zhang , Leonidas Guibas , Jiajun Wu , Gordon Wetzstein

Dynamical Regimes of Discrete Diffusion Models

Diffusion models generate high-dimensional data such as images by learning a process that gradually removes noise from corrupted data. Recent studies have shown that the backward dynamics of diffusion models exhibit two characteristic…

Statistical Mechanics · Physics 2026-04-14 Tomoei Takahashi , Takashi Takahashi , Yoshiyuki Kabashima

Distillation of Discrete Diffusion through Dimensional Correlations

Diffusion models have demonstrated exceptional performances in various fields of generative modeling, but suffer from slow sampling speed due to their iterative nature. While this issue is being addressed in continuous domains, discrete…

Machine Learning · Computer Science 2025-05-12 Satoshi Hayakawa , Yuhta Takida , Masaaki Imaizumi , Hiromi Wakaki , Yuki Mitsufuji

Towards Controllable Image Generation through Representation-Conditioned Diffusion Models

Diffusion models have emerged as powerful tools for high-quality image generation and editing, but guiding these models to produce specific outputs remains a challenge. Conventional approaches rely on conditioning mechanisms, such as text…

Computer Vision and Pattern Recognition · Computer Science 2026-05-27 Nithesh Chandher Karthikeyan , Jonas Unger , Gabriel Eilertsen

Efficient Dataset Distillation via Minimax Diffusion

Dataset distillation reduces the storage and computational consumption of training a network by generating a small surrogate dataset that encapsulates rich information of the original large-scale one. However, previous distillation methods…

Computer Vision and Pattern Recognition · Computer Science 2024-03-26 Jianyang Gu , Saeed Vahidian , Vyacheslav Kungurtsev , Haonan Wang , Wei Jiang , Yang You , Yiran Chen

Generative Modelling With Inverse Heat Dissipation

While diffusion models have shown great success in image generation, their noise-inverting generative process does not explicitly consider the structure of images, such as their inherent multi-scale nature. Inspired by diffusion models and…

Computer Vision and Pattern Recognition · Computer Science 2023-04-14 Severi Rissanen , Markus Heinonen , Arno Solin

CADC: Content Adaptive Diffusion-Based Generative Image Compression

Diffusion-based generative image compression has demonstrated remarkable potential for achieving realistic reconstruction at ultra-low bitrates. The key to unlocking this potential lies in making the entire compression process…

Computer Vision and Pattern Recognition · Computer Science 2026-03-26 Xihua Sheng , Lingyu Zhu , Tianyu Zhang , Dong Liu , Shiqi Wang , Jing Wang

SonicDiffusion: Audio-Driven Image Generation and Editing with Pretrained Diffusion Models

We are witnessing a revolution in conditional image synthesis with the recent success of large scale text-to-image generation methods. This success also opens up new opportunities in controlling the generation and editing process using…

Computer Vision and Pattern Recognition · Computer Science 2024-05-03 Burak Can Biner , Farrin Marouf Sofian , Umur Berkay Karakaş , Duygu Ceylan , Erkut Erdem , Aykut Erdem

Discrete Copula Diffusion

Discrete diffusion models have recently shown significant progress in modeling complex data, such as natural languages and DNA sequences. However, unlike diffusion models for continuous data, which can generate high-quality samples in just…

Machine Learning · Computer Science 2025-03-20 Anji Liu , Oliver Broadrick , Mathias Niepert , Guy Van den Broeck

Fast Diffusion GAN Model for Symbolic Music Generation Controlled by Emotions

Diffusion models have shown promising results for a wide range of generative tasks with continuous data, such as image and audio synthesis. However, little progress has been made on using diffusion models to generate discrete symbolic music…

Sound · Computer Science 2023-10-24 Jincheng Zhang , György Fazekas , Charalampos Saitis