Related papers: DINOISER: Diffused Conditional Sequence Learning b…

Empowering Diffusion Models on the Embedding Space for Text Generation

Diffusion models have achieved state-of-the-art synthesis quality on both visual and audio tasks, and recent works further adapt them to textual data by diffusing on the embedding space. In this paper, we conduct systematic studies of the…

Computation and Language · Computer Science 2024-04-23 Zhujin Gao, Junliang Guo, Xu Tan, Yongxin Zhu, Fang Zhang, Jiang Bian, Linli Xu

Your Diffusion Model is Secretly a Noise Classifier and Benefits from Contrastive Training

Diffusion models learn to denoise data and the trained denoiser is then used to generate new samples from the data distribution. In this paper, we revisit the diffusion sampling process and identify a fundamental cause of sample quality…

Machine Learning · Computer Science 2024-11-05 Yunshu Wu, Yingtao Luo, Xianghao Kong, Evangelos E. Papalexakis, Greg Ver Steeg

Noise Estimation for Generative Diffusion Models

Generative diffusion models have emerged as leading models in speech and image generation. However, in order to perform well with a small number of denoising steps, a costly tuning of the set of noise parameters is needed. In this work, we…

Machine Learning · Computer Science 2021-09-14 Robin San-Roman, Eliya Nachmani, Lior Wolf

Simple Guidance Mechanisms for Discrete Diffusion Models

Diffusion models for continuous data gained widespread adoption owing to their high quality generation and control mechanisms. However, controllable diffusion on discrete data faces challenges given that continuous guidance methods do not…

Machine Learning · Computer Science 2025-05-29 Yair Schiff, Subham Sekhar Sahoo, Hao Phung, Guanghan Wang, Sam Boshar, Hugo Dalla-torre, Bernardo P. de Almeida, Alexander Rush, Thomas Pierrot, Volodymyr Kuleshov

SeqDiffuSeq: Text Diffusion with Encoder-Decoder Transformers

Diffusion model, a new generative modelling paradigm, has achieved great success in image, audio, and video generation. However, considering the discrete categorical nature of text, it is not trivial to extend continuous diffusion models to…

Computation and Language · Computer Science 2023-05-23 Hongyi Yuan, Zheng Yuan, Chuanqi Tan, Fei Huang, Songfang Huang

Self-diffusion for Solving Inverse Problems

We propose self-diffusion, a novel framework for solving inverse problems without relying on pretrained generative models. Traditional diffusion-based approaches require training a model on a clean dataset to learn to reverse the forward…

Machine Learning · Computer Science 2025-12-09 Guanxiong Luo, Shoujin Huang, Yanlong Yang

Shaping Inductive Bias in Diffusion Models through Frequency-Based Noise Control

Diffusion Probabilistic Models (DPMs) are powerful generative models that have achieved unparalleled success in a number of generative tasks. In this work, we aim to build inductive biases into the training and sampling of diffusion models…

Machine Learning · Computer Science 2025-03-14 Thomas Jiralerspong, Berton Earnshaw, Jason Hartford, Yoshua Bengio, Luca Scimeca

DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents

Diffusion models (DMs) have revolutionized generative learning. They utilize a diffusion process to encode data into a simple Gaussian distribution. However, encoding a complex, potentially multimodal data distribution into a single…

Machine Learning · Computer Science 2024-07-04 Yilun Xu, Gabriele Corso, Tommi Jaakkola, Arash Vahdat, Karsten Kreis

Realistic Noise Synthesis with Diffusion Models

Deep denoising models require extensive real-world training data, which is challenging to acquire. Current noise synthesis techniques struggle to accurately model complex noise distributions. We propose a novel Realistic Noise Synthesis…

Computer Vision and Pattern Recognition · Computer Science 2025-01-03 Qi Wu, Mingyan Han, Ting Jiang, Chengzhi Jiang, Jinting Luo, Man Jiang, Haoqiang Fan, Shuaicheng Liu

Conditional Denoising Diffusion for Sequential Recommendation

Generative models have attracted significant interest due to their ability to handle uncertainty by learning the inherent data distributions. However, two prominent generative models, namely Generative Adversarial Networks (GANs) and…

Machine Learning · Computer Science 2023-04-25 Yu Wang, Zhiwei Liu, Liangwei Yang, Philip S. Yu

Enhancing Sample Generation of Diffusion Models using Noise Level Correction

The denoising process of diffusion models can be interpreted as an approximate projection of noisy samples onto the data manifold. Moreover, the noise level in these samples approximates their distance to the underlying manifold. Building…

Computer Vision and Pattern Recognition · Computer Science 2025-06-03 Abulikemu Abuduweili, Chenyang Yuan, Changliu Liu, Frank Permenter

DyGen: Learning from Noisy Labels via Dynamics-Enhanced Generative Modeling

Learning from noisy labels is a challenge that arises in many real-world applications where training data can contain incorrect or corrupted labels. When fine-tuning language models with noisy labels, models can easily overfit the label…

Computation and Language · Computer Science 2023-06-14 Yuchen Zhuang, Yue Yu, Lingkai Kong, Xiang Chen, Chao Zhang

Denoising Score Distillation: From Noisy Diffusion Pretraining to One-Step High-Quality Generation

Diffusion models have achieved remarkable success in generating high-resolution, realistic images across diverse natural distributions. However, their performance heavily relies on high-quality training data, making it challenging to learn…

Machine Learning · Computer Science 2025-05-22 Tianyu Chen, Yasi Zhang, Zhendong Wang, Ying Nian Wu, Oscar Leong, Mingyuan Zhou

Conditional Generation from Unconditional Diffusion Models using Denoiser Representations

Denoising diffusion models have gained popularity as a generative modeling technique for producing high-quality and diverse images. Applying these models to downstream tasks requires conditioning, which can take the form of text, class…

Computer Vision and Pattern Recognition · Computer Science 2023-06-06 Alexandros Graikos, Srikar Yellapragada, Dimitris Samaras

Towards Faster Training of Diffusion Models: An Inspiration of A Consistency Phenomenon

Diffusion models (DMs) are a powerful generative framework that have attracted significant attention in recent years. However, the high computational cost of training DMs limits their practical applications. In this paper, we start with a…

Machine Learning · Computer Science 2024-04-12 Tianshuo Xu, Peng Mi, Ruilin Wang, Yingcong Chen

FastDiSS: Few-step Match Many-step Diffusion Language Model on Sequence-to-Sequence Generation--Full Version

Self-conditioning has been central to the success of continuous diffusion language models, as it allows models to correct previous errors. Yet its ability degrades precisely in the regime where diffusion is most attractive for deployment:…

Computation and Language · Computer Science 2026-04-08 Dat Nguyen-Cong, Tung Kieu, Hoang Thanh-Tung

Analyzing Diffusion as Serial Reproduction

Diffusion models are a class of generative models that learn to synthesize samples by inverting a diffusion process that gradually maps data into noise. While these models have enjoyed great success recently, a full theoretical…

Machine Learning · Computer Science 2023-09-22 Raja Marjieh, Ilia Sucholutsky, Thomas A. Langlois, Nori Jacoby, Thomas L. Griffiths

On Analyzing Generative and Denoising Capabilities of Diffusion-based Deep Generative Models

Diffusion-based Deep Generative Models (DDGMs) offer state-of-the-art performance in generative modeling. Their main strength comes from their unique setup in which a model (the backward diffusion process) is trained to reverse the forward…

Machine Learning · Computer Science 2022-06-02 Kamil Deja, Anna Kuzina, Tomasz Trzciński, Jakub M. Tomczak

DiffSED: Sound Event Detection with Denoising Diffusion

Sound Event Detection (SED) aims to predict the temporal boundaries of all the events of interest and their class labels, given an unconstrained audio sample. Taking either the split-and-classify (i.e., frame-level) strategy or the more…

Sound · Computer Science 2023-08-21 Swapnil Bhosale, Sauradip Nag, Diptesh Kanojia, Jiankang Deng, Xiatian Zhu

Unifying Continuous and Discrete Text Diffusion with Non-simultaneous Diffusion Processes

Diffusion models have emerged as a promising approach for text generation, with recent works falling into two main categories: discrete and continuous diffusion models. Discrete diffusion models apply token corruption independently using…

Computation and Language · Computer Science 2025-05-29 Bocheng Li, Zhujin Gao, Linli Xu