Related papers: Diff-Instruct: A Universal Approach for Transferri…

Ultra-Fast Language Generation via Discrete Diffusion Divergence Instruct

Fast and high-quality language generation is the holy grail that people pursue in the age of AI. In this work, we introduce Discrete Diffusion Divergence Instruct (DiDi-Instruct), a training-based method that initializes from a pre-trained…

Computation and Language · Computer Science 2026-03-13 Haoyang Zheng , Xinyang Liu , Cindy Xiangrui Kong , Nan Jiang , Zheyuan Hu , Weijian Luo , Wei Deng , Guang Lin

DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture

Diffusion models (DMs) have demonstrated exceptional generative capabilities across various domains, including image, video, and so on. A key factor contributing to their effectiveness is the high quantity and quality of data used during…

Computer Vision and Pattern Recognition · Computer Science 2025-03-03 Qianlong Xiang , Miao Zhang , Yuzhang Shang , Jianlong Wu , Yan Yan , Liqiang Nie

Learnability-Guided Diffusion for Dataset Distillation

Training machine learning models on massive datasets is expensive and time-consuming. Dataset distillation addresses this by creating a small synthetic dataset that achieves the same performance as the full dataset. Recent methods use…

Computer Vision and Pattern Recognition · Computer Science 2026-04-02 Jeffrey A. Chan-Santiago , Mubarak Shah

A Diffusion Model Framework for Unsupervised Neural Combinatorial Optimization

Learning to sample from intractable distributions over discrete sets without relying on corresponding training data is a central problem in a wide range of fields, including Combinatorial Optimization. Currently, popular deep learning-based…

Machine Learning · Computer Science 2025-08-25 Sebastian Sanokowski , Sepp Hochreiter , Sebastian Lehner

Data-free Knowledge Distillation with Diffusion Models

Recently Data-Free Knowledge Distillation (DFKD) has garnered attention and can transfer knowledge from a teacher neural network to a student neural network without requiring any access to training data. Although diffusion models are adept…

Computer Vision and Pattern Recognition · Computer Science 2025-04-02 Xiaohua Qi , Renda Li , Long Peng , Qiang Ling , Jun Yu , Ziyi Chen , Peng Chang , Mei Han , Jing Xiao

Training Neural Samplers with Reverse Diffusive KL Divergence

Training generative models to sample from unnormalized density functions is an important and challenging task in machine learning. Traditional training methods often rely on the reverse Kullback-Leibler (KL) divergence due to its…

Machine Learning · Computer Science 2025-03-05 Jiajun He , Wenlin Chen , Mingtian Zhang , David Barber , José Miguel Hernández-Lobato

A Comprehensive Survey on Knowledge Distillation of Diffusion Models

Diffusion Models (DMs), also referred to as score-based diffusion models, utilize neural networks to specify score functions. Unlike most other probabilistic models, DMs directly model the score functions, which makes them more flexible to…

Machine Learning · Computer Science 2023-04-11 Weijian Luo

Deep Mutual Learning

Model distillation is an effective and widely used technique to transfer knowledge from a teacher to a student network. The typical application is to transfer from a powerful large network or ensemble to a small network, that is better…

Computer Vision and Pattern Recognition · Computer Science 2017-06-02 Ying Zhang , Tao Xiang , Timothy M. Hospedales , Huchuan Lu

Continual Learning of Diffusion Models with Generative Distillation

Diffusion models are powerful generative models that achieve state-of-the-art performance in image synthesis. However, training them demands substantial amounts of data and computational resources. Continual learning would allow for…

Machine Learning · Computer Science 2025-03-05 Sergi Masip , Pau Rodriguez , Tinne Tuytelaars , Gido M. van de Ven

DDIL: Diversity Enhancing Diffusion Distillation With Imitation Learning

Diffusion models excel at generative modeling (e.g., text-to-image) but sampling requires multiple denoising network passes, limiting practicality. Efforts such as progressive distillation or consistency distillation have shown promise by…

Machine Learning · Computer Science 2025-04-01 Risheek Garrepalli , Shweta Mahajan , Munawar Hayat , Fatih Porikli

Improving Discriminator Guidance in Diffusion Models

Discriminator Guidance has become a popular method for efficiently refining pre-trained Score-Matching Diffusion models. However, in this paper, we demonstrate that the standard implementation of this technique does not necessarily lead to…

Machine Learning · Computer Science 2025-06-12 Alexandre Verine , Ahmed Mehdi Inane , Florian Le Bronnec , Benjamin Negrevergne , Yann Chevaleyre

Insights into Closed-form IPM-GAN Discriminator Guidance for Diffusion Modeling

Diffusion models are a state-of-the-art generative modeling framework that transform noise to images via Langevin sampling, guided by the score, which is the gradient of the logarithm of the data distribution. Recent works have shown…

Machine Learning · Computer Science 2025-08-01 Aadithya Srikanth , Siddarth Asokan , Nishanth Shetty , Chandra Sekhar Seelamantula

Guided Transfer Learning for Discrete Diffusion Models

Discrete diffusion models (DMs) have achieved strong performance in language and other discrete domains, offering a compelling alternative to autoregressive modeling. Yet this performance typically depends on large training datasets,…

Machine Learning · Computer Science 2026-04-16 Julian Kleutgens , Claudio Battiloro , Lingkai Kong , Benjamin Grewe , Francesca Dominici , Mauricio Tec

On Distillation of Guided Diffusion Models

Classifier-free guided diffusion models have recently been shown to be highly effective at high-resolution image generation, and they have been widely used in large-scale diffusion frameworks including DALLE-2, Stable Diffusion and Imagen.…

Computer Vision and Pattern Recognition · Computer Science 2023-04-14 Chenlin Meng , Robin Rombach , Ruiqi Gao , Diederik P. Kingma , Stefano Ermon , Jonathan Ho , Tim Salimans

Pruning then Reweighting: Towards Data-Efficient Training of Diffusion Models

Despite the remarkable generation capabilities of Diffusion Models (DMs), conducting training and inference remains computationally expensive. Previous works have been devoted to accelerating diffusion sampling, but achieving data-efficient…

Computer Vision and Pattern Recognition · Computer Science 2024-10-03 Yize Li , Yihua Zhang , Sijia Liu , Xue Lin

Training Implicit Generative Models via an Invariant Statistical Loss

Implicit generative models have the capability to learn arbitrary complex data distributions. On the downside, training requires telling apart real data from artificially-generated ones using adversarial discriminators, leading to unstable…

Machine Learning · Computer Science 2024-02-27 José Manuel de Frutos , Pablo M. Olmos , Manuel A. Vázquez , Joaquín Míguez

DiffFlow: A Unified SDE Framework for Score-Based Diffusion Models and Generative Adversarial Networks

Generative models can be categorized into two types: explicit generative models that define explicit density forms and allow exact likelihood inference, such as score-based diffusion models (SDMs) and normalizing flows; implicit generative…

Machine Learning · Statistics 2023-07-06 Jingwei Zhang , Han Shi , Jincheng Yu , Enze Xie , Zhenguo Li

DistiLLM: Towards Streamlined Distillation for Large Language Models

Knowledge distillation (KD) is widely used for compressing a teacher model to a smaller student model, reducing its inference cost and memory footprint while preserving model capabilities. However, current KD methods for auto-regressive…

Computation and Language · Computer Science 2024-07-04 Jongwoo Ko , Sungnyun Kim , Tianyi Chen , Se-Young Yun

Class-Incremental Learning using Diffusion Model for Distillation and Replay

Class-incremental learning aims to learn new classes in an incremental fashion without forgetting the previously learned ones. Several research works have shown how additional data can be used by incremental models to help mitigate…

Machine Learning · Computer Science 2023-10-11 Quentin Jodelet , Xin Liu , Yin Jun Phua , Tsuyoshi Murata

Diffusion-Reward Adversarial Imitation Learning

Imitation learning aims to learn a policy from observing expert demonstrations without access to reward signals from environments. Generative adversarial imitation learning (GAIL) formulates imitation learning as adversarial learning,…

Machine Learning · Computer Science 2024-11-27 Chun-Mao Lai , Hsiang-Chun Wang , Ping-Chun Hsieh , Yu-Chiang Frank Wang , Min-Hung Chen , Shao-Hua Sun