Related papers: Data-free Knowledge Distillation with Diffusion Mo…

Towards Effective Data-Free Knowledge Distillation via Diverse Diffusion Augmentation

Data-free knowledge distillation (DFKD) has emerged as a pivotal technique in the domain of model compression, substantially reducing the dependency on the original training data. Nonetheless, conventional DFKD methods that employ…

Computer Vision and Pattern Recognition · Computer Science 2024-10-24 Muquan Li , Dongyang Zhang , Tao He , Xiurui Xie , Yuan-Fang Li , Ke Qin

CAE-DFKD: Bridging the Transferability Gap in Data-Free Knowledge Distillation

Data-Free Knowledge Distillation (DFKD) enables the knowledge transfer from the given pre-trained teacher network to the target student model without access to the real training data. Existing DFKD methods focus primarily on improving image…

Computer Vision and Pattern Recognition · Computer Science 2025-05-01 Zherui Zhang , Changwei Wang , Rongtao Xu , Wenhao Xu , Shibiao Xu , Yu Zhang , Li Guo

Sampling to Distill: Knowledge Transfer from Open-World Data

Data-Free Knowledge Distillation (DFKD) is a novel task that aims to train high-performance student models using only the pre-trained teacher network without original training data. Most of the existing DFKD methods rely heavily on…

Computer Vision and Pattern Recognition · Computer Science 2024-07-23 Yuzheng Wang , Zhaoyu Chen , Jie Zhang , Dingkang Yang , Zuhao Ge , Yang Liu , Siao Liu , Yunquan Sun , Wenqiang Zhang , Lizhe Qi

Dynamic Data-Free Knowledge Distillation by Easy-to-Hard Learning Strategy

Data-free knowledge distillation (DFKD) is a widely-used strategy for Knowledge Distillation (KD) whose training data is not available. It trains a lightweight student model with the aid of a large pretrained teacher model without any…

Computer Vision and Pattern Recognition · Computer Science 2023-07-06 Jingru Li , Sheng Zhou , Liangcheng Li , Haishuai Wang , Zhi Yu , Jiajun Bu

Large-Scale Data-Free Knowledge Distillation for ImageNet via Multi-Resolution Data Generation

Data-Free Knowledge Distillation (DFKD) is an advanced technique that enables knowledge transfer from a teacher model to a student model without relying on original training data. While DFKD methods have achieved success on smaller datasets…

Computer Vision and Pattern Recognition · Computer Science 2024-11-27 Minh-Tuan Tran , Trung Le , Xuan-May Le , Jianfei Cai , Mehrtash Harandi , Dinh Phung

Knowledge Diffusion for Distillation

The representation gap between teacher and student is an emerging topic in knowledge distillation (KD). To reduce the gap and improve the performance, current methods often resort to complicated training schemes, loss functions, and feature…

Computer Vision and Pattern Recognition · Computer Science 2023-12-05 Tao Huang , Yuan Zhang , Mingkai Zheng , Shan You , Fei Wang , Chen Qian , Chang Xu

Data-Free Knowledge Transfer: A Survey

In the last decade, many deep learning models have been well trained and made a great success in various fields of machine intelligence, especially for computer vision and natural language processing. To better leverage the potential of…

Machine Learning · Computer Science 2022-01-03 Yuang Liu , Wei Zhang , Jun Wang , Jianyong Wang

DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture

Diffusion models (DMs) have demonstrated exceptional generative capabilities across various domains, including image, video, and so on. A key factor contributing to their effectiveness is the high quantity and quality of data used during…

Computer Vision and Pattern Recognition · Computer Science 2025-03-03 Qianlong Xiang , Miao Zhang , Yuzhang Shang , Jianlong Wu , Yan Yan , Liqiang Nie

Hybrid Data-Free Knowledge Distillation

Data-free knowledge distillation aims to learn a compact student network from a pre-trained large teacher network without using the original training data of the teacher network. Existing collection-based and generation-based methods train…

Computer Vision and Pattern Recognition · Computer Science 2024-12-19 Jialiang Tang , Shuo Chen , Chen Gong

Teacher as a Lenient Expert: Teacher-Agnostic Data-Free Knowledge Distillation

Data-free knowledge distillation (DFKD) aims to distill pretrained knowledge to a student model with the help of a generator without using original data. In such data-free scenarios, achieving stable performance of DFKD is essential due to…

Machine Learning · Computer Science 2024-02-21 Hyunjune Shin , Dong-Wan Choi

Up to 100$\times$ Faster Data-free Knowledge Distillation

Data-free knowledge distillation (DFKD) has recently been attracting increasing attention from research communities, attributed to its capability to compress a model only using synthetic data. Despite the encouraging results achieved,…

Machine Learning · Computer Science 2022-02-28 Gongfan Fang , Kanya Mo , Xinchao Wang , Jie Song , Shitao Bei , Haofei Zhang , Mingli Song

Data-Free Distillation of Language Model by Text-to-Text Transfer

Data-Free Knowledge Distillation (DFKD) plays a vital role in compressing the model when original training data is unavailable. Previous works for DFKD in NLP mainly focus on distilling encoder-only structures like BERT on classification…

Computation and Language · Computer Science 2023-11-06 Zheyuan Bai , Xinduo Liu , Hailin Hu , Tianyu Guo , Qinghua Zhang , Yunhe Wang

FiGKD: Fine-Grained Knowledge Distillation via High-Frequency Detail Transfer

Knowledge distillation (KD) is a widely adopted technique for transferring knowledge from a high-capacity teacher model to a smaller student model by aligning their output distributions. However, existing methods often underperform in…

Computer Vision and Pattern Recognition · Computer Science 2026-03-25 Seonghak Kim

PRISM: Precision-Recall Informed Data-Free Knowledge Distillation via Generative Diffusion

Data-free knowledge distillation (DFKD) transfers knowledge from a teacher to a student without access to the real in-distribution (ID) data. While existing methods perform well on small-scale images, they suffer from mode collapse when…

Computer Vision and Pattern Recognition · Computer Science 2025-09-23 Xuewan He , Jielei Wang , Zihan Cheng , Yuchen Su , Shiyue Huang , Guoming Lu

Data-Free Knowledge Distillation with Soft Targeted Transfer Set Synthesis

Knowledge distillation (KD) has proved to be an effective approach for deep neural network compression, which learns a compact network (student) by transferring the knowledge from a pre-trained, over-parameterized network (teacher). In…

Machine Learning · Computer Science 2021-04-13 Zi Wang

Relational Diffusion Distillation for Efficient Image Generation

Although the diffusion model has achieved remarkable performance in the field of image generation, its high inference delay hinders its wide application in edge devices with scarce computing resources. Therefore, many training-free sampling…

Computer Vision and Pattern Recognition · Computer Science 2024-12-17 Weilun Feng , Chuanguang Yang , Zhulin An , Libo Huang , Boyu Diao , Fei Wang , Yongjun Xu

FreeKD: Free-direction Knowledge Distillation for Graph Neural Networks

Knowledge distillation (KD) has demonstrated its effectiveness to boost the performance of graph neural networks (GNNs), where its goal is to distill knowledge from a deeper teacher GNN into a shallower student GNN. However, it is actually…

Machine Learning · Computer Science 2023-03-28 Kaituo Feng , Changsheng Li , Ye Yuan , Guoren Wang

De-confounded Data-free Knowledge Distillation for Handling Distribution Shifts

Data-Free Knowledge Distillation (DFKD) is a promising task to train high-performance small models to enhance actual deployment without relying on the original training data. Existing methods commonly avoid relying on private data by…

Computer Vision and Pattern Recognition · Computer Science 2024-03-29 Yuzheng Wang , Dingkang Yang , Zhaoyu Chen , Yang Liu , Siao Liu , Wenqiang Zhang , Lihua Zhang , Lizhe Qi

Distilling Vision-Language Foundation Models: A Data-Free Approach via Prompt Diversification

Data-Free Knowledge Distillation (DFKD) has shown great potential in creating a compact student model while alleviating the dependency on real training data by synthesizing surrogate data. However, prior arts are seldom discussed under…

Computer Vision and Pattern Recognition · Computer Science 2024-07-23 Yunyi Xuan , Weijie Chen , Shicai Yang , Di Xie , Luojun Lin , Yueting Zhuang

Knowledge Distillation with Deep Supervision

Knowledge distillation aims to enhance the performance of a lightweight student model by exploiting the knowledge from a pre-trained cumbersome teacher model. However, in the traditional knowledge distillation, teacher predictions are only…

Machine Learning · Computer Science 2023-05-26 Shiya Luo , Defang Chen , Can Wang