Related papers: Data-Free Network Quantization With Adversarial Kn…

Dual Discriminator Adversarial Distillation for Data-free Model Compression

Knowledge distillation has been widely used to produce portable and efficient neural networks which can be well applied on edge devices for computer vision tasks. However, almost all top-performing knowledge distillation methods need to…

Computer Vision and Pattern Recognition · Computer Science 2021-10-06 Haoran Zhao , Xin Sun , Junyu Dong , Hui Yu , Huiyu Zhou

Synthetic data generation method for data-free knowledge distillation in regression neural networks

Knowledge distillation is the technique of compressing a larger neural network, known as the teacher, into a smaller neural network, known as the student, while still trying to maintain the performance of the larger neural network as much…

Machine Learning · Computer Science 2023-05-11 Tianxun Zhou , Keng-Hwee Chiam

Data-Free Adversarial Distillation

Knowledge Distillation (KD) has made remarkable progress in the last few years and become a popular paradigm for model compression and knowledge transfer. However, almost all existing KD algorithms are data-driven, i.e., relying on a large…

Machine Learning · Computer Science 2020-03-03 Gongfan Fang , Jie Song , Chengchao Shen , Xinchao Wang , Da Chen , Mingli Song

Enhancing Data-Free Adversarial Distillation with Activation Regularization and Virtual Interpolation

Knowledge distillation refers to a technique of transferring the knowledge from a large learned model or an ensemble of learned models to a small model. This method relies on access to the original training set, which might not always be…

Machine Learning · Computer Science 2021-02-24 Xiaoyang Qu , Jianzong Wang , Jing Xiao

Large-Scale Generative Data-Free Distillation

Knowledge distillation is one of the most popular and effective techniques for knowledge transfer, model compression and semi-supervised learning. Most existing distillation approaches require the access to original or augmented training…

Machine Learning · Computer Science 2020-12-11 Liangchen Luo , Mark Sandler , Zi Lin , Andrey Zhmoginov , Andrew Howard

Data-Free Knowledge Distillation with Soft Targeted Transfer Set Synthesis

Knowledge distillation (KD) has proved to be an effective approach for deep neural network compression, which learns a compact network (student) by transferring the knowledge from a pre-trained, over-parameterized network (teacher). In…

Machine Learning · Computer Science 2021-04-13 Zi Wang

Generative Adversarial Simulator

Knowledge distillation between machine learning models has opened many new avenues for parameter count reduction, performance improvements, or amortizing training time when changing architectures between the teacher and student network. In…

Machine Learning · Computer Science 2020-11-24 Jonathan Raiman

Data-Free Knowledge Distillation for Deep Neural Networks

Recent advances in model compression have provided procedures for compressing large neural networks to a fraction of their original size while retaining most if not all of their accuracy. However, all of these approaches rely on access to…

Machine Learning · Computer Science 2017-11-27 Raphael Gontijo Lopes , Stefano Fenu , Thad Starner

Knowledge Squeezed Adversarial Network Compression

Deep network compression has been achieved notable progress via knowledge distillation, where a teacher-student learning manner is adopted by using predetermined loss. Recently, more focuses have been transferred to employ the adversarial…

Machine Learning · Computer Science 2019-04-26 Shu Changyong , Li Peng , Xie Yuan , Qu Yanyun , Dai Longquan , Ma Lizhuang

Model compression via distillation and quantization

Deep neural networks (DNNs) continue to make significant advances, solving tasks from image classification to translation or reinforcement learning. One aspect of the field receiving considerable attention is efficiently executing deep…

Neural and Evolutionary Computing · Computer Science 2018-02-16 Antonio Polino , Razvan Pascanu , Dan Alistarh

Zero-Shot Knowledge Distillation in Deep Networks

Knowledge distillation deals with the problem of training a smaller model (Student) from a high capacity source model (Teacher) so as to retain most of its performance. Existing approaches use either the training data or meta-data extracted…

Machine Learning · Computer Science 2019-05-21 Gaurav Kumar Nayak , Konda Reddy Mopuri , Vaisakh Shaj , R. Venkatesh Babu , Anirban Chakraborty

Collaborative Multi-Teacher Knowledge Distillation for Learning Low Bit-width Deep Neural Networks

Knowledge distillation which learns a lightweight student model by distilling knowledge from a cumbersome teacher model is an attractive approach for learning compact deep neural networks (DNNs). Recent works further improve student network…

Computer Vision and Pattern Recognition · Computer Science 2022-10-31 Cuong Pham , Tuan Hoang , Thanh-Toan Do

Small Scale Data-Free Knowledge Distillation

Data-free knowledge distillation is able to utilize the knowledge learned by a large teacher network to augment the training of a smaller student network without accessing the original training data, avoiding privacy, security, and…

Computer Vision and Pattern Recognition · Computer Science 2024-06-13 He Liu , Yikai Wang , Huaping Liu , Fuchun Sun , Anbang Yao

Learning Privacy-Preserving Student Networks via Discriminative-Generative Distillation

While deep models have proved successful in learning rich knowledge from massive well-annotated data, they may pose a privacy leakage risk in practical deployment. It is necessary to find an effective trade-off between high utility and…

Machine Learning · Computer Science 2024-09-05 Shiming Ge , Bochao Liu , Pengju Wang , Yong Li , Dan Zeng

Knowledge Distillation with Adversarial Samples Supporting Decision Boundary

Many recent works on knowledge distillation have provided ways to transfer the knowledge of a trained network for improving the learning process of a new one, but finding a good technique for knowledge distillation is still an open problem.…

Machine Learning · Computer Science 2018-12-17 Byeongho Heo , Minsik Lee , Sangdoo Yun , Jin Young Choi

Qimera: Data-free Quantization with Synthetic Boundary Supporting Samples

Model quantization is known as a promising method to compress deep neural networks, especially for inferences on lightweight mobile or edge devices. However, model quantization usually requires access to the original training data to…

Machine Learning · Computer Science 2021-11-05 Kanghyun Choi , Deokki Hong , Noseong Park , Youngsok Kim , Jinho Lee

Dataset Distillation via Adversarial Prediction Matching

Dataset distillation is the technique of synthesizing smaller condensed datasets from large original datasets while retaining necessary information to persist the effect. In this paper, we approach the dataset distillation problem from a…

Computer Vision and Pattern Recognition · Computer Science 2023-12-15 Mingyang Chen , Bo Huang , Junda Lu , Bing Li , Yi Wang , Minhao Cheng , Wei Wang

Effectiveness of Arbitrary Transfer Sets for Data-free Knowledge Distillation

Knowledge Distillation is an effective method to transfer the learning across deep neural networks. Typically, the dataset originally used for training the Teacher model is chosen as the "Transfer Set" to conduct the knowledge transfer to…

Machine Learning · Computer Science 2020-11-19 Gaurav Kumar Nayak , Konda Reddy Mopuri , Anirban Chakraborty

Post-training Model Quantization Using GANs for Synthetic Data Generation

Quantization is a widely adopted technique for deep neural networks to reduce the memory and computational resources required. However, when quantized, most models would need a suitable calibration process to keep their performance intact,…

Computer Vision and Pattern Recognition · Computer Science 2023-05-11 Athanasios Masouris , Mansi Sharma , Adrian Boguszewski , Alexander Kozlov , Zhuo Wu , Raymond Lo

Hybrid Data-Free Knowledge Distillation

Data-free knowledge distillation aims to learn a compact student network from a pre-trained large teacher network without using the original training data of the teacher network. Existing collection-based and generation-based methods train…

Computer Vision and Pattern Recognition · Computer Science 2024-12-19 Jialiang Tang , Shuo Chen , Chen Gong