Related papers: Data-Free Network Quantization With Adversarial Kn…

200 papers

Knowledge distillation has been widely used to produce portable and efficient neural networks which can be well applied on edge devices for computer vision tasks. However, almost all top-performing knowledge distillation methods need to…

Computer Vision and Pattern Recognition · Computer Science 2021-10-06 Haoran Zhao, Xin Sun, Junyu Dong, Hui Yu, Huiyu Zhou

Knowledge distillation is the technique of compressing a larger neural network, known as the teacher, into a smaller neural network, known as the student, while still trying to maintain the performance of the larger neural network as much…

Machine Learning · Computer Science 2023-05-11 Tianxun Zhou, Keng-Hwee Chiam

Knowledge Distillation (KD) has made remarkable progress in the last few years and become a popular paradigm for model compression and knowledge transfer. However, almost all existing KD algorithms are data-driven, i.e., relying on a large…

Machine Learning · Computer Science 2020-03-03 Gongfan Fang, Jie Song, Chengchao Shen, Xinchao Wang, Da Chen, Mingli Song

Knowledge distillation refers to a technique of transferring the knowledge from a large learned model or an ensemble of learned models to a small model. This method relies on access to the original training set, which might not always be…

Machine Learning · Computer Science 2021-02-24 Xiaoyang Qu, Jianzong Wang, Jing Xiao

Knowledge distillation is one of the most popular and effective techniques for knowledge transfer, model compression and semi-supervised learning. Most existing distillation approaches require the access to original or augmented training…

Machine Learning · Computer Science 2020-12-11 Liangchen Luo, Mark Sandler, Zi Lin, Andrey Zhmoginov, Andrew Howard

Knowledge distillation (KD) has proved to be an effective approach for deep neural network compression, which learns a compact network (student) by transferring the knowledge from a pre-trained, over-parameterized network (teacher). In…

Machine Learning · Computer Science 2021-04-13 Zi Wang

Knowledge distillation between machine learning models has opened many new avenues for parameter count reduction, performance improvements, or amortizing training time when changing architectures between the teacher and student network. In…

Machine Learning · Computer Science 2020-11-24 Jonathan Raiman

Recent advances in model compression have provided procedures for compressing large neural networks to a fraction of their original size while retaining most if not all of their accuracy. However, all of these approaches rely on access to…

Machine Learning · Computer Science 2017-11-27 Raphael Gontijo Lopes, Stefano Fenu, Thad Starner

Deep network compression has achieved notable progress via knowledge distillation, where a teacher-student learning scheme is adopted using a predetermined loss. Recently, more attention has shifted to employing the adversarial…

Machine Learning · Computer Science 2019-04-26 Shu Changyong, Li Peng, Xie Yuan, Qu Yanyun, Dai Longquan, Ma Lizhuang

Deep neural networks (DNNs) continue to make significant advances, solving tasks from image classification to translation or reinforcement learning. One aspect of the field receiving considerable attention is efficiently executing deep…

Neural and Evolutionary Computing · Computer Science 2018-02-16 Antonio Polino, Razvan Pascanu, Dan Alistarh

Knowledge distillation deals with the problem of training a smaller model (Student) from a high capacity source model (Teacher) so as to retain most of its performance. Existing approaches use either the training data or meta-data extracted…

Machine Learning · Computer Science 2019-05-21 Gaurav Kumar Nayak, Konda Reddy Mopuri, Vaisakh Shaj, R. Venkatesh Babu, Anirban Chakraborty

Knowledge distillation, which learns a lightweight student model by distilling knowledge from a cumbersome teacher model, is an attractive approach for learning compact deep neural networks (DNNs). Recent works further improve student network…

Computer Vision and Pattern Recognition · Computer Science 2022-10-31 Cuong Pham, Tuan Hoang, Thanh-Toan Do

Data-free knowledge distillation is able to utilize the knowledge learned by a large teacher network to augment the training of a smaller student network without accessing the original training data, avoiding privacy, security, and…

Computer Vision and Pattern Recognition · Computer Science 2024-06-13 He Liu, Yikai Wang, Huaping Liu, Fuchun Sun, Anbang Yao

While deep models have proved successful in learning rich knowledge from massive well-annotated data, they may pose a privacy leakage risk in practical deployment. It is necessary to find an effective trade-off between high utility and…

Machine Learning · Computer Science 2024-09-05 Shiming Ge, Bochao Liu, Pengju Wang, Yong Li, Dan Zeng

Many recent works on knowledge distillation have provided ways to transfer the knowledge of a trained network for improving the learning process of a new one, but finding a good technique for knowledge distillation is still an open problem.…

Machine Learning · Computer Science 2018-12-17 Byeongho Heo, Minsik Lee, Sangdoo Yun, Jin Young Choi

Model quantization is known as a promising method to compress deep neural networks, especially for inferences on lightweight mobile or edge devices. However, model quantization usually requires access to the original training data to…

Machine Learning · Computer Science 2021-11-05 Kanghyun Choi, Deokki Hong, Noseong Park, Youngsok Kim, Jinho Lee

Dataset distillation is the technique of synthesizing smaller condensed datasets from large original datasets while retaining necessary information to persist the effect. In this paper, we approach the dataset distillation problem from a…

Computer Vision and Pattern Recognition · Computer Science 2023-12-15 Mingyang Chen, Bo Huang, Junda Lu, Bing Li, Yi Wang, Minhao Cheng, Wei Wang

Knowledge Distillation is an effective method to transfer the learning across deep neural networks. Typically, the dataset originally used for training the Teacher model is chosen as the "Transfer Set" to conduct the knowledge transfer to…

Machine Learning · Computer Science 2020-11-19 Gaurav Kumar Nayak, Konda Reddy Mopuri, Anirban Chakraborty

Quantization is a widely adopted technique for deep neural networks to reduce the memory and computational resources required. However, when quantized, most models would need a suitable calibration process to keep their performance intact,…

Computer Vision and Pattern Recognition · Computer Science 2023-05-11 Athanasios Masouris, Mansi Sharma, Adrian Boguszewski, Alexander Kozlov, Zhuo Wu, Raymond Lo

Data-free knowledge distillation aims to learn a compact student network from a pre-trained large teacher network without using the original training data of the teacher network. Existing collection-based and generation-based methods train…

Computer Vision and Pattern Recognition · Computer Science 2024-12-19 Jialiang Tang, Shuo Chen, Chen Gong