Related papers: ReffAKD: Resource-efficient Autoencoder-based Know…

A Fast Knowledge Distillation Framework for Visual Recognition

While Knowledge Distillation (KD) has been recognized as a useful tool in many visual tasks, such as supervised classification and self-supervised representation learning, the main drawback of a vanilla KD framework is its mechanism, which…

Computer Vision and Pattern Recognition · Computer Science 2021-12-03 Zhiqiang Shen, Eric Xing

Knowledge Distillation with the Reused Teacher Classifier

Knowledge distillation aims to compress a powerful yet cumbersome teacher model into a lightweight student model without much sacrifice of performance. For this purpose, various approaches have been proposed over the past few years,…

Computer Vision and Pattern Recognition · Computer Science 2022-03-29 Defang Chen, Jian-Ping Mei, Hailin Zhang, Can Wang, Yan Feng, Chun Chen

Student-friendly Knowledge Distillation

In knowledge distillation, the knowledge from the teacher model is often too complex for the student model to thoroughly process. However, good teachers in real life always simplify complex material before teaching it to students. Inspired…

Computer Vision and Pattern Recognition · Computer Science 2023-05-19 Mengyang Yuan, Bo Lang, Fengnan Quan

Robust Active Distillation

Distilling knowledge from a large teacher model to a lightweight one is a widely successful approach for generating compact, powerful models in the semi-supervised learning setting where a limited amount of labeled data is available. In…

Machine Learning · Computer Science 2023-02-07 Cenk Baykal, Khoa Trinh, Fotis Iliopoulos, Gaurav Menghani, Erik Vee

Learning from a Lightweight Teacher for Efficient Knowledge Distillation

Knowledge Distillation (KD) is an effective framework for compressing deep learning models, realized by a student-teacher paradigm requiring small student networks to mimic the soft target generated by well-trained teachers. However, the…

Computer Vision and Pattern Recognition · Computer Science 2020-05-20 Yuang Liu, Wei Zhang, Jun Wang

ResKD: Residual-Guided Knowledge Distillation

Knowledge distillation, aimed at transferring the knowledge from a heavy teacher network to a lightweight student network, has emerged as a promising technique for compressing neural networks. However, due to the capacity gap between the…

Computer Vision and Pattern Recognition · Computer Science 2021-12-01 Xuewei Li, Songyuan Li, Bourahla Omar, Fei Wu, Xi Li

Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation

In this paper, we introduce a novel knowledge distillation approach for the semantic segmentation task. Unlike previous methods that rely on power-trained teachers or other modalities to provide additional knowledge, our approach does not…

Computer Vision and Pattern Recognition · Computer Science 2024-07-19 Shoumeng Qiu, Jie Chen, Xinrun Li, Ru Wan, Xiangyang Xue, Jian Pu

MetaDistiller: Network Self-Boosting via Meta-Learned Top-Down Distillation

Knowledge Distillation (KD) has been one of the most popular methods to learn a compact model. However, it still suffers from high demand in time and computational resources caused by sequential training pipeline. Furthermore, the soft…

Computer Vision and Pattern Recognition · Computer Science 2020-08-28 Benlin Liu, Yongming Rao, Jiwen Lu, Jie Zhou, Cho-jui Hsieh

Revisiting Knowledge Distillation via Label Smoothing Regularization

Knowledge Distillation (KD) aims to distill the knowledge of a cumbersome teacher model into a lightweight student model. Its success is generally attributed to the privileged information on similarities among categories provided by the…

Computer Vision and Pattern Recognition · Computer Science 2021-03-05 Li Yuan, Francis E. H. Tay, Guilin Li, Tao Wang, Jiashi Feng

Knowledge Distillation in Document Retrieval

Complex deep learning models now achieve state of the art performance for many document retrieval tasks. The best models process the query or claim jointly with the document. However, for fast scalable search it is desirable to have document…

Information Retrieval · Computer Science 2019-11-26 Siamak Shakeri, Abhinav Sethy, Cheng Cheng

Knowledge Distillation from Internal Representations

Knowledge distillation is typically conducted by training a small model (the student) to mimic a large and cumbersome model (the teacher). The idea is to compress the knowledge from the teacher by using its output probabilities as…

Computation and Language · Computer Science 2020-01-17 Gustavo Aguilar, Yuan Ling, Yu Zhang, Benjamin Yao, Xing Fan, Chenlei Guo

Refine Myself by Teaching Myself: Feature Refinement via Self-Knowledge Distillation

Knowledge distillation is a method of transferring the knowledge from a pretrained complex teacher model to a student model, so a smaller network can replace a large teacher network at the deployment stage. To reduce the necessity of…

Computer Vision and Pattern Recognition · Computer Science 2021-03-16 Mingi Ji, Seungjae Shin, Seunghyun Hwang, Gibeom Park, Il-Chul Moon

Lightweight Neural Network with Knowledge Distillation for CSI Feedback

Deep learning has shown promise in enhancing channel state information (CSI) feedback. However, many studies indicate that better feedback performance often accompanies higher computational complexity. Pursuing better performance-complexity…

Signal Processing · Electrical Eng. & Systems 2024-03-05 Yiming Cui, Jiajia Guo, Zheng Cao, Huaze Tang, Chao-Kai Wen, Shi Jin, Xin Wang, Xiaolin Hou

Distillation from Heterogeneous Models for Top-K Recommendation

Recent recommender systems have shown remarkable performance by using an ensemble of heterogeneous models. However, it is exceedingly costly because it requires resources and inference latency proportional to the number of models, which…

Information Retrieval · Computer Science 2023-03-03 SeongKu Kang, Wonbin Kweon, Dongha Lee, Jianxun Lian, Xing Xie, Hwanjo Yu

Data-Efficient Ranking Distillation for Image Retrieval

Recent advances in deep learning have led to rapid developments in the field of image retrieval. However, the best performing architectures incur significant computational cost. Recent approaches tackle this issue using knowledge…

Computer Vision and Pattern Recognition · Computer Science 2020-07-14 Zakaria Laskar, Juho Kannala

Parameter-Efficient and Student-Friendly Knowledge Distillation

Knowledge distillation (KD) has been extensively employed to transfer the knowledge from a large teacher model to the smaller students, where the parameters of the teacher are fixed (or partially) during training. Recent studies show that…

Machine Learning · Computer Science 2022-06-01 Jun Rao, Xv Meng, Liang Ding, Shuhan Qi, Dacheng Tao

Subclass Knowledge Distillation with Known Subclass Labels

This work introduces a novel knowledge distillation framework for classification tasks where information on existing subclasses is available and taken into consideration. In classification tasks with a small number of classes or binary…

Machine Learning · Computer Science 2022-07-19 Ahmad Sajedi, Yuri A. Lawryshyn, Konstantinos N. Plataniotis

AdaDistill: Adaptive Knowledge Distillation for Deep Face Recognition

Knowledge distillation (KD) aims at improving the performance of a compact student model by distilling the knowledge from a high-performing teacher model. In this paper, we present an adaptive KD approach, namely AdaDistill, for deep face…

Computer Vision and Pattern Recognition · Computer Science 2024-07-02 Fadi Boutros, Vitomir Štruc, Naser Damer

Zero-Shot Knowledge Distillation from a Decision-Based Black-Box Model

Knowledge distillation (KD) is a successful approach for deep neural network acceleration, with which a compact network (student) is trained by mimicking the softmax output of a pre-trained high-capacity network (teacher). In tradition, KD…

Machine Learning · Computer Science 2021-06-08 Zi Wang

On the Efficiency of Subclass Knowledge Distillation in Classification Tasks

This work introduces a novel knowledge distillation framework for classification tasks where information on existing subclasses is available and taken into consideration. In classification tasks with a small number of classes or binary…

Machine Learning · Computer Science 2022-07-06 Ahmad Sajedi, Konstantinos N. Plataniotis