English

QUEST: Quantized embedding space for transferring knowledge

Computer Vision and Pattern Recognition 2020-07-21 v2 Machine Learning

Abstract

Knowledge distillation refers to the process of training a compact student network to achieve better accuracy by learning from a high capacity teacher network. Most of the existing knowledge distillation methods direct the student to follow the teacher by matching the teacher's output, feature maps or their distribution. In this work, we propose a novel way to achieve this goal: by distilling the knowledge through a quantized space. According to our method, the teacher's feature maps are quantized to represent the main visual concepts encompassed in the feature maps. The student is then asked to predict the quantized representation, which thus forms the task that the student uses to learn from the teacher. Despite its simplicity, we show that our approach is able to yield results that improve the state of the art on knowledge distillation. To that end, we provide an extensive evaluation across several network architectures and most commonly used benchmark datasets.

Keywords

Cite

@article{arxiv.1912.01540,
  title  = {QUEST: Quantized embedding space for transferring knowledge},
  author = {Himalaya Jain and Spyros Gidaris and Nikos Komodakis and Patrick Pérez and Matthieu Cord},
  journal= {arXiv preprint arXiv:1912.01540},
  year   = {2020}
}

Comments

Accepted at ECCV 2020

R2 v1 2026-06-23T12:34:40.077Z