English

On Compressing U-net Using Knowledge Distillation

Machine Learning 2018-12-04 v1 Machine Learning

Abstract

We study the use of knowledge distillation to compress the U-net architecture. We show that, while standard distillation is not sufficient to reliably train a compressed U-net, introducing other regularization methods, such as batch normalization and class re-weighting, in knowledge distillation significantly improves the training process. This allows us to compress a U-net by over 1000x, i.e., to 0.1% of its original number of parameters, at a negligible decrease in performance.

Keywords

Cite

@article{arxiv.1812.00249,
  title  = {On Compressing U-net Using Knowledge Distillation},
  author = {Karttikeya Mangalam and Mathieu Salzamann},
  journal= {arXiv preprint arXiv:1812.00249},
  year   = {2018}
}

Comments

4 pages, 1 figure

R2 v1 2026-06-23T06:28:00.676Z