Teacher Network Calibration Improves Cross-Quality Knowledge Distillation

2023-04-15Code Available0· sign in to hype

Pia Čuk, Robin Senge, Mikko Lauri, Simone Frintrop

Code Available — Be the first to reproduce this paper.

Code

github.com/piacuk/distillistic
OfficialIn paperpytorch★ 5

Abstract

We investigate cross-quality knowledge distillation (CQKD), a knowledge distillation method where knowledge from a teacher network trained with full-resolution images is transferred to a student network that takes as input low-resolution images. As image size is a deciding factor for the computational load of computer vision applications, CQKD notably reduces the requirements by only using the student network at inference time. Our experimental results show that CQKD outperforms supervised learning in large-scale image classification problems. We also highlight the importance of calibrating neural networks: we show that with higher temperature smoothing of the teacher's output distribution, the student distribution exhibits a higher entropy, which leads to both, a lower calibration error and a higher network accuracy.

Tasks

image-classification Image Classification Knowledge Distillation

Teacher Network Calibration Improves Cross-Quality Knowledge Distillation

Code

Abstract

Tasks

Reproductions