SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.
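In the classic formulation (Hinton et al., 2015), the student is trained to match the teacher's temperature-softened output distribution. The sketch below is a minimal, dependency-free illustration of that distillation loss; the function names and the choice of temperature are illustrative, not from any particular paper on this page.

```python
import math

def softmax(logits, temperature=1.0):
    # Temperature-scaled softmax: higher T yields a softer distribution,
    # exposing the teacher's relative confidences across wrong classes.
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_logits, temperature=4.0):
    # KL divergence from the softened teacher distribution to the softened
    # student distribution, scaled by T^2 so gradient magnitudes stay
    # comparable as the temperature changes.
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return (temperature ** 2) * kl
```

In practice this term is combined with the ordinary cross-entropy on ground-truth labels, weighted by a mixing coefficient.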

Papers

Showing 2501–2525 of 4240 papers

Title | Hype
Real-time Spatio-temporal Action Localization via Learning Motion Representation | 0
ReasoningRank: Teaching Student Models to Rank through Reasoning-Based Knowledge Distillation | 0
Rebalancing Multi-Label Class-Incremental Learning | 0
Recalling The Forgotten Class Memberships: Unlearned Models Can Be Noisy Labelers to Leak Privacy | 0
Recent Advances in Direct Speech-to-text Translation | 0
Recent Advances of Continual Learning in Computer Vision: An Overview | 0
Membership Privacy for Machine Learning Models Through Knowledge Transfer | 0
Reconstructing Perceived Images from Brain Activity by Visually-guided Cognitive Representation and Adversarial Learning | 0
Rectified Decision Trees: Exploring the Landscape of Interpretable and Effective Machine Learning | 0
Rectified Decision Trees: Towards Interpretability, Compression and Empirical Soundness | 0
Rectifying the Data Bias in Knowledge Distillation | 0
Recurrent knowledge distillation | 0
Recurrent Stacking of Layers in Neural Networks: An Application to Neural Machine Translation | 0
Redistributing Low-Frequency Words: Making the Most of Monolingual Data in Non-Autoregressive Translation | 0
Reducing the gap between streaming and non-streaming Transducer-based ASR by adaptive two-stage knowledge distillation | 0
Reducing the Teacher-Student Gap via Adaptive Temperatures | 0
RefBERT: Compressing BERT by Referencing to Pre-computed Representations | 0
Referee: Reference-Free Sentence Summarization with Sharper Controllability through Symbolic Knowledge Distillation | 0
Refine and Distill: Exploiting Cycle-Inconsistency and Knowledge Distillation for Unsupervised Monocular Depth Estimation | 0
Region-aware Knowledge Distillation for Efficient Image-to-Image Translation | 0
Regression Bugs Are In Your Model! Measuring, Reducing and Analyzing Regressions In NLP Model Updates | 0
Reinforced Iterative Knowledge Distillation for Cross-Lingual Named Entity Recognition | 0
Reinforced Multi-Teacher Selection for Knowledge Distillation | 0
Relational Subsets Knowledge Distillation for Long-tailed Retinal Diseases Recognition | 0
Relation Modeling and Distillation for Learning with Noisy Labels | 0

Benchmark Results

#  | Model                             | Metric             | Claimed | Verified | Status
1  | ScaleKD (T: BEiT-L, S: ViT-B/14)  | Top-1 accuracy (%) | 86.43   | —        | Unverified
2  | ScaleKD (T: Swin-L, S: ViT-B/16)  | Top-1 accuracy (%) | 85.53   | —        | Unverified
3  | ScaleKD (T: Swin-L, S: ViT-S/16)  | Top-1 accuracy (%) | 83.93   | —        | Unverified
4  | ScaleKD (T: Swin-L, S: Swin-T)    | Top-1 accuracy (%) | 83.8    | —        | Unverified
5  | KD++ (T: regnety-16GF, S: ViT-B)  | Top-1 accuracy (%) | 83.6    | —        | Unverified
6  | VkD (T: RegNety 160, S: DeiT-S)   | Top-1 accuracy (%) | 82.9    | —        | Unverified
7  | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7    | —        | Unverified
8  | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55   | —        | Unverified
9  | DiffKD (T: Swin-L, S: Swin-T)     | Top-1 accuracy (%) | 82.5    | —        | Unverified
10 | DIST (T: Swin-L, S: Swin-T)       | Top-1 accuracy (%) | 82.3    | —        | Unverified
#  | Model                                              | Metric             | Claimed | Verified | Status
1  | SRD (T: resnet-32x4, S: shufflenet-v2)             | Top-1 accuracy (%) | 79.86   | —        | Unverified
2  | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2)   | Top-1 accuracy (%) | 78.76   | —        | Unverified
3  | MV-MR (T: CLIP/ViT-B-16, S: resnet50)              | Top-1 accuracy (%) | 78.6    | —        | Unverified
4  | resnet8x4 (T: resnet32x4, S: resnet8x4)            | Top-1 accuracy (%) | 78.28   | —        | Unverified
5  | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08   | —        | Unverified
6  | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2)      | Top-1 accuracy (%) | 77.93   | —        | Unverified
7  | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1)      | Top-1 accuracy (%) | 77.68   | —        | Unverified
8  | resnet8x4 (T: resnet32x4, S: resnet8x4)            | Top-1 accuracy (%) | 77.5    | —        | Unverified
9  | resnet8x4 (T: resnet32x4, S: resnet8x4)            | Top-1 accuracy (%) | 76.68   | —        | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4)            | Top-1 accuracy (%) | 76.31   | —        | Unverified
#  | Model                                 | Metric | Claimed | Verified | Status
1  | LSHFM (T: ResNet101, S: ResNet50)     | mAP    | 93.17   | —        | Unverified
2  | LSHFM (T: ResNet101, S: MobileNetV2)  | mAP    | 90.14   | —        | Unverified
#  | Model                               | Metric | Claimed | Verified | Status
1  | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE   | 2.43    | —        | Unverified