SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have a higher knowledge capacity than small models, that capacity may not be fully utilized. Distillation exploits this by training a compact "student" model to reproduce the behavior of a larger "teacher", classically by matching the teacher's temperature-softened output distribution in addition to the ground-truth labels.
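To make the mechanism concrete, here is a minimal sketch of the classic soft-target distillation loss in PyTorch, in the style of Hinton et al. (2015). The function name `distillation_loss`, the temperature of 4.0, and the mixing weight `alpha` are illustrative assumptions, not values taken from any paper listed on this page.

```python
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.5):
    """Blend hard-label cross-entropy with a KL term that pulls the
    student's softened predictions toward the teacher's.
    Hyperparameters here are illustrative defaults, not from any cited paper."""
    # Soften both output distributions with the same temperature.
    soft_targets = F.softmax(teacher_logits / temperature, dim=-1)
    log_student = F.log_softmax(student_logits / temperature, dim=-1)
    # KL divergence between teacher and student soft distributions; the T^2
    # factor keeps soft-target gradients on the same scale as the hard-label term.
    kd_term = F.kl_div(log_student, soft_targets,
                       reduction="batchmean") * temperature ** 2
    ce_term = F.cross_entropy(student_logits, labels)
    return alpha * kd_term + (1 - alpha) * ce_term
```

In a typical training loop, `teacher_logits` would be computed under `torch.no_grad()` so that gradients flow only through the student.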

Papers

Showing papers 2551–2575 of 4240 (page 103 of 170)

Each paper below currently shows a Hype count of 0 and no verification status.

Rethinking the Knowledge Distillation From the Perspective of Model Calibration
Rethinking the Upsampling Layer in Hyperspectral Image Super Resolution
Retrieve Anything To Augment Large Language Models
Revealing the Two Sides of Data Augmentation: An Asymmetric Distillation-based Win-Win Solution for Open-Set Recognition
Reverse-engineering recurrent neural network solutions to a hierarchical inference task for mice
Reverse Thinking Makes LLMs Stronger Reasoners
Review helps learn better: Temporal Supervised Knowledge Distillation
Revisiting Architecture-aware Knowledge Distillation: Smaller Models and Faster Search
Revisiting Data Augmentation in Model Compression: An Empirical and Comprehensive Study
Revisiting Graph based Social Recommendation: A Distillation Enhanced Social Graph Network
Revisiting Intermediate-Layer Matching in Knowledge Distillation: Layer-Selection Strategy Doesn't Matter (Much)
Revisiting Knowledge Distillation for Object Detection
Revisiting Multi-modal 3D Semantic Segmentation in Real-world Autonomous Driving
Revisiting Self-Distillation
Reward-Based 1-bit Compressed Federated Distillation on Blockchain
Reward Modeling with Ordinal Feedback: Wisdom of the Crowd
Rich Feature Distillation with Feature Affinity Module for Efficient Image Dehazing
RKLD: Reverse KL-Divergence-based Knowledge Distillation for Unlearning Personal Information in Large Language Models
RL-based Query Rewriting with Distilled LLM for online E-Commerce Systems
RNAS-CL: Robust Neural Architecture Search by Cross-Layer Knowledge Distillation
Robust Active Distillation
Robust Distillation for Worst-class Performance
Robust Distillation via Untargeted and Targeted Intermediate Adversarial Samples
RobustDistiller: Compressing Universal Speech Representations for Enhanced Environment Robustness
Robust feature knowledge distillation for enhanced performance of lightweight crack segmentation models

Benchmark Results

In the tables below, T: denotes the teacher model and S: the student model. A dash in the Verified column means no verified value has been recorded.

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | – | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | – | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | – | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | – | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | – | Unverified
6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | – | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | – | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | – | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | – | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | – | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 79.86 | – | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 78.76 | – | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 Accuracy (%) | 78.6 | – | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 78.28 | – | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 Accuracy (%) | 78.08 | – | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 77.93 | – | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 Accuracy (%) | 77.68 | – | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 77.5 | – | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.68 | – | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.31 | – | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | – | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | – | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | – | Unverified