
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized; a well-trained student can therefore often recover much of the teacher's accuracy while being far cheaper to evaluate and deploy.
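
The most common instantiation is response-based distillation, where the student is trained to match the teacher's softened output distribution in addition to the ground-truth labels. Below is a minimal sketch of that classic soft-target loss (after Hinton et al., 2015), assuming PyTorch; the teacher, student, images, labels, and optimizer names are placeholders, and the papers and leaderboard entries on this page build many feature- and relation-based variants on top of this basic idea.

```python
# Minimal sketch of soft-target knowledge distillation (Hinton et al., 2015).
# `teacher`, `student`, `images`, `labels`, and `optimizer` are hypothetical placeholders.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Blend cross-entropy on hard labels with a KL term that pushes the
    student's temperature-softened distribution toward the teacher's."""
    soft_teacher = F.log_softmax(teacher_logits / T, dim=-1)
    soft_student = F.log_softmax(student_logits / T, dim=-1)
    # The T^2 factor keeps the soft-target gradients comparable across temperatures.
    kd = F.kl_div(soft_student, soft_teacher, reduction="batchmean", log_target=True) * (T * T)
    ce = F.cross_entropy(student_logits, labels)
    return alpha * kd + (1.0 - alpha) * ce

# Typical training step (teacher frozen, student trainable):
# with torch.no_grad():
#     t_logits = teacher(images)
# loss = distillation_loss(student(images), t_logits, labels)
# loss.backward()
# optimizer.step()
```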

Papers

Showing 3851–3875 of 4240 papers

Title | Status | Hype
Knowledge Distillation of Domain-adapted LLMs for Question-Answering in Telecom |  | 0
Knowledge Distillation of LLM for Automatic Scoring of Science Education Assessments |  | 0
Knowledge Distillation of Transformer-based Language Models Revisited |  | 0
Knowledge Distillation on Graphs: A Survey |  | 0
Knowledge Distillation on Spatial-Temporal Graph Convolutional Network for Traffic Prediction |  | 0
Knowledge Distillation to Ensemble Global and Interpretable Prototype-Based Mammogram Classification Models |  | 0
Knowledge Distillation Transfer Sets and their Impact on Downstream NLU Tasks |  | 0
Knowledge Distillation Under Ideal Joint Classifier Assumption |  | 0
Knowledge Distillation Using Frontier Open-source LLMs: Generalizability and the Role of Synthetic Data |  | 0
Knowledge distillation using unlabeled mismatched images |  | 0
Knowledge distillation via adaptive instance normalization |  | 0
Knowledge Distillation via Instance-level Sequence Learning |  | 0
Knowledge Distillation via Query Selection for Detection Transformer |  | 0
Knowledge distillation via softmax regression representation learning |  | 0
Knowledge Distillation via Token-level Relationship Graph |  | 0
Knowledge Distillation via Weighted Ensemble of Teaching Assistants |  | 0
Knowledge Distillation vs. Pretraining from Scratch under a Fixed (Computation) Budget |  | 0
Knowledge distillation with a class-aware loss for endoscopic disease detection |  | 0
Knowledge Distillation with Adapted Weight |  | 0
Knowledge Distillation with Adaptive Asymmetric Label Sharpening for Semi-supervised Fracture Detection in Chest X-rays |  | 0
Knowledge Distillation with BERT for Image Tag-Based Privacy Prediction |  | 0
Knowledge distillation with error-correcting transfer learning for wind power prediction |  | 0
Knowledge Distillation with Feature Maps for Image Classification |  | 0
Knowledge Distillation with Multi-granularity Mixture of Priors for Image Super-Resolution |  | 0
Knowledge Distillation with Noisy Labels for Natural Language Understanding |  | 0

Benchmark Results

In the model names below, T denotes the teacher and S the student.

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T:BEiT-L S:ViT-B/14) | Top-1 accuracy (%) | 86.43 |  | Unverified
2 | ScaleKD (T:Swin-L S:ViT-B/16) | Top-1 accuracy (%) | 85.53 |  | Unverified
3 | ScaleKD (T:Swin-L S:ViT-S/16) | Top-1 accuracy (%) | 83.93 |  | Unverified
4 | ScaleKD (T:Swin-L S:Swin-T) | Top-1 accuracy (%) | 83.8 |  | Unverified
5 | KD++ (T: regnety-16GF S:ViT-B) | Top-1 accuracy (%) | 83.6 |  | Unverified
6 | VkD (T:RegNety 160 S:DeiT-S) | Top-1 accuracy (%) | 82.9 |  | Unverified
7 | SpectralKD (T:Swin-S S:Swin-T) | Top-1 accuracy (%) | 82.7 |  | Unverified
8 | ScaleKD (T:Swin-L S:ResNet-50) | Top-1 accuracy (%) | 82.55 |  | Unverified
9 | DiffKD (T:Swin-L S: Swin-T) | Top-1 accuracy (%) | 82.5 |  | Unverified
10 | DIST (T: Swin-L S: Swin-T) | Top-1 accuracy (%) | 82.3 |  | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SRD (T:resnet-32x4, S:shufflenet-v2) | Top-1 Accuracy (%) | 79.86 |  | Unverified
2 | shufflenet-v2 (T:resnet-32x4, S:shufflenet-v2) | Top-1 Accuracy (%) | 78.76 |  | Unverified
3 | MV-MR (T: CLIP/ViT-B-16 S: resnet50) | Top-1 Accuracy (%) | 78.6 |  | Unverified
4 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 Accuracy (%) | 78.28 |  | Unverified
5 | resnet8x4 (T: resnet32x4 S: resnet8x4 [modified]) | Top-1 Accuracy (%) | 78.08 |  | Unverified
6 | ReviewKD++ (T:resnet-32x4, S:shufflenet-v2) | Top-1 Accuracy (%) | 77.93 |  | Unverified
7 | ReviewKD++ (T:resnet-32x4, S:shufflenet-v1) | Top-1 Accuracy (%) | 77.68 |  | Unverified
8 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 Accuracy (%) | 77.5 |  | Unverified
9 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 Accuracy (%) | 76.68 |  | Unverified
10 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 Accuracy (%) | 76.31 |  | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101 S: ResNet50) | mAP | 93.17 |  | Unverified
2 | LSHFM (T: ResNet101 S: MobileNetV2) | mAP | 90.14 |  | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins S: MobileNetV2) | RMSE | 2.43 |  | Unverified