Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2576–2600 of 4240 papers

Title	Date	Tasks	Status
Robust Knowledge Distillation from RNN-T Models With Noisy Training Labels Using Full-Sum Loss	Mar 10, 2023	Knowledge Distillation	—Unverified
Robustly Optimized and Distilled Training for Natural Language Understanding	Mar 16, 2021	Knowledge DistillationMachine Reading Comprehension	—Unverified
Robustness-Reinforced Knowledge Distillation with Correlation Distance and Network Pruning	Nov 23, 2023	Data AugmentationKnowledge Distillation	—Unverified
Robustness to distribution shifts of compressed networks for edge devices	Jan 22, 2024	Knowledge DistillationQuantization	—Unverified
Robust Overfitting may be mitigated by properly learned smoothening	Jan 1, 2021	Knowledge Distillation	—Unverified
Robust & Precise Knowledge Distillation-based Novel Context-Aware Predictor for Disease Detection in Brain and Gastrointestinal	May 9, 2025	Disease PredictionKnowledge Distillation	—Unverified
Role of Mixup in Topological Persistence Based Knowledge Distillation for Wearable Sensor Data	Feb 2, 2025	Data AugmentationKnowledge Distillation	—Unverified
RoSearch: Search for Robust Student Architectures When Distilling Pre-trained Language Models	Jun 7, 2021	Adversarial RobustnessKnowledge Distillation	—Unverified
RoS-KD: A Robust Stochastic Knowledge Distillation Approach for Noisy Medical Imaging	Oct 15, 2022	ClassificationKnowledge Distillation	—Unverified
RTSR: A Real-Time Super-Resolution Model for AV1 Compressed Content	Nov 20, 2024	4kKnowledge Distillation	—Unverified
RW-KD: Sample-wise Loss Terms Re-Weighting for Knowledge Distillation	Nov 1, 2021	Knowledge Distillation	—Unverified
S2HPruner: Soft-to-Hard Distillation Bridges the Discretization Gap in Pruning	Oct 9, 2024	Knowledge Distillation	—Unverified
S2OSC: A Holistic Semi-Supervised Approach for Open Set Classification	Aug 11, 2020	General ClassificationKnowledge Distillation	—Unverified
S2P3: Self-Supervised Polarimetric Pose Prediction	Dec 2, 2023	Knowledge DistillationPose Prediction	—Unverified
Safe Distillation Box	Dec 5, 2021	Knowledge Distillation	—Unverified
SafetyAnalyst: Interpretable, Transparent, and Steerable Safety Moderation for AI Behavior	Oct 22, 2024	Knowledge Distillation	—Unverified
SAM-COD: SAM-guided Unified Framework for Weakly-Supervised Camouflaged Object Detection	Aug 20, 2024	Knowledge Distillationobject-detection	—Unverified
SAM-Guided Masked Token Prediction for 3D Scene Understanding	Oct 16, 2024	3D Object DetectionKnowledge Distillation	—Unverified
SAM-Guided Robust Representation Learning for One-Shot 3D Medical Image Segmentation	Apr 29, 2025	General KnowledgeImage Segmentation	—Unverified
Sample, Translate, Recombine: Leveraging Audio Alignments for Data Augmentation in End-to-end Speech Translation	Mar 16, 2022	Data AugmentationKnowledge Distillation	—Unverified
Sampling to Distill: Knowledge Transfer from Open-World Data	Jul 31, 2023	Data-free Knowledge DistillationKnowledge Distillation	—Unverified
Samsung R&D Institute Poland submission to WAT 2021 Indic Language Multilingual Task	Aug 1, 2021	Domain AdaptationKnowledge Distillation	—Unverified
SC2 Benchmark: Supervised Compression for Split Computing	Mar 16, 2022	Data CompressionEdge-computing	—Unverified
Scalable Collaborative Learning via Representation Sharing	Nov 20, 2022	Federated LearningKnowledge Distillation	—Unverified
Scalable Detection of Salient Entities in News Articles	May 30, 2024	ArticlesKnowledge Distillation	—Unverified

Show:10 25 50

← PrevPage 104 of 170Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified