
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have a higher knowledge capacity than small models, that capacity may not be fully utilized; a compact student trained to mimic the large teacher's outputs can therefore often recover much of the teacher's accuracy at a fraction of the inference cost.
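A common concrete instantiation is the soft-target formulation of Hinton et al. (2015): the student is trained on a weighted sum of the ordinary cross-entropy loss and a KL-divergence term that pulls its temperature-softened predictions toward the teacher's. The PyTorch sketch below is a minimal illustration of that loss; the function name and the `temperature`/`alpha` defaults are illustrative choices, not taken from any paper listed on this page.

```python
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.5):
    """Soft-target KD loss (Hinton et al., 2015), as a sketch.

    alpha balances the hard-label cross-entropy against the
    KL divergence to the temperature-softened teacher.
    """
    # Hard-label term: ordinary cross-entropy on the ground truth.
    ce = F.cross_entropy(student_logits, labels)

    # Soft-label term: KL(teacher || student) at temperature T.
    log_p_student = F.log_softmax(student_logits / temperature, dim=-1)
    p_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    kd = F.kl_div(log_p_student, p_teacher, reduction="batchmean")

    # Scale the soft term by T^2 so its gradient magnitude stays
    # comparable as the temperature changes (per the original paper).
    return alpha * ce + (1.0 - alpha) * kd * temperature ** 2
```

Raising the temperature exposes more of the teacher's "dark knowledge" (the relative probabilities it assigns to wrong classes), which is exactly the signal a hard one-hot label cannot provide.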

Papers

Showing 2401–2425 of 4240 papers

| Title | Status | Hype |
| --- | --- | --- |
| Understanding and Improving Knowledge Distillation | | 0 |
| Understanding and Improving Lexical Choice in Non-Autoregressive Translation | | 0 |
| Understanding Knowledge Distillation | | 0 |
| Understanding Knowledge Distillation in Non-autoregressive Machine Translation | | 0 |
| Understanding the Effect of Data Augmentation on Knowledge Distillation | | 0 |
| Understanding the Gains from Repeated Self-Distillation | | 0 |
| Understanding the Overfitting of the Episodic Meta-training | | 0 |
| Understanding the Success of Knowledge Distillation -- A Data Augmentation Perspective | | 0 |
| UNDO: Understanding Distillation as Optimization | | 0 |
| UniCompress: Enhancing Multi-Data Medical Image Compression with Knowledge Distillation | | 0 |
| UNIDEAL: Curriculum Knowledge Distillation Federated Learning | | 0 |
| Unified and Effective Ensemble Knowledge Distillation | | 0 |
| Unified Anomaly Detection methods on Edge Device using Knowledge Distillation and Quantization | | 0 |
| Unified Attacks to Large Language Model Watermarks: Spoofing and Scrubbing in Unauthorized Knowledge Distillation | | 0 |
| Unified Locomotion Transformer with Simultaneous Sim-to-Real Transfer for Quadrupeds | | 0 |
| UniKD: Universal Knowledge Distillation for Mimicking Homogeneous or Heterogeneous Object Detectors | | 0 |
| Unimodal-driven Distillation in Multimodal Emotion Recognition with Dynamic Fusion | | 0 |
| UniMS: A Unified Framework for Multimodal Summarization with Knowledge Distillation | | 0 |
| Uni-Retriever: Towards Learning The Unified Embedding Based Retriever in Bing Sponsored Search | | 0 |
| Dual-mode ASR: Unify and Improve Streaming ASR with Full-context Modeling | | 0 |
| Universal-KD: Attention-based Output-Grounded Intermediate Layer Knowledge Distillation | | 0 |
| Unlabeled Data Deployment for Classification of Diabetic Retinopathy Images Using Knowledge Transfer | | 0 |
| Unlearning Clients, Features and Samples in Vertical Federated Learning | | 0 |
| Unlearning via Sparse Representations | | 0 |
| Unleashing the Potential of Mamba: Boosting a LiDAR 3D Sparse Detector by Using Cross-Model Knowledge Distillation | | 0 |
Page 97 of 170

Benchmark Results

In the tables below, "T:" denotes the teacher model and "S:" the student model; "Claimed" is the figure reported by the authors.

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | | Unverified |
| 2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | | Unverified |
| 3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | | Unverified |
| 4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | | Unverified |
| 5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | | Unverified |
| 6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | | Unverified |
| 7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | | Unverified |
| 8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | | Unverified |
| 9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | | Unverified |
| 10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | | Unverified |
| 2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | | Unverified |
| 3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | | Unverified |
| 4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | | Unverified |
| 5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | | Unverified |
| 6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | | Unverified |
| 7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | | Unverified |
| 8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | | Unverified |
| 9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | | Unverified |
| 10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | | Unverified |
| 2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | | Unverified |