Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2176–2200 of 4240 papers

Title	Date	Tasks	Status
Embedded Knowledge Distillation in Depth-Level Dynamic Neural Network	Mar 1, 2021	Dynamic neural networksKnowledge Distillation	—Unverified
Embedding Compression for Teacher-to-Student Knowledge Transfer	Feb 9, 2024	Knowledge DistillationTransfer Learning	—Unverified
EmbedDistill: A Geometric Knowledge Distillation for Information Retrieval	Jan 27, 2023	Information RetrievalKnowledge Distillation	—Unverified
Embracing the Dark Knowledge: Domain Generalization Using Regularized Knowledge Distillation	Jul 6, 2021	Domain Generalizationimage-classification	—Unverified
Emo Pillars: Knowledge Distillation to Support Fine-Grained Context-Aware and Context-Less Emotion Classification	Apr 23, 2025	Emotion ClassificationGPU	—Unverified
Knowledge distillation for optimization of quantized deep neural networks	Sep 4, 2019	Knowledge Distillation	—Unverified
Empirical Evaluation of Knowledge Distillation from Transformers to Subquadratic Language Models	Apr 19, 2025	Knowledge DistillationState Space Models	—Unverified
Empowering Dual-Encoder with Query Generator for Cross-Lingual Dense Retrieval	Mar 27, 2023	Knowledge DistillationRetrieval	—Unverified
Empowering Knowledge Distillation via Open Set Recognition for Robust 3D Point Cloud Classification	Oct 25, 2020	3D Point Cloud ClassificationGeneral Classification	—Unverified
Enabling Multimodal Generation on CLIP via Vision-Language Knowledge Distillation	Nov 16, 2021	Image CaptioningKnowledge Distillation	—Unverified
Enabling Multimodal Generation on CLIP via Vision-Language Knowledge Distillation	Mar 12, 2022	Image CaptioningKnowledge Distillation	—Unverified
Enabling Weak Client Participation via On-device Knowledge Distillation in Heterogenous Federated Learning	Mar 14, 2025	Federated LearningKnowledge Distillation	—Unverified
EncodeNet: A Framework for Boosting DNN Accuracy with Entropy-driven Generalized Converting Autoencoder	Apr 21, 2024	image-classificationImage Classification	—Unverified
Endpoints Weight Fusion for Class Incremental Semantic Segmentation	Jan 1, 2023	class-incremental learningClass Incremental Learning	—Unverified
End-to-End Automatic Speech Recognition with Deep Mutual Learning	Feb 16, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
End-to-end fully-binarized network design: from Generic Learned Thermometer to Block Pruning	May 5, 2025	Knowledge DistillationQuantization	—Unverified
End-to-End Simultaneous Speech Translation with Pretraining and Distillation: Huawei Noah’s System for AutoSimTranS 2022	Jul 1, 2022	DecoderKnowledge Distillation	—Unverified
End-to-End Speech Translation with Knowledge Distillation	Apr 17, 2019	Knowledge Distillationspeech-recognition	—Unverified
End-to-End Speech-Translation with Knowledge Distillation: FBK@IWSLT2020	Jun 4, 2020	Data AugmentationKnowledge Distillation	—Unverified
Energy-efficient Knowledge Distillation for Spiking Neural Networks	Jun 14, 2021	Knowledge DistillationModel Compression	—Unverified
Enhanced Multimodal Representation Learning with Cross-modal KD	Jun 13, 2023	Contrastive LearningEmotion Classification	—Unverified
Enhanced Sparsification via Stimulative Training	Mar 11, 2024	Knowledge DistillationModel Compression	—Unverified
Enhancing Abstractiveness of Summarization Models through Calibrated Distillation	Oct 20, 2023	Abstractive Text SummarizationInformativeness	—Unverified
Enhancing Accuracy and Parameter-Efficiency of Neural Representations for Network Parameterization	Jun 29, 2024	Knowledge Distillation	—Unverified
Enhancing Action Recognition from Low-Quality Skeleton Data via Part-Level Knowledge Distillation	Apr 28, 2024	Action RecognitionGeneral Knowledge	—Unverified

Show:10 25 50

← PrevPage 88 of 170Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified