SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized. Distillation therefore trains a compact student model to reproduce the behavior of a large teacher (for example, by matching its softened output distribution), retaining much of the teacher's accuracy at a fraction of the inference cost.
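
The standard recipe (soft-target distillation, Hinton et al., 2015) trains the student on a weighted mix of the usual cross-entropy loss and a KL-divergence term that pulls the student's temperature-softened output distribution toward the teacher's. The sketch below illustrates this in PyTorch; the Linear teacher/student modules, the temperature of 4.0, and the 0.5 weighting are illustrative assumptions rather than settings from any paper listed on this page.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, temperature=4.0, alpha=0.5):
    # Temperature-softened distributions: log-probabilities for the student,
    # probabilities for the teacher, as F.kl_div expects.
    soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    # The T^2 factor keeps the soft-target gradients on the same scale as the
    # hard-label term after the logits are divided by the temperature.
    kd = F.kl_div(soft_student, soft_teacher, reduction="batchmean") * temperature ** 2
    ce = F.cross_entropy(student_logits, labels)
    return alpha * kd + (1.0 - alpha) * ce

# Toy demonstration with random data; the two Linear layers stand in for a
# large pre-trained teacher and a smaller student being trained.
teacher = nn.Linear(32, 10)
student = nn.Linear(32, 10)
x = torch.randn(8, 32)
labels = torch.randint(0, 10, (8,))

with torch.no_grad():        # the teacher is frozen during distillation
    teacher_logits = teacher(x)
student_logits = student(x)

loss = distillation_loss(student_logits, teacher_logits, labels)
loss.backward()              # gradients flow only into the student
```

Most of the methods listed below build on this same teacher-frozen, student-trained setup, typically adding or substituting feature-level or relation-level objectives for the plain logit-matching term.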

Papers

Showing 1726–1750 of 4240 papers

Title | Status | Hype
An Empirical Study of Leveraging Knowledge Distillation for Compressing Multilingual Neural Machine Translation Models | - | 0
A Deep Hierarchical Feature Sparse Framework for Occluded Person Re-Identification | - | 0
Supervised domain adaptation for building extraction from off-nadir aerial images | - | 0
Disentanglement, Visualization and Analysis of Complex Features in DNNs | - | 0
An Empirical Study of Efficient ASR Rescoring with Transformers | - | 0
Bridging Fairness and Environmental Sustainability in Natural Language Processing | - | 0
An Empirical Investigation into the Effect of Parameter Choices in Knowledge Distillation | - | 0
Addressing Bias Through Ensemble Learning and Regularized Fine-Tuning | - | 0
DiReDi: Distillation and Reverse Distillation for AIoT Applications | - | 0
Direct Preference Knowledge Distillation for Large Language Models | - | 0
Bridging Classical and Quantum Machine Learning: Knowledge Transfer From Classical to Quantum Neural Networks Using Knowledge Distillation | - | 0
An Empirical Analysis of the Impact of Data Augmentation on Knowledge Distillation | - | 0
Direct Distillation between Different Domains | - | 0
Direct Alignment of Draft Model for Speculative Decoding with Chat-Fine-Tuned LLMs | - | 0
DiPair: Fast and Accurate Distillation for Trillion-Scale Text Matching and Pair Modeling | - | 0
Bridge the Gap between Past and Future: Siamese Model Optimization for Context-Aware Document Ranking | - | 0
An Efficient Private GPT Never Autoregressively Decodes | - | 0
A Comparative Analysis of Task-Agnostic Distillation Methods for Compressing Transformer Language Models | - | 0
Ground Reaction Force Estimation via Time-aware Knowledge Distillation | - | 0
DILEMMA: Joint LLM Quantization and Distributed LLM Inference Over Edge Computing Systems | - | 0
Breaking the trade-off in personalized speech enhancement with cross-task knowledge distillation | - | 0
DilateQuant: Accurate and Efficient Diffusion Quantization via Weight Dilation | - | 0
Digital Twin-Assisted Knowledge Distillation Framework for Heterogeneous Federated Learning | - | 0
Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs | - | 0
Digging Deeper into CRNN Model in Chinese Text Images Recognition | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T:BEiT-L S:ViT-B/14) | Top-1 accuracy % | 86.43 | - | Unverified
2 | ScaleKD (T:Swin-L S:ViT-B/16) | Top-1 accuracy % | 85.53 | - | Unverified
3 | ScaleKD (T:Swin-L S:ViT-S/16) | Top-1 accuracy % | 83.93 | - | Unverified
4 | ScaleKD (T:Swin-L S:Swin-T) | Top-1 accuracy % | 83.8 | - | Unverified
5 | KD++ (T: regnety-16GF S:ViT-B) | Top-1 accuracy % | 83.6 | - | Unverified
6 | VkD (T:RegNety 160 S:DeiT-S) | Top-1 accuracy % | 82.9 | - | Unverified
7 | SpectralKD (T:Swin-S S:Swin-T) | Top-1 accuracy % | 82.7 | - | Unverified
8 | ScaleKD (T:Swin-L S:ResNet-50) | Top-1 accuracy % | 82.55 | - | Unverified
9 | DiffKD (T:Swin-L S: Swin-T) | Top-1 accuracy % | 82.5 | - | Unverified
10 | DIST (T: Swin-L S: Swin-T) | Top-1 accuracy % | 82.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T:resnet-32x4, S:shufflenet-v2) | Top-1 Accuracy (%) | 79.86 | - | Unverified
2 | shufflenet-v2 (T:resnet-32x4, S:shufflenet-v2) | Top-1 Accuracy (%) | 78.76 | - | Unverified
3 | MV-MR (T: CLIP/ViT-B-16 S: resnet50) | Top-1 Accuracy (%) | 78.6 | - | Unverified
4 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 Accuracy (%) | 78.28 | - | Unverified
5 | resnet8x4 (T: resnet32x4 S: resnet8x4 [modified]) | Top-1 Accuracy (%) | 78.08 | - | Unverified
6 | ReviewKD++ (T:resnet-32x4, S:shufflenet-v2) | Top-1 Accuracy (%) | 77.93 | - | Unverified
7 | ReviewKD++ (T:resnet-32x4, S:shufflenet-v1) | Top-1 Accuracy (%) | 77.68 | - | Unverified
8 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 Accuracy (%) | 77.5 | - | Unverified
9 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 Accuracy (%) | 76.68 | - | Unverified
10 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 Accuracy (%) | 76.31 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101 S: ResNet50) | mAP | 93.17 | - | Unverified
2 | LSHFM (T: ResNet101 S: MobileNetV2) | mAP | 90.14 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins S: MobileNetV2) | RMSE | 2.43 | - | Unverified