Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized. By training a compact student to mimic a larger teacher's outputs, distillation often preserves most of the teacher's accuracy at a fraction of the inference cost.
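
As a concrete illustration, most methods listed below build on the classic soft-target objective of Hinton et al. (2015): the student is trained to match the teacher's temperature-softened output distribution alongside the usual hard labels. Here is a minimal PyTorch sketch; the temperature T and mixing weight alpha are illustrative defaults, not values taken from any particular paper:

```python
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Soft-target distillation loss mixed with ordinary cross-entropy."""
    # KL divergence between temperature-softened teacher and student
    # distributions; the T**2 factor keeps gradient magnitudes comparable
    # across temperatures (Hinton et al., 2015).
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    # Standard cross-entropy against the ground-truth labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard
```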

Papers

Showing 3651–3675 of 4240 papers

Title | Status | Hype
Improved training of binary networks for human pose estimation and image recognition | – | 0
Improve Knowledge Distillation via Label Revision and Data Selection | – | 0
Improving Acoustic Scene Classification in Low-Resource Conditions | – | 0
Improving Apple Object Detection with Occlusion-Enhanced Distillation | – | 0
Improving Autoregressive NMT with Non-Autoregressive Model | – | 0
Improving CLIP Robustness with Knowledge Distillation and Self-Training | – | 0
Improving Cone-Beam CT Image Quality with Knowledge Distillation-Enhanced Diffusion Model in Imbalanced Data Settings | – | 0
Improving Conversational Abilities of Quantized Large Language Models via Direct Preference Alignment | – | 0
Improving Defensive Distillation using Teacher Assistant | – | 0
Improving De-Raining Generalization via Neural Reorganization | – | 0
Improving Facial Landmark Detection Accuracy and Efficiency with Knowledge Distillation | – | 0
Improving Feature Generalizability with Multitask Learning in Class Incremental Learning | – | 0
Improving Frame-level Classifier for Word Timings with Non-peaky CTC in End-to-End Automatic Speech Recognition | – | 0
Noise as a Resource for Learning in Knowledge Distillation | – | 0
Improving Generalization of Pre-trained Language Models via Stochastic Weight Averaging | – | 0
Improving Knowledge Distillation for BERT Models: Loss Functions, Mapping Methods, and Weight Tuning | – | 0
Improving Knowledge Distillation in Transfer Learning with Layer-wise Learning Rates | – | 0
Improving Knowledge Distillation with Teacher's Explanation | – | 0
Confidence-aware Self-Semantic Distillation on Knowledge Graph Embedding | – | 0
Improving Mathematical Reasoning Capabilities of Small Language Models via Feedback-Driven Distillation | – | 0
Improving Multi-Task Deep Neural Networks via Knowledge Distillation for Natural Language Understanding | – | 0
Improving Neural Machine Translation by Denoising Training | – | 0
Improving Neural ODEs via Knowledge Distillation | – | 0
Improving Non-autoregressive Neural Machine Translation with Monolingual Data | – | 0
Improving Pronunciation and Accent Conversion through Knowledge Distillation And Synthetic Ground-Truth from Native TTS | – | 0
Page 147 of 170

Benchmark Results
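
In the tables below, T: denotes the teacher model and S: the student model in each distilled pair.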

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | – | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | – | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | – | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | – | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | – | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | – | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | – | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | – | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | – | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | – | Unverified
2 | shufflenet-v2 (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | – | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | – | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | – | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | – | Unverified
6 | ReviewKD++ (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | – | Unverified
7 | ReviewKD++ (T: resnet32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | – | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | – | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | – | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | – | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | – | Unverified
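
Every T:/S: entry above follows the same training pattern: a frozen, pretrained teacher supervises a trainable student. A minimal sketch of one such training step, reusing the loss mixture from the earlier snippet; the resnet50/resnet18 pairing and all hyperparameters here are illustrative stand-ins, not a configuration from the tables:

```python
import torch
import torch.nn.functional as F
from torchvision.models import resnet18, resnet50

# Hypothetical pair (T: resnet50, S: resnet18); any teacher/student works.
teacher = resnet50(weights="IMAGENET1K_V2").eval()   # frozen teacher
student = resnet18(num_classes=1000)                 # trainable student
optimizer = torch.optim.AdamW(student.parameters(), lr=1e-4)

def train_step(images, labels, T=4.0, alpha=0.9):
    with torch.no_grad():            # never backpropagate through the teacher
        teacher_logits = teacher(images)
    student_logits = student(images)
    # Same soft/hard loss mixture as distillation_loss above.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    hard = F.cross_entropy(student_logits, labels)
    loss = alpha * soft + (1 - alpha) * hard
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```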