Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1926–1950 of 4240 papers

Title	Date	Tasks	Status
Improving Acoustic Scene Classification in Low-Resource Conditions	Dec 30, 2024	Acoustic Scene ClassificationClassification	—Unverified
Efficient Point Cloud Classification via Offline Distillation Framework and Negative-Weight Self-Distillation Technique	Sep 3, 2024	Data AugmentationKnowledge Distillation	—Unverified
Improving Apple Object Detection with Occlusion-Enhanced Distillation	Sep 3, 2024	Knowledge DistillationObject	—Unverified
Improving Autoregressive NMT with Non-Autoregressive Model	Jul 1, 2020	Decoderde-en	—Unverified
Improving CLIP Robustness with Knowledge Distillation and Self-Training	Sep 19, 2023	Knowledge Distillation	—Unverified
Efficient Open-world Reinforcement Learning via Knowledge Distillation and Autonomous Rule Discovery	Nov 24, 2023	Deep Reinforcement LearningKnowledge Distillation	—Unverified
ComKD-CLIP: Comprehensive Knowledge Distillation for Contrastive Language-Image Pre-traning Model	Aug 8, 2024	Contrastive LearningKnowledge Distillation	—Unverified
Improving Conversational Abilities of Quantized Large Language Models via Direct Preference Alignment	Jul 3, 2024	ChatbotComputational Efficiency	—Unverified
DFMSD: Dual Feature Masking Stage-wise Knowledge Distillation for Object Detection	Jul 18, 2024	Knowledge DistillationObject	—Unverified
Improving Defensive Distillation using Teacher Assistant	May 14, 2023	Face RecognitionKnowledge Distillation	—Unverified
Improving De-Raining Generalization via Neural Reorganization	Jan 1, 2021	Knowledge Distillation	—Unverified
Efficient Object Detection in Optical Remote Sensing Imagery via Attention-based Feature Distillation	Oct 28, 2023	Knowledge DistillationObject	—Unverified
CoMBO: Conflict Mitigation via Branched Optimization for Class Incremental Segmentation	Jan 1, 2025	Knowledge DistillationSemantic Segmentation	—Unverified
A Survey on Model Compression for Large Language Models	Aug 15, 2023	BenchmarkingKnowledge Distillation	—Unverified
Improving Facial Landmark Detection Accuracy and Efficiency with Knowledge Distillation	Apr 9, 2024	Emotion RecognitionFacial Landmark Detection	—Unverified
Improving Feature Generalizability with Multitask Learning in Class Incremental Learning	Apr 26, 2022	class-incremental learningClass Incremental Learning	—Unverified
Improving Frame-level Classifier for Word Timings with Non-peaky CTC in End-to-End Automatic Speech Recognition	Jun 9, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Efficient Machine Translation with Model Pruning and Quantization	Nov 1, 2021	CPUDecoder	—Unverified
Noise as a Resource for Learning in Knowledge Distillation	Oct 11, 2019	Knowledge Distillation	—Unverified
Improving Generalization of Pre-trained Language Models via Stochastic Weight Averaging	Dec 12, 2022	Knowledge DistillationQuestion Answering	—Unverified
Improving Knowledge Distillation for BERT Models: Loss Functions, Mapping Methods, and Weight Tuning	Aug 26, 2023	Knowledge DistillationModel Compression	—Unverified
Combining Curriculum Learning and Knowledge Distillation for Dialogue Generation	Nov 1, 2021	Dialogue GenerationKnowledge Distillation	—Unverified
Combining Compressions for Multiplicative Size Scaling on Natural Language Tasks	Aug 20, 2022	Knowledge DistillationNeural Network Compression	—Unverified
ABC-KD: Attention-Based-Compression Knowledge Distillation for Deep Learning-Based Noise Suppression	May 26, 2023	Knowledge Distillation	—Unverified
JEP-KD: Joint-Embedding Predictive Architecture Based Knowledge Distillation for Visual Speech Recognition	Mar 4, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified

Show:10 25 50

← PrevPage 78 of 170Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified