SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.
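In practice, this transfer is most often done by training the small (student) model to match the softened output distribution of the large (teacher) model in addition to the ground-truth labels. Below is a minimal sketch of that classic logit-based distillation loss in PyTorch; the function name, the temperature, and the mixing weight `alpha` are illustrative choices and are not taken from any specific paper listed on this page.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.5):
    """Combine a soft-target KL term (teacher -> student) with the usual
    hard-label cross-entropy, as in classic logit-based distillation."""
    # Soften both distributions with the same temperature.
    soft_targets = F.softmax(teacher_logits / temperature, dim=-1)
    log_student = F.log_softmax(student_logits / temperature, dim=-1)

    # KL divergence between softened teacher and student distributions;
    # the T^2 factor keeps gradient magnitudes comparable across temperatures.
    kd_term = F.kl_div(log_student, soft_targets,
                       reduction="batchmean") * (temperature ** 2)

    # Standard supervised loss on the ground-truth labels.
    ce_term = F.cross_entropy(student_logits, labels)

    return alpha * kd_term + (1.0 - alpha) * ce_term

# Illustrative usage with random tensors standing in for a real batch.
if __name__ == "__main__":
    student_logits = torch.randn(8, 100)   # student outputs
    teacher_logits = torch.randn(8, 100)   # frozen teacher outputs
    labels = torch.randint(0, 100, (8,))
    print(distillation_loss(student_logits, teacher_logits, labels))
```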

Papers

Showing 1751–1775 of 4240 papers

Title | Status | Hype
Improving Knowledge Distillation in Transfer Learning with Layer-wise Learning Rates | | 0
AMD: Automatic Multi-step Distillation of Large-scale Vision Models | | 0
Understanding the Gains from Repeated Self-Distillation | | 0
Fully Fine-tuned CLIP Models are Efficient Few-Shot Learners | | 0
DSMix: Distortion-Induced Sensitivity Map Based Pre-training for No-Reference Image Quality Assessment | Code | 0
Relative Difficulty Distillation for Semantic Segmentation | Code | 0
MLKD-BERT: Multi-level Knowledge Distillation for Pre-trained Language Models | | 0
Accelerated Proton Resonance Frequency-based Magnetic Resonance Thermometry by Optimized Deep Learning Method | Code | 0
Edge AI-Enabled Chicken Health Detection Based on Enhanced FCOS-Lite and Knowledge Distillation | | 0
Improving Conversational Abilities of Quantized Large Language Models via Direct Preference Alignment | | 0
Supporting Cross-language Cross-project Bug Localization Using Pre-trained Language Models | | 0
Unified Anomaly Detection methods on Edge Device using Knowledge Distillation and Quantization | | 0
Adaptive Modality Balanced Online Knowledge Distillation for Brain-Eye-Computer based Dim Object Detection | Code | 0
ECAT: A Entire space Continual and Adaptive Transfer Learning Framework for Cross-Domain Recommendation | | 0
Advancing Compressed Video Action Recognition through Progressive Knowledge Distillation | Code | 0
Survey on Knowledge Distillation for Large Language Models: Methods, Evaluation, and Application | | 0
Self-Cooperation Knowledge Distillation for Novel Class Discovery | | 0
uDistil-Whisper: Label-Free Data Filtering for Knowledge Distillation in Low-Data Regimes | Code | 0
BAPO: Base-Anchored Preference Optimization for Overcoming Forgetting in Large Language Models Personalization | | 0
FANFOLD: Graph Normalizing Flows-driven Asymmetric Network for Unsupervised Graph-Level Anomaly Detection | Code | 0
Enhancing Accuracy and Parameter-Efficiency of Neural Representations for Network Parameterization | | 0
Direct Preference Knowledge Distillation for Large Language Models | | 0
MuGSI: Distilling GNNs with Multi-Granularity Structural Information for Graph Classification | Code | 0
Aligning Teacher with Student Preferences for Tailored Training Data Generation | | 0
Instance Temperature Knowledge Distillation | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | | Unverified