Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4101–4125 of 4240 papers

Title	Date	Tasks	Status
Attend, Distill, Detect: Attention-aware Entropy Distillation for Anomaly Detection	May 10, 2024	Anomaly DetectionKnowledge Distillation	CodeCode Available
FAKD: Feature Augmented Knowledge Distillation for Semantic Segmentation	Aug 30, 2022	Knowledge DistillationSegmentation	CodeCode Available
On the Surprising Efficacy of Distillation as an Alternative to Pre-Training Small Models	Apr 4, 2024	Contrastive LearningKnowledge Distillation	CodeCode Available
On the Transferability of Visual Features in Generalized Zero-Shot Learning	Nov 22, 2022	Generalized Zero-Shot LearningKnowledge Distillation	CodeCode Available
A Teacher-Free Graph Knowledge Distillation Framework with Dual Self-Distillation	Mar 6, 2024	Knowledge Distillation	CodeCode Available
On the Use of External Data for Spoken Named Entity Recognition	Dec 14, 2021	Knowledge Distillationnamed-entity-recognition	CodeCode Available
OpenGrok: Enhancing SNS Data Processing with Distilled Knowledge and Mask-like Mechanisms	Feb 11, 2025	Knowledge DistillationMMLU	CodeCode Available
Dataset Distillation via Knowledge Distillation: Towards Efficient Self-Supervised Pre-Training of Deep Networks	Oct 3, 2024	Dataset DistillationKnowledge Distillation	CodeCode Available
Born Again Neural Networks	May 12, 2018	Image ClassificationKnowledge Distillation	CodeCode Available
Data-Free Knowledge Distillation for Image Super-Resolution	Jun 19, 2021	Data-free Knowledge DistillationImage Super-Resolution	CodeCode Available
Faithful Label-free Knowledge Distillation	Nov 22, 2024	Inductive BiasKnowledge Distillation	CodeCode Available
Data-free Knowledge Distillation for Fine-grained Visual Categorization	Apr 18, 2024	Data-free Knowledge DistillationFine-Grained Visual Categorization	CodeCode Available
Self-Attentive Spatio-Temporal Calibration for Precise Intermediate Layer Matching in ANN-to-SNN Distillation	Jan 14, 2025	Knowledge Distillation	CodeCode Available
Fairness without Demographics through Knowledge Distillation	Nov 1, 2022	FairnessKnowledge Distillation	CodeCode Available
Towards Real-time Video Compressive Sensing on Mobile Devices	Aug 14, 2024	Compressive SensingKnowledge Distillation	CodeCode Available
Boosting Summarization with Normalizing Flows and Aggressive Training	Nov 1, 2023	DecoderKnowledge Distillation	CodeCode Available
Self-Distillation for Gaussian Process Regression and Classification	Apr 5, 2023	ClassificationGPR	CodeCode Available
Data-free Knowledge Distillation for Segmentation using Data-Enriching GAN	Nov 2, 2020	Data-free Knowledge DistillationDiversity	CodeCode Available
Optimal Transport Guided Correlation Assignment for Multimodal Entity Linking	Jun 4, 2024	Entity LinkingKnowledge Distillation	CodeCode Available
Teacher Agent: A Knowledge Distillation-Free Framework for Rehearsal-based Video Incremental Learning	Jun 1, 2023	Incremental LearningKnowledge Distillation	CodeCode Available
Facilitating Pornographic Text Detection for Open-Domain Dialogue Systems via Knowledge Distillation of Large Language Models	Mar 20, 2024	ChatbotKnowledge Distillation	CodeCode Available
Facilitating NSFW Text Detection in Open-Domain Dialogue Systems via Knowledge Distillation	Sep 18, 2023	ChatbotKnowledge Distillation	CodeCode Available
Optimizing edge AI models on HPC systems with the edge in the loop	May 26, 2025	Hardware Aware Neural Architecture SearchKnowledge Distillation	CodeCode Available
Teacher as a Lenient Expert: Teacher-Agnostic Data-Free Knowledge Distillation	Feb 18, 2024	Data-free Knowledge DistillationKnowledge Distillation	CodeCode Available
Teacher Intervention: Improving Convergence of Quantization Aware Training for Ultra-Low Precision Transformers	Feb 23, 2023	Knowledge DistillationQuantization	CodeCode Available

Show:10 25 50

← PrevPage 165 of 170Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified