Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2001–2025 of 4240 papers

Title	Date	Tasks	Status
Efficient Knowledge Distillation of SAM for Medical Image Segmentation	Jan 28, 2025	Computational EfficiencyDecoder	—Unverified
Collective Wisdom: Improving Low-resource Neural Machine Translation using Adaptive Knowledge Distillation	Oct 12, 2020	Knowledge DistillationLow Resource Neural Machine Translation	—Unverified
INDUS: Effective and Efficient Language Models for Scientific Applications	May 17, 2024	Contrastive LearningInformation Retrieval	—Unverified
Industry Scale Semi-Supervised Learning for Natural Language Understanding	Mar 29, 2021	intent-classificationIntent Classification	—Unverified
InfantCryNet: A Data-driven Framework for Intelligent Analysis of Infant Cries	Sep 29, 2024	Knowledge DistillationModel Compression	—Unverified
InFiConD: Interactive No-code Fine-tuning with Concept-based Knowledge Distillation	Jun 25, 2024	Knowledge Distillation	—Unverified
Efficient Knowledge Distillation: Empowering Small Language Models with Teacher Model Insights	Sep 19, 2024	Decision MakingKnowledge Distillation	—Unverified
Information-Theoretic GAN Compression with Variational Energy-based Model	Mar 28, 2023	Image EnhancementKnowledge Distillation	—Unverified
Collective Knowledge Graph Completion with Mutual Knowledge Distillation	May 25, 2023	Knowledge DistillationKnowledge Graph Completion	—Unverified
Efficient Intent-Based Filtering for Multi-Party Conversations Using Knowledge Distillation from LLMs	Mar 21, 2025	intent-classificationIntent Classification	—Unverified
A Survey on Green Deep Learning	Nov 8, 2021	Deep LearningKnowledge Distillation	—Unverified
Efficient Inference via Universal LSH Kernel	Jun 21, 2021	Knowledge DistillationQuantization	—Unverified
InhibiDistilbert: Knowledge Distillation for a ReLU and Addition-based Transformer	Mar 20, 2025	Knowledge DistillationModel Compression	—Unverified
Initial Classifier Weights Replay for Memoryless Class Incremental Learning	Aug 31, 2020	Allclass-incremental learning	—Unverified
Efficient Image Compression Using Advanced State Space Models	Sep 4, 2024	Computational EfficiencyImage Compression	—Unverified
Knowledge as Priors: Cross-Modal Knowledge Generalization for Datasets without Superior Knowledge	Apr 1, 2020	3D Hand Pose EstimationHand Pose Estimation	—Unverified
Knowledge Concentration: Learning 100K Object Classifiers in a Single CNN	Nov 21, 2017	General Classificationimage-classification	—Unverified
Injecting Spatial Information for Monaural Speech Enhancement via Knowledge Distillation	Dec 2, 2022	Knowledge DistillationSpeech Enhancement	—Unverified
Inplace knowledge distillation with teacher assistant for improved training of flexible deep neural networks	May 18, 2021	image-classificationImage Classification	—Unverified
In-situ animal behavior classification using knowledge distillation and fixed-point quantization	Sep 9, 2022	ClassificationKnowledge Distillation	—Unverified
Instance-aware Model Ensemble With Distillation For Unsupervised Domain Adaptation	Nov 15, 2022	Domain AdaptationKnowledge Distillation	—Unverified
Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning	Apr 15, 2025	Knowledge DistillationLanguage Modeling	—Unverified
Efficient Gravitational Wave Parameter Estimation via Knowledge Distillation: A ResNet1D-IAF Approach	Dec 11, 2024	AstronomyComputational Efficiency	—Unverified
Distill, Adapt, Distill: Training Small, In-Domain Models for Neural Machine Translation	Mar 5, 2020	Domain AdaptationKnowledge Distillation	—Unverified
CoLLD: Contrastive Layer-to-layer Distillation for Compressing Multilingual Pre-trained Speech Encoders	Sep 14, 2023	Contrastive LearningKnowledge Distillation	—Unverified

Show:10 25 50

← PrevPage 81 of 170Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified