Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model (the teacher) to a smaller one (the student). Large models, such as very deep neural networks or ensembles of many models, have a higher knowledge capacity than small models, but that capacity is often not fully utilized, which leaves room for a smaller model to reproduce much of the larger model's behavior.
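
The page does not say how the transfer is carried out. One common formulation, assumed here purely for illustration, trains the small model to match the large model's temperature-softened output distribution while also fitting the ground-truth label. The sketch below uses only the Python standard library; the function names and the `temperature` and `alpha` parameters are hypothetical choices, not taken from any paper listed on this page.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, label,
                      temperature=4.0, alpha=0.5):
    """Weighted sum of a soft-target KL term and a hard-label cross-entropy.

    Assumed formulation (illustrative only):
        alpha * T^2 * KL(teacher || student) + (1 - alpha) * CE(student, label)
    """
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    # KL divergence between the softened teacher and student distributions.
    kl = sum(t * math.log(t / s)
             for t, s in zip(p_teacher, p_student) if t > 0)
    # Cross-entropy against the ground-truth class at temperature 1.
    ce = -math.log(softmax(student_logits)[label])
    # The T^2 factor keeps the soft-target term on a comparable scale
    # when the temperature changes.
    return alpha * temperature ** 2 * kl + (1 - alpha) * ce


if __name__ == "__main__":
    teacher = [3.0, 1.0, 0.2]
    print(distillation_loss([2.9, 1.1, 0.2], teacher, label=0))  # student close to teacher
    print(distillation_loss([0.2, 1.0, 3.0], teacher, label=0))  # student far from teacher
```

Under these assumptions, a student whose logits track the teacher's gets a lower loss than one that disagrees with it, and the KL term vanishes when the two sets of logits are identical.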

Papers

Showing 2651–2675 of 4240 papers

Title	Date	Tasks	Status
A Gift From Knowledge Distillation: Fast Optimization, Network Minimization and Transfer Learning	Jul 1, 2017	Knowledge Distillation, Transfer Learning	Unverified
A Good Student is Cooperative and Reliable: CNN-Transformer Collaborative Learning for Semantic Segmentation	Jul 24, 2023	Knowledge Distillation, Semantic Segmentation	Unverified
AI can evolve without labels: self-evolving vision transformer for chest X-ray diagnosis through knowledge distillation	Feb 13, 2022	Deep Learning, Diagnostic	Unverified
AIDE: Agentically Improve Visual Language Model with Domain Experts	Feb 13, 2025	Knowledge Distillation, Language Modeling	Unverified
AI-KD: Adversarial learning and Implicit regularization for self-Knowledge Distillation	Nov 20, 2022	Knowledge Distillation, Self-Knowledge Distillation	Unverified
AirNet: Neural Network Transmission over the Air	May 24, 2021	Knowledge Distillation	Unverified
A Joint Sequential and Relational Model for Frame-Semantic Parsing	Sep 1, 2017	Knowledge Distillation, Machine Translation	Unverified
AKD : Adversarial Knowledge Distillation For Large Language Models Alignment on Coding tasks	May 5, 2025	Code Completion, Code Generation	Unverified
A Knowledge Distillation Approach for Sepsis Outcome Prediction from Multivariate Clinical Time Series	Nov 16, 2023	Knowledge Distillation, Time Series	Unverified
A Knowledge Distillation-Based Backdoor Attack in Federated Learning	Aug 12, 2022	Backdoor Attack, Federated Learning	Unverified
A Knowledge Distillation framework for Multi-Organ Segmentation of Medaka Fish in Tomographic Image	Feb 24, 2023	Computed Tomography (CT), Image Segmentation	Unverified
A Light-weight Deep Learning Model for Remote Sensing Image Classification	Feb 25, 2023	image-classification, Image Classification	Unverified
A Lightweight Domain Adversarial Neural Network Based on Knowledge Distillation for EEG-based Cross-subject Emotion Recognition	May 12, 2023	EEG, Electroencephalogram (EEG)	Unverified
A Lightweight Low-Light Image Enhancement Network via Channel Prior and Gamma Correction	Feb 28, 2024	Image Enhancement, Knowledge Distillation	Unverified
A lightweight network for photovoltaic cell defect detection in electroluminescence images based on neural architecture search and knowledge distillation	Feb 15, 2023	Data Augmentation, Defect Detection	Unverified
AligNART: Non-autoregressive Neural Machine Translation by Jointly Learning to Estimate Alignment and Translate	Sep 14, 2021	Decoder, Knowledge Distillation	Unverified
AlignCap: Aligning Speech Emotion Captioning to Human Preferences	Oct 24, 2024	Knowledge Distillation, Language Modeling	Unverified
Aligned Weight Regularizers for Pruning Pretrained Neural Networks	Nov 16, 2021	Knowledge Distillation, Language Modeling	Unverified
Aligning in a Compact Space: Contrastive Knowledge Distillation between Heterogeneous Architectures	May 28, 2024	Contrastive Learning, Knowledge Distillation	Unverified
Aligning Teacher with Student Preferences for Tailored Training Data Generation	Jun 27, 2024	In-Context Learning, Knowledge Distillation	Unverified
Alignment Knowledge Distillation for Online Streaming Attention-based Speech Recognition	Feb 28, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Alleviating Catastrophic Forgetting of Incremental Object Detection via Within-Class and Between-Class Knowledge Distillation	Jan 1, 2023	Knowledge Distillation, object-detection	Unverified
Alleviating LLM-based Generative Retrieval Hallucination in Alipay Search	Mar 27, 2025	Hallucination, Knowledge Distillation	Unverified
All You Need in Knowledge Distillation Is a Tailored Coordinate System	Dec 12, 2024	All, Few-Shot Learning	Unverified
ALP-KD: Attention-Based Layer Projection for Knowledge Distillation	Dec 27, 2020	Knowledge Distillation	Unverified

Datasets: ImageNet, CIFAR-100, COCO (Common Objects in Context), COCO 2017 val, PASCAL VOC, KITTI

Benchmark Results

ImageNet

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

CIFAR-100

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

COCO (Common Objects in Context)

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

COCO 2017 val

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

PASCAL VOC

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

KITTI

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified