Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity is often not fully utilized, so a compact "student" model can frequently be trained to approach the performance of the large "teacher". In the standard formulation, the student learns to match the teacher's temperature-softened output distribution in addition to the ground-truth labels.
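
As a concrete illustration, the classic soft-label recipe of Hinton et al. (2015) mixes a KL-divergence term on softened teacher/student distributions with ordinary cross-entropy on hard labels. The PyTorch sketch below is a minimal illustration of that loss, not the method of any particular paper on this page; the temperature `T`, the mixing weight `alpha`, and the `teacher`/`student` modules referenced in the usage comment are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Soft-label knowledge distillation loss (Hinton et al., 2015).

    Mixes a KL-divergence term between temperature-softened teacher and
    student distributions with cross-entropy on the hard labels.
    T and alpha are illustrative hyperparameters, tuned per task in practice.
    """
    # T > 1 flattens both distributions, exposing the teacher's "dark
    # knowledge" in the relative probabilities of the wrong classes.
    log_p_student = F.log_softmax(student_logits / T, dim=-1)
    p_teacher = F.softmax(teacher_logits / T, dim=-1)
    # The T^2 factor keeps the KL term's gradient magnitude comparable
    # to the cross-entropy term as T varies.
    kd_term = F.kl_div(log_p_student, p_teacher, reduction="batchmean") * (T * T)
    ce_term = F.cross_entropy(student_logits, labels)
    return alpha * kd_term + (1.0 - alpha) * ce_term

# Typical usage: the teacher is frozen and only the student is updated.
#   with torch.no_grad():
#       teacher_logits = teacher(images)
#   loss = distillation_loss(student(images), teacher_logits, labels)
```

Many of the papers listed below replace or augment this logit-matching objective, e.g. with feature-level, relation-preserving, or sequence-level variants, but the teacher-student structure is the same.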

Papers

Showing papers 2651–2700 of 4240 (page 54 of 85)

Title | Status | Hype
Self-Slimming Vision Transformer | | 0
Self-Supervised Generative Adversarial Compression | | 0
Self-Supervised Keypoint Detection with Distilled Depth Keypoint Representation | | 0
Self-supervised Models are Good Teaching Assistants for Vision Transformers | | 0
Self-supervised Monocular Depth Estimation Robust to Reflective Surface Leveraged by Triplet Mining | | 0
Self-supervised Reflective Learning through Self-distillation and Online Clustering for Speaker Representation Learning | | 0
Self-Supervised Representation Learning with Cross-Context Learning between Global and Hypercolumn Features | | 0
SAIL: Self-Augmented Graph Contrastive Learning | | 0
Self-supervised Video Object Segmentation with Distillation Learning of Deformable Attention | | 0
Self-Training and Multi-Task Learning for Limited Data: Evaluation Study on Object Detection | | 0
Self-Updatable Large Language Models with Parameter Integration | | 0
Semantically-Aware Game Image Quality Assessment | | 0
Semantically-Conditioned Negative Samples for Efficient Contrastive Learning | | 0
Semantic-aware Knowledge Distillation for Few-Shot Class-Incremental Learning | | 0
Semantic Objective Functions: A distribution-aware method for adding logical constraints in deep learning | | 0
Rapid Bone Scintigraphy Enhancement via Semantic Prior Distillation from Segment Anything Model | | 0
Semantic Relation Preserving Knowledge Distillation for Image-to-Image Translation | | 0
Semi-supervised Acoustic Event Detection based on tri-training | | 0
Semi-Supervised Bone Marrow Lesion Detection from Knee MRI Segmentation Using Mask Inpainting Models | | 0
VFed-SSD: Towards Practical Vertical Federated Advertising | | 0
Semi-Supervised Learning for Multi-Label Cardiovascular Diseases Prediction: A Multi-Dataset Study | | 0
Semi-supervised object detection based on single-stage detector for thighbone fracture localization | | 0
Semi-supervised ViT knowledge distillation network with style transfer normalization for colorectal liver metastases survival prediction | | 0
Semi-Supervising Learning, Transfer Learning, and Knowledge Distillation with SimCLR | | 0
Semi-UFormer: Semi-supervised Uncertainty-aware Transformer for Image Dehazing | | 0
Sentence Embeddings by Ensemble Distillation | | 0
Sentence-Level or Token-Level? A Comprehensive Study on Knowledge Distillation | | 0
Sentiment Interpretable Logic Tensor Network for Aspect-Term Sentiment Analysis | | 0
SepALM: Audio Language Models Are Error Correctors for Robust Speech Separation | | 0
Separating Novel Features for Logical Anomaly Detection: A Straightforward yet Effective Approach | | 0
SeqPATE: Differentially Private Text Generation via Knowledge Distillation | | 0
Sequence-Level Knowledge Distillation for Model Compression of Attention-based Sequence-to-Sequence Speech Recognition | | 0
Sequence-Level Knowledge Distillation for Class-Incremental End-to-End Spoken Language Understanding | | 0
Sequential Editing for Lifelong Training of Speech Recognition Models | | 0
Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network | | 0
SFedKD: Sequential Federated Learning with Discrepancy-Aware Multi-Teacher Knowledge Distillation | | 0
Shape-Net: Room Layout Estimation from Panoramic Images Robust to Occlusion using Knowledge Distillation with 3D Shapes as Additional Inputs | | 0
Shared Growth of Graph Neural Networks via Prompted Free-direction Knowledge Distillation | | 0
Shoggoth: Towards Efficient Edge-Cloud Collaborative Real-Time Video Inference via Adaptive Online Learning | | 0
Siamese Sleep Transformer For Robust Sleep Stage Scoring With Self-knowledge Distillation and Selective Batch Sampling | | 0
SIGN: Spatial-information Incorporated Generative Network for Generalized Zero-shot Semantic Segmentation | | 0
Similarity of Neural Architectures using Adversarial Attack Transferability | | 0
Similarity-Preserving Knowledge Distillation | | 0
Similarity Transfer for Knowledge Distillation | | 0
Simple Regularisation for Uncertainty-Aware Knowledge Distillation | | 0
Simple Unsupervised Knowledge Distillation With Space Similarity | | 0
Simplification Is All You Need against Out-of-Distribution Overconfidence | | 0
Simplifying CLIP: Unleashing the Power of Large-Scale Models on Consumer-level Computers | | 0
Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a Survey | | 0
SimulSpeech: End-to-End Simultaneous Speech to Text Translation | | 0

Benchmark Results

In the Model column, "T:" denotes the teacher network and "S:" the student network distilled from it.

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: ResNet-32x4, S: ShuffleNet-v2) | Top-1 accuracy (%) | 79.86 | | Unverified
2 | ShuffleNet-v2 (T: ResNet-32x4, S: ShuffleNet-v2) | Top-1 accuracy (%) | 78.76 | | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: ResNet-50) | Top-1 accuracy (%) | 78.6 | | Unverified
4 | ResNet-8x4 (T: ResNet-32x4, S: ResNet-8x4) | Top-1 accuracy (%) | 78.28 | | Unverified
5 | ResNet-8x4 (T: ResNet-32x4, S: ResNet-8x4 [modified]) | Top-1 accuracy (%) | 78.08 | | Unverified
6 | ReviewKD++ (T: ResNet-32x4, S: ShuffleNet-v2) | Top-1 accuracy (%) | 77.93 | | Unverified
7 | ReviewKD++ (T: ResNet-32x4, S: ShuffleNet-v1) | Top-1 accuracy (%) | 77.68 | | Unverified
8 | ResNet-8x4 (T: ResNet-32x4, S: ResNet-8x4) | Top-1 accuracy (%) | 77.5 | | Unverified
9 | ResNet-8x4 (T: ResNet-32x4, S: ResNet-8x4) | Top-1 accuracy (%) | 76.68 | | Unverified
10 | ResNet-8x4 (T: ResNet-32x4, S: ResNet-8x4) | Top-1 accuracy (%) | 76.31 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet-101, S: ResNet-50) | mAP | 93.17 | | Unverified
2 | LSHFM (T: ResNet-101, S: MobileNetV2) | mAP | 90.14 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: AdaBins, S: MobileNetV2) | RMSE | 2.43 | | Unverified
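
For reference, the top-1 accuracy reported in the first two tables is simply the fraction of test samples whose highest-scoring class matches the ground-truth label. A minimal PyTorch sketch follows; the `logits` and `labels` tensors are made-up illustrative data, not taken from any benchmark above.

```python
import torch

def top1_accuracy(logits: torch.Tensor, labels: torch.Tensor) -> float:
    """Fraction of samples whose argmax prediction equals the label."""
    preds = logits.argmax(dim=-1)
    return (preds == labels).float().mean().item()

# Illustrative example: 3 samples, 4 classes, 2 correct predictions.
logits = torch.tensor([[2.0, 0.1, 0.0, 0.0],
                       [0.0, 1.5, 0.2, 0.0],
                       [0.3, 0.0, 0.0, 0.9]])
labels = torch.tensor([0, 1, 2])
print(f"{100 * top1_accuracy(logits, labels):.2f}%")  # prints 66.67%
```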