Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2151–2200 of 4240 papers

Title	Date	Tasks	Status
Toward Student-Oriented Teacher Network Training For Knowledge Distillation	Jun 14, 2022	Data AugmentationKnowledge Distillation	—Unverified
Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation	Apr 13, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Source-Target Unified Knowledge Distillation for Memory-Efficient Federated Domain Adaptation on Edge Devices	Sep 29, 2021	Domain AdaptationKnowledge Distillation	—Unverified
Space-Time Distillation for Video Super-Resolution	Jun 19, 2021	Knowledge DistillationSuper-Resolution	—Unverified
Sparse Progressive Distillation: Resolving Overfitting under Pretrain-and-Finetune Paradigm	Oct 15, 2021	Knowledge Distillation	—Unverified
Sparse Progressive Distillation: Resolving Overfitting under Pretrain-and-Finetune Paradigm	Nov 16, 2021	Knowledge Distillation	—Unverified
Spatial Knowledge Distillation to aid Visual Reasoning	Dec 10, 2018	DiagnosticKnowledge Distillation	—Unverified
Spatial Likelihood Voting with Self-Knowledge Distillation for Weakly Supervised Object Detection	Apr 14, 2022	Knowledge DistillationMultiple Instance Learning	—Unverified
Spatio-Temporal Attention Mechanism and Knowledge Distillation for Lip Reading	Aug 7, 2021	Audio-Visual Speech RecognitionKnowledge Distillation	—Unverified
Spatio-Temporal Graph for Video Captioning with Knowledge Distillation	Mar 31, 2020	Knowledge DistillationObject	—Unverified
Spatiotemporal Knowledge Distillation for Efficient Estimation of Aerial Video Saliency	Apr 10, 2019	GPUKnowledge Distillation	—Unverified
Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling	Oct 15, 2024	Instruction FollowingKnowledge Distillation	—Unverified
Speech Emotion: Investigating Model Representations, Multi-Task Learning and Knowledge Distillation	Jul 2, 2022	Knowledge DistillationMulti-Task Learning	—Unverified
Speech Emotion Recognition with Distilled Prosodic and Linguistic Affect Representations	Sep 9, 2023	Emotion RecognitionKnowledge Distillation	—Unverified
Speech Translation with Foundation Models and Optimal Transport: UPC at IWSLT23	Jun 2, 2023	Knowledge DistillationMachine Translation	—Unverified
Spiking CenterNet: A Distillation-boosted Spiking Neural Network for Object Detection	Feb 2, 2024	DecoderKnowledge Distillation	—Unverified
Spirit Distillation: A Model Compression Method with Multi-domain Knowledge Transfer	Apr 29, 2021	General KnowledgeKnowledge Distillation	—Unverified
Spirit Distillation: Precise Real-time Semantic Segmentation of Road Scenes with Insufficient Data	Mar 25, 2021	Autonomous DrivingFew-Shot Learning	—Unverified
Split Knowledge Distillation for Large Models in IoT: Architecture, Challenges, and Solutions	Dec 17, 2024	Knowledge DistillationManagement	—Unverified
Squeezing nnU-Nets with Knowledge Distillation for On-Board Cloud Detection	Jun 16, 2023	Cloud DetectionKnowledge Distillation	—Unverified
SRIL: Selective Regularization for Class-Incremental Learning	May 9, 2023	class-incremental learningClass Incremental Learning	—Unverified
SSKD: Self-Supervised Knowledge Distillation for Cross Domain Adaptive Person Re-Identification	Sep 13, 2020	ClusteringDomain Adaptive Person Re-Identification	—Unverified
SSMTL++: Revisiting Self-Supervised Multi-Task Learning for Video Anomaly Detection	Jul 16, 2022	Anomaly DetectionKnowledge Distillation	—Unverified
SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning	May 18, 2025	Knowledge DistillationSpatial Reasoning	—Unverified
Stacked Acoustic-and-Textual Encoding: Integrating the Pre-trained Models into Speech Translation Encoders	May 12, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Static Word Embeddings for Sentence Semantic Representation	Jun 5, 2025	Contrastive LearningKnowledge Distillation	—Unverified
Stealing Neural Networks via Timing Side Channels	Dec 31, 2018	Knowledge DistillationReinforcement Learning	—Unverified
Step Out and Seek Around: On Warm-Start Training with Incremental Data	Jun 6, 2024	Autonomous DrivingKnowledge Distillation	—Unverified
Stereo-Knowledge Distillation from dpMV to Dual Pixels for Light Field Video Reconstruction	May 20, 2024	Autonomous DrivingKnowledge Distillation	—Unverified
Stereo-Matching Knowledge Distilled Monocular Depth Estimation Filtered by Multiple Disparity Consistency	Jan 22, 2024	Depth EstimationKnowledge Distillation	—Unverified
STEVE Series: Step-by-Step Construction of Agent Systems in Minecraft	Jun 17, 2024	Knowledge DistillationLanguage Modeling	—Unverified
Stingy Teacher: Sparse Logits Suffice to Fail Knowledge Distillation	Sep 29, 2021	Knowledge Distillation	—Unverified
Stochastic Precision Ensemble: Self-Knowledge Distillation for Quantized Deep Neural Networks	Sep 30, 2020	image-classificationImage Classification	—Unverified
Strategic Fusion Optimizes Transformer Compression	Jan 5, 2025	Knowledge DistillationModel Compression	—Unverified
Streaming egocentric action anticipation: An evaluation scheme and approach	Jun 29, 2023	Action AnticipationKnowledge Distillation	—Unverified
Streaming Transformer ASR with Blockwise Synchronous Inference	Jun 25, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Structural and Statistical Texture Knowledge Distillation for Semantic Segmentation	May 6, 2023	Knowledge DistillationQuantization	—Unverified
Structural Knowledge Distillation for Object Detection	Nov 23, 2022	Feature ImportanceKnowledge Distillation	—Unverified
Structural Teacher-Student Normality Learning for Multi-Class Anomaly Detection and Localization	Feb 27, 2024	Anomaly DetectionKnowledge Distillation	—Unverified
Structure Aware Incremental Learning with Personalized Imitation Weights for Recommender Systems	May 2, 2023	Incremental LearningKnowledge Distillation	—Unverified
Structure-Centric Robust Monocular Depth Estimation via Knowledge Distillation	Oct 9, 2024	Depth EstimationKnowledge Distillation	—Unverified
Structured Knowledge Distillation Towards Efficient and Compact Multi-View 3D Detection	Nov 14, 2022	Knowledge Distillation	—Unverified
Structured Pruning of Neural Networks with Budget-Aware Regularization	Nov 23, 2018	Knowledge Distillation	—Unverified
StructVPR: Distill Structural Knowledge with Weighting Samples for Visual Place Recognition	Dec 2, 2022	Image RetrievalKnowledge Distillation	—Unverified
Student as an Inherent Denoiser of Noisy Teacher	Dec 15, 2023	Knowledge DistillationLanguage Modeling	—Unverified
Student Customized Knowledge Distillation: Bridging the Gap Between Student and Teacher	Jan 1, 2021	image-classificationImage Classification	—Unverified
Student-friendly Knowledge Distillation	May 18, 2023	Knowledge Distillation	—Unverified
Student Network Learning via Evolutionary Knowledge Distillation	Mar 23, 2021	Knowledge DistillationTransfer Learning	—Unverified
Student-Oriented Teacher Knowledge Refinement for Knowledge Distillation	Sep 27, 2024	Knowledge DistillationTransfer Learning	—Unverified
Students Parrot Their Teachers: Membership Inference on Model Distillation	Mar 6, 2023	Knowledge Distillation	—Unverified

Show:10 25 50

← PrevPage 44 of 85Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified