
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized. Distillation therefore trains a compact student model to mimic the outputs of a large teacher, retaining much of the teacher's accuracy at a far lower inference cost.
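
As a concrete illustration, the most common logit-matching formulation trains the student to reproduce the teacher's temperature-softened output distribution while still fitting the ground-truth labels. The sketch below assumes PyTorch; the function name and the defaults for the temperature T and mixing weight alpha are illustrative, not taken from any of the papers listed on this page.

import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.5):
    # Hypothetical sketch of Hinton-style logit distillation.
    # Soften both distributions with temperature T and match them via KL divergence.
    soft_teacher = F.softmax(teacher_logits / T, dim=-1)
    log_soft_student = F.log_softmax(student_logits / T, dim=-1)
    # The T^2 factor keeps gradient magnitudes comparable as T changes.
    kd_term = F.kl_div(log_soft_student, soft_teacher, reduction="batchmean") * (T * T)
    # Ordinary cross-entropy against the hard labels keeps the student grounded.
    ce_term = F.cross_entropy(student_logits, labels)
    return alpha * kd_term + (1.0 - alpha) * ce_term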

Papers

Showing 2526–2550 of 4240 papers (page 102 of 170)

Title | Status | Hype
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA | - | 0
Remembering Transformer for Continual Learning | - | 0
Remining Hard Negatives for Generative Pseudo Labeled Domain Adaptation | - | 0
Remote Sensing Image Classification with Decoupled Knowledge Distillation | - | 0
Removing Rain Streaks via Task Transfer Learning | - | 0
Representation Consolidation from Multiple Expert Teachers | - | 0
Representation Disparity-aware Distillation for 3D Object Detection | - | 0
Representation Transfer by Optimal Transport | - | 0
Research on Multilingual News Clustering Based on Cross-Language Word Embeddings | - | 0
Research on the Online Update Method for Retrieval-Augmented Generation (RAG) Model with Incremental Learning | - | 0
Residual Knowledge Distillation | - | 0
ResKD: Residual-Guided Knowledge Distillation | - | 0
Resolution-Based Distillation for Efficient Histology Image Classification | - | 0
Resource-Efficient Beam Prediction in mmWave Communications with Multimodal Realistic Simulation Framework | - | 0
REFT: Resource-Efficient Federated Training Framework for Heterogeneous and Resource-Constrained Environments | - | 0
Respecting Transfer Gap in Knowledge Distillation | - | 0
Response-based Distillation for Incremental Object Detection | - | 0
Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers | - | 0
Rethinking Attention Mechanism in Time Series Classification | - | 0
Rethinking Feature-Based Knowledge Distillation for Face Recognition | - | 0
Rethinking Invariance Regularization in Adversarial Training to Improve Robustness-Accuracy Trade-off | - | 0
Rethinking Knowledge Distillation via Cross-Entropy | - | 0
Rethinking Knowledge in Distillation: An In-context Sample Retrieval Perspective | - | 0
Rethinking Position Bias Modeling with Knowledge Distillation for CTR Prediction | - | 0
Rethinking Soft Labels for Knowledge Distillation: A Bias–Variance Tradeoff Perspective | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | - | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | - | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | - | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | - | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | - | Unverified
6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | - | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | - | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | - | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | - | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 79.86 | - | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 78.76 | - | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 Accuracy (%) | 78.6 | - | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 78.28 | - | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 Accuracy (%) | 78.08 | - | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 77.93 | - | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 Accuracy (%) | 77.68 | - | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 77.5 | - | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.68 | - | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.31 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | - | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | - | Unverified