Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.
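One common way to make this transfer concrete is to train the small model to match the large model's temperature-softened output distribution. The page does not specify a loss, so the sketch below is an illustration under that assumption: the function names `softmax` and `distillation_loss`, the default temperature of 2.0, and the example logits are all hypothetical.

```python
import math


def softmax(logits, temperature=1.0):
    """Convert logits to probabilities, softened by the given temperature."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the maximum for numerical stability
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) on temperature-softened distributions.

    The T^2 factor is the usual scaling that keeps gradient magnitudes
    comparable across temperatures (an assumption of this sketch).
    """
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    kl = sum(
        p * math.log(p / q)
        for p, q in zip(p_teacher, p_student)
        if p > 0.0
    )
    return kl * temperature ** 2


# Hypothetical logits for a three-class problem.
teacher = [4.0, 1.0, 0.5]
student = [2.5, 1.5, 0.5]
print(distillation_loss(student, teacher))
```

In practice this term is usually combined with an ordinary supervised loss on the true labels; the weighting between the two is a tuning choice not covered here.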

Papers


Title	Date	Tasks	Status
Enhancing New-item Fairness in Dynamic Recommender Systems	Apr 30, 2025	Fairness, Knowledge Distillation	Code Available
D^2TV: Dual Knowledge Distillation and Target-oriented Vision Modeling for Many-to-Many Multimodal Summarization	May 22, 2023	Knowledge Distillation	Code Available
cViL: Cross-Lingual Training of Vision-Language Models using Knowledge Distillation	Jun 7, 2022	Knowledge Distillation, Question Answering	Code Available
Teaching MLPs to Master Heterogeneous Graph-Structured Knowledge for Efficient and Accurate Inference	Nov 21, 2024	Graph Learning, Knowledge Distillation	Code Available
Uniformity First: Uniformity-aware Test-time Adaptation of Vision-language Models against Image Corruption	May 19, 2025	Knowledge Distillation, Test-time Adaptation	Code Available
Meta-Learned Modality-Weighted Knowledge Distillation for Robust Multi-Modal Learning with Missing Data	May 12, 2024	Brain Tumor Segmentation, Classification	Code Available
Customizing Synthetic Data for Data-Free Student Learning	Jul 10, 2023	Data-free Knowledge Distillation, Knowledge Distillation	Code Available
Enhancing Low-Resource NMT with a Multilingual Encoder and Knowledge Distillation: A Case Study	Jul 9, 2024	Knowledge Distillation, Language Modeling	Code Available
CXR Segmentation by AdaIN-based Domain Adaptation and Knowledge Distillation	Apr 13, 2021	Domain Adaptation, Knowledge Distillation	Code Available
Boosting Cross-Domain Point Classification via Distilling Relational Priors from 2D Transformers	Jul 26, 2024	Domain Adaptation, Domain Generalization	Code Available
Unifying Heterogeneous Classifiers with Distillation	Apr 12, 2019	Knowledge Distillation	Code Available
Blind Knowledge Distillation for Robust Image Classification	Nov 21, 2022	Classification, image-classification	Code Available
Enhancing Knowledge Distillation of Large Language Models through Efficient Multi-Modal Distribution Alignment	Sep 19, 2024	Knowledge Distillation, Model Compression	Code Available
CSE: Surface Anomaly Detection with Contrastively Selected Embedding	Mar 4, 2024	Anomaly Detection, Knowledge Distillation	Code Available
Periodic Intra-Ensemble Knowledge Distillation for Reinforcement Learning	Feb 1, 2020	Knowledge Distillation, MuJoCo	Code Available
Cross-View Consistency Regularisation for Knowledge Distillation	Dec 21, 2024	Knowledge Distillation	Code Available
A Systematic Study of Knowledge Distillation for Natural Language Generation with Pseudo-Target Training	May 3, 2023	Knowledge Distillation, Text Generation	Code Available
Cross-modal Knowledge Distillation for Vision-to-Sensor Action Recognition	Oct 8, 2021	Action Recognition, Activity Recognition	Code Available
Cross Modality Knowledge Distillation for Multi-Modal Aerial View Object Classification	Jun 19, 2021	Image Classification, Knowledge Distillation	Code Available
Unifying Synergies between Self-supervised Learning and Dynamic Computation	Jan 22, 2023	image-classification, Image Classification	Code Available
SELF-VS: Self-supervised Encoding Learning For Video Summarization	Mar 28, 2023	Knowledge Distillation, Representation Learning	Code Available
TQCompressor: improving tensor decomposition methods in neural networks via permutations	Jan 29, 2024	Knowledge Distillation, Model Compression	Code Available
Technical Report for the 5th CLVision Challenge at CVPR: Addressing the Class-Incremental with Repetition using Unlabeled Data -- 4th Place Solution	Mar 19, 2025	class-incremental learning, Class Incremental Learning	Code Available
Enhancing Knowledge Distillation for LLMs with Response-Priming Prompting	Dec 18, 2024	GSM8K, Knowledge Distillation	Code Available
Enhancing Adversarial Robustness in Low-Label Regime via Adaptively Weighted Regularization and Knowledge Distillation	Aug 8, 2023	Adversarial Robustness, Knowledge Distillation	Code Available

Benchmark Results

Datasets: ImageNet, CIFAR-100, COCO (Common Objects in Context), COCO 2017 val, PASCAL VOC, KITTI

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified