Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2201–2225 of 4240 papers

Title	Date	Tasks	Status
LayerCollapse: Adaptive compression of neural networks	Nov 29, 2023	Computational Efficiencyimage-classification	—Unverified
The Devil is in the Data: Learning Fair Graph Neural Networks via Partial Knowledge Distillation	Nov 29, 2023	FairnessKnowledge Distillation	CodeCode Available
Propagate & Distill: Towards Effective Graph Learners Using Propagation-Embracing MLPs	Nov 29, 2023	Graph Neural NetworkKnowledge Distillation	—Unverified
Rethinking Intermediate Layers design in Knowledge Distillation for Kidney and Liver Tumor Segmentation	Nov 28, 2023	DiagnosticKnowledge Distillation	CodeCode Available
DiffusionTalker: Personalization and Acceleration for Speech-Driven 3D Face Diffuser	Nov 28, 2023	3D Face AnimationContrastive Learning	—Unverified
FedAL: Black-Box Federated Knowledge Distillation Enabled by Adversarial Learning	Nov 28, 2023	Knowledge DistillationTransfer Learning	—Unverified
UFIN: Universal Feature Interaction Network for Multi-Domain Click-Through Rate Prediction	Nov 27, 2023	Click-Through Rate PredictionKnowledge Distillation	CodeCode Available
Wired Perspectives: Multi-View Wire Art Embraces Generative AI	Nov 26, 2023	Knowledge Distillation	—Unverified
Unlearning via Sparse Representations	Nov 26, 2023	Knowledge Distillation	—Unverified
Double Reverse Regularization Network Based on Self-Knowledge Distillation for SAR Object Classification	Nov 26, 2023	Knowledge DistillationSelf-Knowledge Distillation	—Unverified
Cosine Similarity Knowledge Distillation for Individual Class Information Transfer	Nov 24, 2023	Knowledge DistillationModel Compression	—Unverified
Efficient Open-world Reinforcement Learning via Knowledge Distillation and Autonomous Rule Discovery	Nov 24, 2023	Deep Reinforcement LearningKnowledge Distillation	—Unverified
Pseudo-label Correction for Instance-dependent Noise Using Teacher-student Framework	Nov 24, 2023	Knowledge DistillationPseudo Label	—Unverified
Maximizing Discrimination Capability of Knowledge Distillation with Energy Function	Nov 24, 2023	Data AugmentationKnowledge Distillation	—Unverified
Bridging Classical and Quantum Machine Learning: Knowledge Transfer From Classical to Quantum Neural Networks Using Knowledge Distillation	Nov 23, 2023	Dimensionality ReductionImage Classification	—Unverified
Efficient and Robust Jet Tagging at the LHC with Knowledge Distillation	Nov 23, 2023	Inductive BiasJet Tagging	CodeCode Available
Robustness-Reinforced Knowledge Distillation with Correlation Distance and Network Pruning	Nov 23, 2023	Data AugmentationKnowledge Distillation	—Unverified
Knowledge Distillation Based Semantic Communications For Multiple Users	Nov 23, 2023	DecoderKnowledge Distillation	—Unverified
Education distillation:getting student models to learn in shcools	Nov 23, 2023	Incremental LearningKnowledge Distillation	—Unverified
Efficient Transformer Knowledge Distillation: A Performance Review	Nov 22, 2023	Knowledge DistillationModel Compression	—Unverified
EA-KD: Entropy-based Adaptive Knowledge Distillation	Nov 22, 2023	image-classificationImage Classification	—Unverified
Unveiling the Unseen Potential of Graph Learning through MLPs: Effective Graph Learners Using Propagation-Embracing MLPs	Nov 20, 2023	Graph LearningGraph Neural Network	—Unverified
LightBTSeg: A lightweight breast tumor segmentation model using ultrasound images via dual-path joint knowledge distillation	Nov 18, 2023	Knowledge DistillationLesion Detection	—Unverified
Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers	Nov 17, 2023	Knowledge Distillation	—Unverified
Semi-supervised ViT knowledge distillation network with style transfer normalization for colorectal liver metastases survival prediction	Nov 17, 2023	Generative Adversarial NetworkKnowledge Distillation	—Unverified

Show:10 25 50

← PrevPage 89 of 170Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified