
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity is often not fully utilized, so a compact student can frequently match the teacher's accuracy at a fraction of the inference cost. Distillation achieves this by training the student to mimic the teacher's outputs rather than learning from the hard labels alone; see the sketch below.
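
The following is a minimal sketch of the classic logit-matching formulation (Hinton et al., 2015) in PyTorch. It is illustrative rather than the method of any specific paper listed below; the temperature `T` and mixing weight `alpha` are assumed defaults, not values taken from this page.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits: torch.Tensor,
                      teacher_logits: torch.Tensor,
                      labels: torch.Tensor,
                      T: float = 4.0,
                      alpha: float = 0.9) -> torch.Tensor:
    """Logit-based knowledge distillation loss (Hinton et al., 2015).

    Weighted sum of (a) KL divergence between temperature-softened
    teacher and student distributions and (b) ordinary cross-entropy
    against the ground-truth labels.
    """
    # Temperature T > 1 flattens the teacher's distribution, exposing the
    # relative probabilities it assigns to wrong classes ("dark knowledge").
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)  # rescale so this term's gradients match the hard-label term
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard

# Toy usage with random tensors standing in for real teacher/student outputs.
if __name__ == "__main__":
    student_logits = torch.randn(8, 100, requires_grad=True)
    teacher_logits = torch.randn(8, 100)
    labels = torch.randint(0, 100, (8,))
    loss = distillation_loss(student_logits, teacher_logits, labels)
    loss.backward()
    print(f"KD loss: {loss.item():.4f}")
```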

Papers

Showing 1951–1975 of 4240 papers

| Title | Status | Hype |
| --- | --- | --- |
| On enhancing the robustness of Vision Transformers: Defensive Diffusion | Code | 0 |
| Analyzing Compression Techniques for Computer Vision | — | 0 |
| Towards Understanding and Improving Knowledge Distillation for Neural Machine Translation | Code | 0 |
| AMTSS: An Adaptive Multi-Teacher Single-Student Knowledge Distillation Framework For Multilingual Language Inference | — | 0 |
| Black-box Source-free Domain Adaptation via Two-stage Knowledge Distillation | — | 0 |
| GSB: Group Superposition Binarization for Vision Transformer with Limited Training Samples | Code | 0 |
| Knowledge distillation with Segment Anything (SAM) model for Planetary Geological Mapping | — | 0 |
| A Lightweight Domain Adversarial Neural Network Based on Knowledge Distillation for EEG-based Cross-subject Emotion Recognition | — | 0 |
| Improving Continual Relation Extraction by Distinguishing Analogous Semantics | Code | 1 |
| Long-Tailed Question Answering in an Open World | — | 0 |
| Serial Contrastive Knowledge Distillation for Continual Few-shot Relation Extraction | Code | 1 |
| A Survey on the Robustness of Computer Vision Models against Common Corruptions | Code | 0 |
| Explainable Knowledge Distillation for On-device Chest X-Ray Classification | — | 0 |
| SRIL: Selective Regularization for Class-Incremental Learning | — | 0 |
| FedNoRo: Towards Noise-Robust Federated Learning by Addressing Class Imbalance and Label Noise Heterogeneity | Code | 1 |
| Multi-Teacher Knowledge Distillation For Text Image Machine Translation | Code | 0 |
| SUR-adapter: Enhancing Text-to-Image Pre-trained Diffusion Models with Large Language Models | Code | 1 |
| DynamicKD: An Effective Knowledge Distillation via Dynamic Entropy Correction-Based Distillation for Gap Optimizing | — | 0 |
| Distilling Script Knowledge from Large Language Models for Constrained Language Planning | Code | 1 |
| Web Content Filtering through knowledge distillation of Large Language Models | — | 0 |
| Do Not Blindly Imitate the Teacher: Using Perturbed Loss for Knowledge Distillation | — | 0 |
| NeuroComparatives: Neuro-Symbolic Distillation of Comparative Knowledge | — | 0 |
| Structural and Statistical Texture Knowledge Distillation for Semantic Segmentation | — | 0 |
| Distilled Mid-Fusion Transformer Networks for Multi-Modal Human Activity Recognition | — | 0 |
| Avatar Knowledge Distillation: Self-ensemble Teacher Paradigm with Uncertainty | Code | 1 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | — | Unverified |
| 2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | — | Unverified |
| 3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | — | Unverified |
| 4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | — | Unverified |
| 5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | — | Unverified |
| 6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | — | Unverified |
| 7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | — | Unverified |
| 8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | — | Unverified |
| 9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | — | Unverified |
| 10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | — | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | — | Unverified |
| 2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | — | Unverified |
| 3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | — | Unverified |
| 4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | — | Unverified |
| 5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | — | Unverified |
| 6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | — | Unverified |
| 7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | — | Unverified |
| 8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | — | Unverified |
| 9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | — | Unverified |
| 10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | — | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | — | Unverified |
| 2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | — | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | — | Unverified |