SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.
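
A common form of this transfer is logit distillation: the student is trained to match the teacher's temperature-softened class probabilities in addition to the ground-truth labels. The sketch below illustrates that loss in PyTorch; the temperature and weighting values are illustrative assumptions, not settings taken from any paper listed on this page.

# Minimal sketch of the classic logit-distillation loss (soft targets + hard labels).
# Temperature and alpha below are illustrative defaults, not values from any specific paper.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, temperature=4.0, alpha=0.5):
    """Blend a softened teacher/student KL term with the usual cross-entropy loss."""
    # Soften both distributions with the same temperature.
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    log_soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    # Scale the KL term by T^2 so its gradient magnitude stays comparable to the CE term.
    kd_term = F.kl_div(log_soft_student, soft_teacher, reduction="batchmean") * temperature ** 2
    ce_term = F.cross_entropy(student_logits, labels)
    return alpha * kd_term + (1.0 - alpha) * ce_term

Raising the temperature spreads probability mass over the incorrect classes, which is where much of the teacher's extra information about class similarity lives; the T^2 factor compensates for the smaller gradients produced by the softened distributions.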

Papers

Showing 3176–3200 of 4240 papers (page 128 of 170)

Title | Status | Hype
Neural Architecture Search via Ensemble-based Knowledge Distillation | - | 0
Wakening Past Concepts without Past Data: Class-incremental Learning from Placebos | - | 0
Understanding the Success of Knowledge Distillation -- A Data Augmentation Perspective | - | 0
Learning Efficient Image Super-Resolution Networks via Structure-Regularized Pruning | - | 0
Representation Consolidation from Multiple Expert Teachers | - | 0
Self-Slimming Vision Transformer | - | 0
Self-Distilled Pruning Of Neural Networks | - | 0
SeqPATE: Differentially Private Text Generation via Knowledge Distillation | - | 0
Reducing the Teacher-Student Gap via Adaptive Temperatures | - | 0
Source-Target Unified Knowledge Distillation for Memory-Efficient Federated Domain Adaptation on Edge Devices | - | 0
Pseudo Knowledge Distillation: Towards Learning Optimal Instance-specific Label Smoothing Regularization | - | 0
Feature Kernel Distillation | - | 0
Scaling Fair Learning to Hundreds of Intersectional Groups | - | 0
Exploiting Knowledge Distillation for Few-Shot Image Generation | - | 0
To Smooth or not to Smooth? On Compatibility between Label Smoothing and Knowledge Distillation | - | 0
Convolutional Neural Network Compression through Generalized Kronecker Product Decomposition | - | 0
Deep Structured Instance Graph for Distilling Object Detectors | Code | 1
Improving Question Answering Performance Using Knowledge Distillation and Active Learning | Code | 0
Partial to Whole Knowledge Distillation: Progressive Distilling Decomposed Knowledge Boosts Student Better | - | 0
Dynamic Knowledge Distillation for Pre-trained Language Models | Code | 1
Recent Advances of Continual Learning in Computer Vision: An Overview | - | 0
Low-Latency Incremental Text-to-Speech Synthesis with Distilled Context Prediction Network | - | 0
The NiuTrans Machine Translation Systems for WMT21 | - | 0
K-AID: Enhancing Pre-trained Language Models with Domain Knowledge for Question Answering | - | 0
KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T:BEiT-L S:ViT-B/14) | Top-1 accuracy (%) | 86.43 | - | Unverified
2 | ScaleKD (T:Swin-L S:ViT-B/16) | Top-1 accuracy (%) | 85.53 | - | Unverified
3 | ScaleKD (T:Swin-L S:ViT-S/16) | Top-1 accuracy (%) | 83.93 | - | Unverified
4 | ScaleKD (T:Swin-L S:Swin-T) | Top-1 accuracy (%) | 83.8 | - | Unverified
5 | KD++ (T: regnety-16GF S:ViT-B) | Top-1 accuracy (%) | 83.6 | - | Unverified
6 | VkD (T:RegNety 160 S:DeiT-S) | Top-1 accuracy (%) | 82.9 | - | Unverified
7 | SpectralKD (T:Swin-S S:Swin-T) | Top-1 accuracy (%) | 82.7 | - | Unverified
8 | ScaleKD (T:Swin-L S:ResNet-50) | Top-1 accuracy (%) | 82.55 | - | Unverified
9 | DiffKD (T:Swin-L S:Swin-T) | Top-1 accuracy (%) | 82.5 | - | Unverified
10 | DIST (T:Swin-L S:Swin-T) | Top-1 accuracy (%) | 82.3 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SRD (T:resnet-32x4 S:shufflenet-v2) | Top-1 Accuracy (%) | 79.86 | - | Unverified
2 | shufflenet-v2 (T:resnet-32x4 S:shufflenet-v2) | Top-1 Accuracy (%) | 78.76 | - | Unverified
3 | MV-MR (T:CLIP/ViT-B-16 S:resnet50) | Top-1 Accuracy (%) | 78.6 | - | Unverified
4 | resnet8x4 (T:resnet32x4 S:resnet8x4) | Top-1 Accuracy (%) | 78.28 | - | Unverified
5 | resnet8x4 (T:resnet32x4 S:resnet8x4 [modified]) | Top-1 Accuracy (%) | 78.08 | - | Unverified
6 | ReviewKD++ (T:resnet-32x4 S:shufflenet-v2) | Top-1 Accuracy (%) | 77.93 | - | Unverified
7 | ReviewKD++ (T:resnet-32x4 S:shufflenet-v1) | Top-1 Accuracy (%) | 77.68 | - | Unverified
8 | resnet8x4 (T:resnet32x4 S:resnet8x4) | Top-1 Accuracy (%) | 77.5 | - | Unverified
9 | resnet8x4 (T:resnet32x4 S:resnet8x4) | Top-1 Accuracy (%) | 76.68 | - | Unverified
10 | resnet8x4 (T:resnet32x4 S:resnet8x4) | Top-1 Accuracy (%) | 76.31 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T:ResNet101 S:ResNet50) | mAP | 93.17 | - | Unverified
2 | LSHFM (T:ResNet101 S:MobileNetV2) | mAP | 90.14 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T:Adabins S:MobileNetV2) | RMSE | 2.43 | - | Unverified
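
Moving an entry above from "Unverified" to "Verified" amounts to re-running the claimed evaluation of the released student model. A minimal sketch of that kind of check for a Top-1 accuracy entry is shown below; the resnet50 weights and the ImageNet validation path are placeholders, since the actual distilled student checkpoints are not part of this page.

# Hedged sketch of reproducing a claimed Top-1 accuracy figure.
# The resnet50 checkpoint and the dataset path are stand-ins for a real distilled student.
import torch
import torchvision

weights = torchvision.models.ResNet50_Weights.IMAGENET1K_V2
model = torchvision.models.resnet50(weights=weights).eval()

preprocess = weights.transforms()  # resize/crop/normalize pipeline tied to the checkpoint
val_set = torchvision.datasets.ImageFolder("/path/to/imagenet/val", transform=preprocess)
loader = torch.utils.data.DataLoader(val_set, batch_size=256, num_workers=8)

correct = total = 0
with torch.no_grad():
    for images, labels in loader:
        preds = model(images).argmax(dim=1)   # top-1 prediction per image
        correct += (preds == labels).sum().item()
        total += labels.numel()
print(f"Top-1 accuracy: {100.0 * correct / total:.2f}%")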