Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.
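As a concrete illustration, the sketch below shows the classic response-based distillation objective (a temperature-softened KL term on teacher logits blended with standard cross-entropy on hard labels). It assumes PyTorch; the temperature T and weight alpha are illustrative defaults, not values taken from any paper or benchmark listed on this page.

```python
# Minimal sketch of soft-label knowledge distillation (Hinton-style), assuming PyTorch.
# Model, loader, and hyperparameter names are placeholders for illustration only.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Blend cross-entropy on hard labels with a KL term that pushes the student's
    temperature-softened distribution toward the teacher's."""
    # Soften both distributions with temperature T.
    soft_targets = F.softmax(teacher_logits / T, dim=-1)
    log_student = F.log_softmax(student_logits / T, dim=-1)
    # The T**2 factor keeps the soft-target gradients on the same scale as the hard-label term.
    kd_term = F.kl_div(log_student, soft_targets, reduction="batchmean") * (T * T)
    ce_term = F.cross_entropy(student_logits, labels)
    return alpha * kd_term + (1.0 - alpha) * ce_term

# Typical usage in a training step (teacher frozen, student trainable):
# with torch.no_grad():
#     teacher_logits = teacher(images)
# loss = distillation_loss(student(images), teacher_logits, labels)
# loss.backward()
```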

Papers

Showing 701–725 of 4240 papers

| Title | Status | Hype |
| --- | --- | --- |
| Symbolic Knowledge Distillation: from General Language Models to Commonsense Models | Code | 1 |
| FocusNet: Classifying Better by Focusing on Confusing Classes | Code | 1 |
| Object DGCNN: 3D Object Detection using Dynamic Graphs | Code | 1 |
| Towards Accurate Cross-Domain In-Bed Human Pose Estimation | Code | 1 |
| KNOT: Knowledge Distillation using Optimal Transport for Solving NLP Tasks | Code | 1 |
| Prune Your Model Before Distill It | Code | 1 |
| Multilingual AMR Parsing with Noisy Knowledge Distillation | Code | 1 |
| Deep Structured Instance Graph for Distilling Object Detectors | Code | 1 |
| Dynamic Knowledge Distillation for Pre-trained Language Models | Code | 1 |
| Segmentation with mixed supervision: Confidence maximization helps knowledge distillation | Code | 1 |
| Distilling Linguistic Context for Language Model Compression | Code | 1 |
| The NiuTrans System for the WMT21 Efficiency Task | Code | 1 |
| The NiuTrans System for WNGT 2020 Efficiency Task | Code | 1 |
| EfficientBERT: Progressively Searching Multilayer Perceptron via Warm-up Knowledge Distillation | Code | 1 |
| Multi-Scale Aligned Distillation for Low-Resolution Detection | Code | 1 |
| How to Select One Among All? An Extensive Empirical Study Towards the Robustness of Knowledge Distillation in Natural Language Understanding | Code | 1 |
| Beyond Preserved Accuracy: Evaluating Loyalty and Robustness of BERT Compression | Code | 1 |
| Knowledge Distillation Using Hierarchical Self-Supervision Augmented Distribution | Code | 1 |
| Black-Box Attacks on Sequential Recommenders via Data-Free Model Extraction | Code | 1 |
| Cross-category Video Highlight Detection via Set-based Learning | Code | 1 |
| PocketNet: Extreme Lightweight Face Recognition Network using Neural Architecture Search and Multi-Step Knowledge Distillation | Code | 1 |
| Efficient Medical Image Segmentation Based on Knowledge Distillation | Code | 1 |
| Supervised Compression for Resource-Constrained Edge Computing Systems | Code | 1 |
| Revisiting Adversarial Robustness Distillation: Robust Soft Labels Make Student Better | Code | 1 |
| Learning Conditional Knowledge Distillation for Degraded-Reference Image Quality Assessment | Code | 1 |

Benchmark Results

Each row lists the teacher (T:) and student (S:) models, the metric, the value claimed by the authors, the independently verified value (empty where none is available), and the verification status.

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ScaleKD (T:BEiT-L S:ViT-B/14) | Top-1 accuracy (%) | 86.43 | – | Unverified |
| 2 | ScaleKD (T:Swin-L S:ViT-B/16) | Top-1 accuracy (%) | 85.53 | – | Unverified |
| 3 | ScaleKD (T:Swin-L S:ViT-S/16) | Top-1 accuracy (%) | 83.93 | – | Unverified |
| 4 | ScaleKD (T:Swin-L S:Swin-T) | Top-1 accuracy (%) | 83.8 | – | Unverified |
| 5 | KD++ (T:regnety-16GF S:ViT-B) | Top-1 accuracy (%) | 83.6 | – | Unverified |
| 6 | VkD (T:RegNety 160 S:DeiT-S) | Top-1 accuracy (%) | 82.9 | – | Unverified |
| 7 | SpectralKD (T:Swin-S S:Swin-T) | Top-1 accuracy (%) | 82.7 | – | Unverified |
| 8 | ScaleKD (T:Swin-L S:ResNet-50) | Top-1 accuracy (%) | 82.55 | – | Unverified |
| 9 | DiffKD (T:Swin-L S:Swin-T) | Top-1 accuracy (%) | 82.5 | – | Unverified |
| 10 | DIST (T:Swin-L S:Swin-T) | Top-1 accuracy (%) | 82.3 | – | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SRD (T:resnet-32x4, S:shufflenet-v2) | Top-1 accuracy (%) | 79.86 | – | Unverified |
| 2 | shufflenet-v2 (T:resnet-32x4, S:shufflenet-v2) | Top-1 accuracy (%) | 78.76 | – | Unverified |
| 3 | MV-MR (T:CLIP/ViT-B-16 S:resnet50) | Top-1 accuracy (%) | 78.6 | – | Unverified |
| 4 | resnet8x4 (T:resnet32x4 S:resnet8x4) | Top-1 accuracy (%) | 78.28 | – | Unverified |
| 5 | resnet8x4 (T:resnet32x4 S:resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | – | Unverified |
| 6 | ReviewKD++ (T:resnet-32x4, S:shufflenet-v2) | Top-1 accuracy (%) | 77.93 | – | Unverified |
| 7 | ReviewKD++ (T:resnet-32x4, S:shufflenet-v1) | Top-1 accuracy (%) | 77.68 | – | Unverified |
| 8 | resnet8x4 (T:resnet32x4 S:resnet8x4) | Top-1 accuracy (%) | 77.5 | – | Unverified |
| 9 | resnet8x4 (T:resnet32x4 S:resnet8x4) | Top-1 accuracy (%) | 76.68 | – | Unverified |
| 10 | resnet8x4 (T:resnet32x4 S:resnet8x4) | Top-1 accuracy (%) | 76.31 | – | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | LSHFM (T:ResNet101 S:ResNet50) | mAP | 93.17 | – | Unverified |
| 2 | LSHFM (T:ResNet101 S:MobileNetV2) | mAP | 90.14 | – | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | TIE-KD (T:Adabins S:MobileNetV2) | RMSE | 2.43 | – | Unverified |