Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3226–3250 of 4240 papers

Title	Date	Tasks	Status	Hype
Knowledge Distillation Using Hierarchical Self-Supervision Augmented Distribution	Sep 7, 2021	image-classificationImage Classification	CodeCode Available	1
Beyond Preserved Accuracy: Evaluating Loyalty and Robustness of BERT Compression	Sep 7, 2021	Knowledge DistillationQuantization	CodeCode Available	1
Complementary Calibration: Boosting General Continual Learning with Collaborative Distillation and Self-Supervision	Sep 3, 2021	Continual LearningContrastive Learning	CodeCode Available	0
CAM-loss: Towards Learning Spatially Discriminative Feature Representations	Sep 3, 2021	Few-Shot Learningimage-classification	—Unverified	0
Knowledge Distillation with BERT for Image Tag-Based Privacy Prediction	Sep 1, 2021	Knowledge DistillationTAG	—Unverified	0
Decoupled Transformer for Scalable Inference in Open-domain Question Answering	Sep 1, 2021	Knowledge DistillationMachine Reading Comprehension	—Unverified	0
Black-Box Attacks on Sequential Recommenders via Data-Free Model Extraction	Sep 1, 2021	Data PoisoningKnowledge Distillation	CodeCode Available	1
Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge Distillation	Sep 1, 2021	Deep Reinforcement LearningGeneral Reinforcement Learning	CodeCode Available	0
FedKD: Communication Efficient Federated Learning via Knowledge Distillation	Aug 30, 2021	Federated LearningKnowledge Distillation	—Unverified	0
Lipschitz Continuity Guided Knowledge Distillation	Aug 29, 2021	Knowledge DistillationModel Compression	—Unverified	0
Distilling the Knowledge of Large-scale Generative Models into Retrieval Models for Efficient Open-domain Conversation	Aug 28, 2021	Knowledge DistillationRetrieval	CodeCode Available	0
SIGN: Spatial-information Incorporated Generative Network for Generalized Zero-shot Semantic Segmentation	Aug 27, 2021	Knowledge DistillationSegmentation	—Unverified	0
CoCo DistillNet: a Cross-layer Correlation Distillation Network for Pathological Gastric Cancer Segmentation	Aug 27, 2021	Image SegmentationKnowledge Distillation	—Unverified	0
Efficient training of lightweight neural networks using Online Self-Acquired Knowledge Distillation	Aug 26, 2021	Density EstimationKnowledge Distillation	—Unverified	0
Cross-category Video Highlight Detection via Set-based Learning	Aug 26, 2021	Domain AdaptationHighlight Detection	CodeCode Available	1
PocketNet: Extreme Lightweight Face Recognition Network using Neural Architecture Search and Multi-Step Knowledge Distillation	Aug 24, 2021	Face RecognitionKnowledge Distillation	CodeCode Available	1
Deploying a BERT-based Query-Title Relevance Classifier in a Production System: a View from the Trenches	Aug 23, 2021	CPUData Augmentation	—Unverified	0
Efficient Medical Image Segmentation Based on Knowledge Distillation	Aug 23, 2021	Image SegmentationKnowledge Distillation	CodeCode Available	1
Personalised Federated Learning: A Combinational Approach	Aug 22, 2021	Federated LearningKnowledge Distillation	—Unverified	0
Supervised Compression for Resource-Constrained Edge Computing Systems	Aug 21, 2021	Data CompressionEdge-computing	CodeCode Available	1
Boosting of Head Pose Estimation by Knowledge Distillation	Aug 20, 2021	Head Pose EstimationKnowledge Distillation	—Unverified	0
Revisiting Adversarial Robustness Distillation: Robust Soft Labels Make Student Better	Aug 18, 2021	Adversarial RobustnessKnowledge Distillation	CodeCode Available	1
Learning Conditional Knowledge Distillation for Degraded-Reference Image Quality Assessment	Aug 18, 2021	Image Quality AssessmentImage Restoration	CodeCode Available	1
BERT Learns to Teach: Knowledge Distillation with Meta Learning	Aug 17, 2021	Knowledge DistillationMeta-Learning	—Unverified	0
G-DetKD: Towards General Distillation Framework for Object Detectors via Contrastive and Semantic-guided Feature Imitation	Aug 17, 2021	Knowledge Distillationobject-detection	—Unverified	0

Show:10 25 50

← PrevPage 130 of 170Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified