SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity may not be fully utilized. Distillation exploits this by training a compact student model to imitate a large teacher model, recovering much of the teacher's accuracy at a fraction of the inference cost.
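
As a concrete illustration, the classic soft-target formulation (Hinton et al., 2015) trains the student on a weighted combination of the ordinary cross-entropy loss and a KL-divergence term between temperature-softened teacher and student outputs. The Python sketch below is a minimal version of that loss; the temperature, weighting, and the teacher/student models referenced in the comments are illustrative assumptions, not the setup of any particular paper listed on this page.

    # Minimal sketch of soft-target knowledge distillation (PyTorch).
    # Temperature and alpha values are illustrative assumptions.
    import torch
    import torch.nn.functional as F

    def distillation_loss(student_logits, teacher_logits, labels,
                          temperature=4.0, alpha=0.5):
        # Hard-label loss against the ground-truth classes.
        hard = F.cross_entropy(student_logits, labels)
        # Soft-target loss: KL divergence between temperature-softened
        # distributions, scaled by T^2 to keep gradient magnitudes comparable.
        soft = F.kl_div(
            F.log_softmax(student_logits / temperature, dim=1),
            F.softmax(teacher_logits / temperature, dim=1),
            reduction="batchmean",
        ) * (temperature ** 2)
        return alpha * hard + (1.0 - alpha) * soft

    # Typical usage (hypothetical `teacher`, `student`, `images`, `labels`):
    #   with torch.no_grad():
    #       teacher_logits = teacher(images)
    #   loss = distillation_loss(student(images), teacher_logits, labels)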

Papers

Showing 3776–3800 of 4240 papers

Title | Status | Hype
Real-Time Decentralized Knowledge Transfer at the Edge | Code | 0
EGAD: Evolving Graph Representation Learning with Self-Attention and Knowledge Distillation for Live Video Streaming Events | Code | 0
Distill2Vec: Dynamic Graph Representation Learning with Knowledge Distillation | Code | 0
On Estimating the Training Cost of Conversational Recommendation Systems | | 0
Knowledge Distillation for Singing Voice Detection | Code | 0
Ensemble Knowledge Distillation for CTR Prediction | | 0
Robustness and Diversity Seeking Data-Free Knowledge Distillation | Code | 0
Human-Like Active Learning: Machines Simulating the Human Learning Process | | 0
Channel Planting for Deep Neural Networks using Knowledge Distillation | | 0
On Self-Distilling Graph Neural Network | | 0
Paralinguistic Privacy Protection at the Edge | | 0
A Comprehensive Study of Class Incremental Learning Algorithms for Visual Tasks | | 0
Distilling Knowledge by Mimicking Features | Code | 0
Learning to Maximize Speech Quality Directly Using MOS Prediction for Neural Text-to-Speech | | 0
Data-free Knowledge Distillation for Segmentation using Data-Enriching GAN | Code | 0
The NiuTrans Machine Translation Systems for WMT20 | | 0
IIE’s Neural Machine Translation Systems for WMT20 | | 0
HW-TSC’s Participation in the WMT 2020 News Translation Shared Task | | 0
High Performance Natural Language Processing | | 0
Using the Past Knowledge to Improve Sentiment Classification | | 0
Distilling Structured Knowledge for Text-Based Relational Reasoning | | 0
Fast End-to-end Coreference Resolution for Korean | | 0
Bridging the Gap between Prior and Posterior Knowledge Selection for Knowledge-Grounded Dialogue Generation | | 0
FedED: Federated Learning via Ensemble Distillation for Medical Relation Extraction | | 0
MixKD: Towards Efficient Distillation of Large-scale Language Models | | 0

Benchmark Results

In the model names below, T: denotes the teacher network and S: the student network.

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: AdaBins, S: MobileNetV2) | RMSE | 2.43 | | Unverified