SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model (the teacher) to a smaller one (the student). While large models, such as very deep neural networks or ensembles of many models, have higher knowledge capacity than small models, that capacity may not be fully utilized; a well-trained student can often recover much of the teacher's accuracy at a fraction of the inference cost.
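
As a concrete illustration, here is a minimal PyTorch sketch of the classic response-based formulation (soft-target distillation in the style of Hinton et al., 2015): the student minimizes a blend of ordinary cross-entropy on the hard labels and a KL-divergence term that pulls its temperature-softened outputs toward the teacher's. The temperature T, the mixing weight alpha, and the function name are illustrative assumptions, not details taken from any specific paper listed below.

import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.5):
    # Soften both output distributions with temperature T; a higher T
    # exposes more of the teacher's information about non-target classes.
    soft_teacher = F.softmax(teacher_logits / T, dim=-1)
    soft_student = F.log_softmax(student_logits / T, dim=-1)
    # KL divergence between the softened distributions; the T*T factor
    # keeps gradient magnitudes comparable across temperatures.
    kd = F.kl_div(soft_student, soft_teacher, reduction="batchmean") * (T * T)
    # Ordinary cross-entropy on the ground-truth labels.
    ce = F.cross_entropy(student_logits, labels)
    return alpha * kd + (1.0 - alpha) * ce

In teacher-student pairs like those in the benchmark tables below (e.g., T: Swin-L, S: Swin-T), teacher_logits would come from the frozen large model and student_logits from the small model being trained.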

Papers

Showing 4201–4240 of 4240 papers

Title | Status | Hype
Visual Relationship Detection Based on Guided Proposals and Semantic Knowledge Distillation | – | 0
Recurrent knowledge distillation | – | 0
Knowledge Distillation with Adversarial Samples Supporting Decision Boundary | Code | 0
Knowledge Distillation in Generations: More Tolerant Teachers Educate Better Students | – | 0
Born Again Neural Networks | Code | 0
Response Ranking with Deep Matching Networks and External Knowledge in Information-seeking Conversation Systems | Code | 0
Neural Compatibility Modeling with Attentive Knowledge Distillation | – | 0
Few-shot learning of neural networks from scratch by pseudo example optimization | – | 0
Model compression for faster structural separation of macromolecules captured by Cellular Electron Cryo-Tomography | – | 0
Faster gaze prediction with dense networks and Fisher pruning | Code | 0
Deep Net Triage: Analyzing the Importance of Network Layers via Structural Compression | – | 0
Generation and Consolidation of Recollections for Efficient Deep Lifelong Learning | – | 0
Learning Deep and Compact Models for Gesture Recognition | Code | 0
NestedNet: Learning Nested Sparse Structures in Deep Neural Networks | – | 0
StrassenNets: Deep Learning with a Multiplication Budget | Code | 0
Learning Efficient Object Detection Models with Knowledge Distillation | – | 0
Knowledge Concentration: Learning 100K Object Classifiers in a Single CNN | – | 0
MicroExpNet: An Extremely Small and Fast Model For Expression Recognition From Face Images | Code | 0
Apprentice: Using Knowledge Distillation Techniques To Improve Low-Precision Network Accuracy | – | 0
Non-Autoregressive Neural Machine Translation | Code | 0
A Survey of Model Compression and Acceleration for Deep Neural Networks | – | 0
Model Distillation with Knowledge Transfer from Face Classification to Alignment and Verification | – | 0
Training Shallow and Thin Networks for Acceleration via Knowledge Distillation with Conditional Adversarial Networks | – | 0
Knowledge Distillation for Bilingual Dictionary Induction | – | 0
A Joint Sequential and Relational Model for Frame-Semantic Parsing | – | 0
Visual Relationship Detection with Internal and External Linguistic Knowledge Distillation | – | 0
WebChild 2.0: Fine-Grained Commonsense Knowledge Distillation | – | 0
A Gift From Knowledge Distillation: Fast Optimization, Network Minimization and Transfer Learning | – | 0
TIP: Typifying the Interpretability of Procedures | – | 0
Knowledge distillation using unlabeled mismatched images | – | 0
Collaborative Deep Reinforcement Learning | Code | 0
Knowledge Adaptation: Teaching to Adapt | – | 0
Ensemble Distillation for Neural Machine Translation | – | 0
Neural Machine Translation from Simplified Translations | – | 0
In Teacher We Trust: Learning Compressed Models for Pedestrian Detection | – | 0
A scalable convolutional neural network for task-specified scenarios via knowledge distillation | – | 0
Knowledge Distillation for Small-footprint Highway Networks | – | 0
Adapting Models to Signal Degradation using Distillation | – | 0
Distilling Knowledge from Deep Networks with Applications to Healthcare Domain | – | 0
Distilling Model Knowledge | Code | 0

Benchmark Results

In the Model column, T denotes the teacher and S the student. An empty Verified cell means the claimed result has not yet been independently reproduced (Status: Unverified).

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | – | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | – | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | – | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | – | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | – | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | – | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | – | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | – | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | – | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | – | Unverified
2 | shufflenet-v2 (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | – | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | – | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | – | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | – | Unverified
6 | ReviewKD++ (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | – | Unverified
7 | ReviewKD++ (T: resnet32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | – | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | – | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | – | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | – | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | – | Unverified
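
For reference, the Top-1 accuracy metric reported in the first two tables is the percentage of evaluation samples whose highest-scoring predicted class matches the ground-truth label. A minimal sketch, assuming a generic PyTorch classifier model and an evaluation loader (both placeholders, not tied to any entry above):

import torch

@torch.no_grad()
def top1_accuracy(model, loader, device="cpu"):
    model.eval()
    correct, total = 0, 0
    for images, labels in loader:
        logits = model(images.to(device))
        # A sample counts as correct when the argmax class equals its label.
        correct += (logits.argmax(dim=-1) == labels.to(device)).sum().item()
        total += labels.numel()
    return 100.0 * correct / total  # reported as a percentage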