Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.
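As a rough illustration of how this transfer is often set up, the sketch below implements the common soft-target formulation: the student is trained to match the teacher's temperature-softened output distribution, combined with the usual cross-entropy on the true label. The page itself does not prescribe a loss, so the function names, the `temperature` and `alpha` parameters, and their default values are assumptions chosen for illustration only.

```python
import math

def log_softmax(logits, temperature=1.0):
    """Log of the temperature-scaled softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    log_total = m + math.log(sum(math.exp(s - m) for s in scaled))
    return [s - log_total for s in scaled]

def distillation_loss(student_logits, teacher_logits, label,
                      temperature=4.0, alpha=0.5):
    """Hypothetical helper: weighted sum of a soft-target KL term and
    hard-label cross-entropy (default weights are illustrative)."""
    log_p_teacher = log_softmax(teacher_logits, temperature)
    log_p_student = log_softmax(student_logits, temperature)
    # KL(teacher || student) on the softened distributions.
    kl = sum(math.exp(lt) * (lt - ls)
             for lt, ls in zip(log_p_teacher, log_p_student))
    # Standard cross-entropy against the ground-truth label (temperature 1).
    ce = -log_softmax(student_logits)[label]
    # The temperature**2 factor keeps the soft-target gradients on a scale
    # comparable to the hard-label term as the temperature changes.
    return alpha * temperature ** 2 * kl + (1.0 - alpha) * ce

# Example: a 3-class problem with made-up teacher and student logits.
loss = distillation_loss([1.0, 0.5, -0.2], [3.0, 1.0, -1.0], label=0)
print(loss)
```

A higher temperature spreads the teacher's probability mass over more classes, which exposes the relative similarities between classes that a one-hot label hides; `alpha` controls how much the student relies on the teacher versus the ground truth.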

Papers

Showing 3151–3175 of 4240 papers

Title	Date	Tasks	Status	Hype
Peer Collaborative Learning for Polyphonic Sound Event Detection	Oct 7, 2021	Event Detection, Knowledge Distillation	Unverified	0
Knowledge Distillation for Neural Transducers from Large Self-Supervised Pre-trained Models	Oct 7, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
Towards Accurate Cross-Domain In-Bed Human Pose Estimation	Oct 7, 2021	Data Augmentation, Knowledge Distillation	Code Available	1
Inter-Domain Alignment for Predicting High-Resolution Brain Networks Using Teacher-Student Learning	Oct 6, 2021	Decoder, Domain Adaptation	Code Available	0
Online Hyperparameter Meta-Learning with Hypergradient Distillation	Oct 6, 2021	Hyperparameter Optimization, Knowledge Distillation	Unverified	0
KNOT: Knowledge Distillation using Optimal Transport for Solving NLP Tasks	Oct 6, 2021	Emotion Recognition, Emotion Recognition in Conversation	Code Available	1
On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis	Oct 4, 2021	Knowledge Distillation, Speech Synthesis	Unverified	0
Student Helping Teacher: Teacher Evolution via Self-Knowledge Distillation	Oct 1, 2021	Knowledge Distillation, Self-Knowledge Distillation	Code Available	0
Multilingual AMR Parsing with Noisy Knowledge Distillation	Sep 30, 2021	AMR Parsing, Knowledge Distillation	Code Available	1
Prune Your Model Before Distill It	Sep 30, 2021	Knowledge Distillation, model	Code Available	1
Improving Neural Ranking via Lossless Knowledge Distillation	Sep 30, 2021	Knowledge Distillation, Learning-To-Rank	Unverified	0
Deep Neural Compression Via Concurrent Pruning and Self-Distillation	Sep 30, 2021	Knowledge Distillation, Language Modeling	Unverified	0
A Comprehensive Overhaul of Distilling Unconditional GANs	Sep 29, 2021	Knowledge Distillation	Unverified	0
Prototypical Contrastive Predictive Coding	Sep 29, 2021	Contrastive Learning, Knowledge Distillation	Unverified	0
Self-supervised Models are Good Teaching Assistants for Vision Transformers	Sep 29, 2021	Image Classification, Knowledge Distillation	Unverified	0
A Unified Knowledge Distillation Framework for Deep Directed Graphical Models	Sep 29, 2021	Continual Learning, Federated Learning	Unverified	0
Not All Regions are Worthy to be Distilled: Region-aware Knowledge Distillation Towards Efficient Image-to-Image Translation	Sep 29, 2021	All, Contrastive Learning	Unverified	0
Explaining Knowledge Graph Embedding via Latent Rule Learning	Sep 29, 2021	Graph Embedding, Knowledge Distillation	Unverified	0
Adaptive Label Smoothing with Self-Knowledge	Sep 29, 2021	Knowledge Distillation, Machine Translation	Unverified	0
Automated Channel Pruning with Learned Importance	Sep 29, 2021	Denoising, GPU	Unverified	0
Distilling GANs with Style-Mixed Triplets for X2I Translation with Limited Data	Sep 29, 2021	Image Generation, Knowledge Distillation	Unverified	0
Stingy Teacher: Sparse Logits Suffice to Fail Knowledge Distillation	Sep 29, 2021	Knowledge Distillation	Unverified	0
MOBA: Multi-teacher Model Based Reinforcement Learning	Sep 29, 2021	Decision Making, Knowledge Distillation	Unverified	0
Fast and Efficient Once-For-All Networks for Diverse Hardware Deployment	Sep 29, 2021	All, GPU	Unverified	0
Generate, Annotate, and Learn: Generative Models Advance Self-Training and Knowledge Distillation	Sep 29, 2021	Few-Shot Learning, Knowledge Distillation	Unverified	0

Datasets: ImageNet, CIFAR-100, COCO (Common Objects in Context), COCO 2017 val, PASCAL VOC, KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified