Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.
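The description above does not say how the transfer is done. One widely used formulation, shown here only as a minimal sketch, trains the small (student) model to match the temperature-softened output distribution of the large (teacher) model while still fitting the true label. The function names, the `temperature` and `alpha` parameters, and the example logits are illustrative assumptions, not something this page specifies.

```python
import math


def log_softmax(logits, temperature=1.0):
    """Log of the temperature-scaled softmax, computed stably."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max before exponentiating to avoid overflow
    log_total = m + math.log(sum(math.exp(z - m) for z in scaled))
    return [z - log_total for z in scaled]


def distillation_loss(student_logits, teacher_logits, label,
                      temperature=4.0, alpha=0.5):
    """Soft-target distillation loss for a single example (assumed formulation).

    alpha weights the teacher-matching term; (1 - alpha) weights the
    ordinary cross-entropy against the ground-truth label.
    """
    log_p_teacher = log_softmax(teacher_logits, temperature)
    log_p_student = log_softmax(student_logits, temperature)

    # KL(teacher || student) between the softened distributions.
    kl = sum(math.exp(lt) * (lt - ls)
             for lt, ls in zip(log_p_teacher, log_p_student))

    # Standard cross-entropy of the student at temperature 1.
    ce = -log_softmax(student_logits)[label]

    # The temperature**2 factor keeps the soft-target gradients on a
    # comparable scale as the temperature changes.
    return alpha * temperature ** 2 * kl + (1.0 - alpha) * ce


if __name__ == "__main__":
    teacher = [1.0, 2.0, 5.0]   # hypothetical teacher logits
    student = [0.5, 1.5, 2.0]   # hypothetical student logits
    print(distillation_loss(student, teacher, label=2))
```

A higher `temperature` flattens both distributions, so the student also learns from the relative probabilities the teacher assigns to the wrong classes. Setting `alpha=0` reduces the loss to plain supervised training.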

Papers


Showing 3326–3350 of 4240 papers

Title	Date	Tasks	Status	Hype
DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval	Jun 24, 2021	Computational Efficiency, Knowledge Distillation	Code Available	1
Dealing with training and test segmentation mismatch: FBK@IWSLT2021	Jun 23, 2021	Action Detection, Activity Detection	Unverified	0
SSUL: Semantic Segmentation with Unknown Label for Exemplar-based Class-Incremental Learning	Jun 22, 2021	class-incremental learning, Class Incremental Learning	Code Available	1
Efficient Inference via Universal LSH Kernel	Jun 21, 2021	Knowledge Distillation, Quantization	Unverified	0
Structured Sparse R-CNN for Direct Scene Graph Generation	Jun 21, 2021	graph construction, Graph Generation	Code Available	1
Knowledge Distillation via Instance-level Sequence Learning	Jun 21, 2021	General Knowledge, Knowledge Distillation	Unverified	0
Minimally Invasive Surgery for Sparse Neural Networks in Contrastive Manner	Jun 19, 2021	Knowledge Distillation, Model Compression	Unverified	0
Tree-Like Decision Distillation	Jun 19, 2021	Decision Making, Knowledge Distillation	Unverified	0
Data-Free Knowledge Distillation for Image Super-Resolution	Jun 19, 2021	Data-free Knowledge Distillation, Image Super-Resolution	Code Available	0
Learning Student Networks in the Wild	Jun 19, 2021	Knowledge Distillation, Model Compression	Code Available	2
Positive-Unlabeled Data Purification in the Wild for Object Detection	Jun 19, 2021	Knowledge Distillation, object-detection	Unverified	0
CapsuleRRT: Relationships-Aware Regression Tracking via Capsules	Jun 19, 2021	image-classification, Image Classification	Unverified	0
Space-Time Distillation for Video Super-Resolution	Jun 19, 2021	Knowledge Distillation, Super-Resolution	Unverified	0
Teacher's pet: understanding and mitigating biases in distillation	Jun 19, 2021	image-classification, Image Classification	Unverified	0
Cross Modality Knowledge Distillation for Multi-Modal Aerial View Object Classification	Jun 19, 2021	Image Classification, Knowledge Distillation	Code Available	0
Recurrent Stacking of Layers in Neural Networks: An Application to Neural Machine Translation	Jun 18, 2021	Knowledge Distillation, Machine Translation	Unverified	0
Dual-Teacher Class-Incremental Learning With Data-Free Generative Replay	Jun 17, 2021	class-incremental learning, Class Incremental Learning	Unverified	0
Dynamic Knowledge Distillation With Noise Elimination for RGB-D Salient Object Detection	Jun 17, 2021	Knowledge Distillation, object-detection	Unverified	0
Knowledge distillation from multi-modal to mono-modal segmentation networks	Jun 17, 2021	Brain Tumor Segmentation, Image Segmentation	Unverified	0
Topology Distillation for Recommender System	Jun 16, 2021	Knowledge Distillation, Model Compression	Unverified	0
Simon Says: Evaluating and Mitigating Bias in Pruned Neural Networks with Knowledge Distillation	Jun 15, 2021	Fairness, Knowledge Distillation	Code Available	0
CoDERT: Distilling Encoder Representations with Co-learning for Transducer-based Speech Recognition	Jun 14, 2021	Decoder, Knowledge Distillation	Unverified	0
Context-Aware Image Inpainting with Learned Semantic Priors	Jun 14, 2021	Image Inpainting, Knowledge Distillation	Code Available	1
Energy-efficient Knowledge Distillation for Spiking Neural Networks	Jun 14, 2021	Knowledge Distillation, Model Compression	Unverified	0
Guiding Teacher Forcing with Seer Forcing for Neural Machine Translation	Jun 12, 2021	Decoder, Knowledge Distillation	Unverified	0


Datasets: ImageNet, CIFAR-100, COCO (Common Objects in Context), COCO 2017 val, PASCAL VOC, KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified