Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3126–3150 of 4240 papers

Title	Date	Tasks	Status	Hype
Class Incremental Online Streaming Learning	Oct 20, 2021	class-incremental learningClass Incremental Learning	—Unverified	0
FedHe: Heterogeneous Models and Communication-Efficient Federated Learning	Oct 19, 2021	Federated LearningKnowledge Distillation	CodeCode Available	0
Adaptive Distillation: Aggregating Knowledge from Multiple Paths for Efficient Distillation	Oct 19, 2021	Knowledge DistillationNeural Network Compression	CodeCode Available	0
Graph-less Neural Networks: Teaching Old MLPs New Tricks via Distillation	Oct 17, 2021	Knowledge DistillationNode Classification	CodeCode Available	1
Know your tools well: Better and faster QA with synthetic examples	Oct 16, 2021	DiversityKnowledge Distillation	—Unverified	0
HRKD: Hierarchical Relational Knowledge Distillation for Cross-domain Language Model Compression	Oct 16, 2021	Few-Shot LearningKnowledge Distillation	CodeCode Available	0
A Short Study on Compressing Decoder-Based Language Models	Oct 16, 2021	DecoderKnowledge Distillation	—Unverified	0
Robustness Challenges in Model Distillation and Pruning for Natural Language Understanding	Oct 16, 2021	Knowledge DistillationModel Compression	—Unverified	0
Pro-KD: Progressive Distillation by Following the Footsteps of the Teacher	Oct 16, 2021	image-classificationImage Classification	—Unverified	0
From Multimodal to Unimodal Attention in Transformers using Knowledge Distillation	Oct 15, 2021	Knowledge DistillationMultimodal Deep Learning	—Unverified	0
Sparse Progressive Distillation: Resolving Overfitting under Pretrain-and-Finetune Paradigm	Oct 15, 2021	Knowledge Distillation	—Unverified	0
Multilingual Neural Machine Translation:Can Linguistic Hierarchies Help?	Oct 15, 2021	Knowledge DistillationMachine Translation	—Unverified	0
Kronecker Decomposition for GPT Compression	Oct 15, 2021	Knowledge DistillationLanguage Modeling	—Unverified	0
FocusNet: Classifying Better by Focusing on Confusing Classes	Oct 14, 2021	Classificationimage-classification	CodeCode Available	1
Symbolic Knowledge Distillation: from General Language Models to Commonsense Models	Oct 14, 2021	Knowledge DistillationKnowledge Graphs	CodeCode Available	1
Language Modelling via Learning to Rank	Oct 13, 2021	Knowledge DistillationLanguage Modelling	—Unverified	0
False Negative Distillation and Contrastive Learning for Personalized Outfit Recommendation	Oct 13, 2021	Contrastive LearningData Augmentation	—Unverified	0
Object DGCNN: 3D Object Detection using Dynamic Graphs	Oct 13, 2021	2D Object Detection3D Object Detection	CodeCode Available	1
CONetV2: Efficient Auto-Channel Size Optimization for CNNs	Oct 13, 2021	Knowledge DistillationNeural Architecture Search	CodeCode Available	0
Rectifying the Data Bias in Knowledge Distillation	Oct 11, 2021	Face RecognitionFace Verification	—Unverified	0
Compact CNN Models for On-device Ocular-based User Recognition in Mobile Devices	Oct 11, 2021	Knowledge DistillationNetwork Pruning	—Unverified	0
Towards Streaming Egocentric Action Anticipation	Oct 11, 2021	Action AnticipationKnowledge Distillation	—Unverified	0
Visualizing the embedding space to explain the effect of knowledge distillation	Oct 9, 2021	Knowledge Distillation	—Unverified	0
Towards Data-Free Domain Generalization	Oct 9, 2021	Data-free Knowledge DistillationDomain Generalization	CodeCode Available	0
Cross-modal Knowledge Distillation for Vision-to-Sensor Action Recognition	Oct 8, 2021	Action RecognitionActivity Recognition	CodeCode Available	0

Show:10 25 50

← PrevPage 126 of 170Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified