
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, that capacity may not be fully utilized, so a compact student trained to mimic the teacher's outputs can often retain most of the teacher's accuracy at a fraction of the inference cost.
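
As a concrete illustration, here is a minimal PyTorch sketch of the classic soft-target distillation loss of Hinton et al. (2015), which blends hard-label cross-entropy with a KL-divergence term between temperature-softened teacher and student predictions. The `temperature` and `alpha` defaults below are illustrative choices, not values taken from any paper listed on this page.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.5):
    """Soft-target distillation loss (Hinton et al., 2015).

    Blends hard-label cross-entropy with a KL term that pulls the
    student's softened distribution toward the teacher's.
    """
    # Temperature > 1 softens both distributions, exposing the
    # teacher's relative confidences over non-target classes.
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    log_soft_student = F.log_softmax(student_logits / temperature, dim=-1)

    # The T^2 factor keeps the KD gradient magnitude comparable
    # to the cross-entropy term as the temperature changes.
    kd = F.kl_div(log_soft_student, soft_teacher,
                  reduction="batchmean") * temperature ** 2
    ce = F.cross_entropy(student_logits, labels)
    return alpha * kd + (1.0 - alpha) * ce

# Quick self-contained check, with random tensors standing in for
# real teacher/student outputs on a 100-class problem.
if __name__ == "__main__":
    torch.manual_seed(0)
    student_logits = torch.randn(8, 100, requires_grad=True)
    teacher_logits = torch.randn(8, 100)  # would come from a frozen teacher
    labels = torch.randint(0, 100, (8,))
    loss = distillation_loss(student_logits, teacher_logits, labels)
    loss.backward()
    print(f"distillation loss: {loss.item():.4f}")
```

In practice the teacher is frozen and evaluated under `torch.no_grad()`; many of the papers below replace or augment this logit-matching term with feature-level or attention-level objectives.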

Papers

Showing 651–675 of 4240 papers

| Title | Status | Hype |
| --- | --- | --- |
| Consistent Representation Learning for Continual Relation Extraction | Code | 1 |
| Ensemble Knowledge Guided Sub-network Search and Fine-tuning for Filter Pruning | Code | 1 |
| X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning | Code | 1 |
| Self-Supervised Vision Transformers Learn Visual Concepts in Histopathology | Code | 1 |
| TransKD: Transformer Knowledge Distillation for Efficient Semantic Segmentation | Code | 1 |
| Content-Variant Reference Image Quality Assessment via Knowledge Distillation | Code | 1 |
| CaMEL: Mean Teacher Learning for Image Captioning | Code | 1 |
| General Cyclical Training of Neural Networks | Code | 1 |
| ZeroGen: Efficient Zero-shot Learning via Dataset Generation | Code | 1 |
| FAMIE: A Fast Active Learning Framework for Multilingual Information Extraction | Code | 1 |
| Point-Level Region Contrast for Object Detection Pre-Training | Code | 1 |
| Exploring Inter-Channel Correlation for Diversity-preserved Knowledge Distillation | Code | 1 |
| Local Feature Matching with Transformers for low-end devices | Code | 1 |
| Global-Reasoned Multi-Task Learning Model for Surgical Scene Understanding | Code | 1 |
| It's All in the Head: Representation Knowledge Distillation through Classifier Sharing | Code | 1 |
| SimReg: Regression as a Simple Yet Effective Tool for Self-supervised Knowledge Distillation | Code | 1 |
| Robust and Resource-Efficient Data-Free Knowledge Distillation by Generative Pseudo Replay | Code | 1 |
| Learn From Others and Be Yourself in Heterogeneous Federated Learning | Code | 1 |
| Role of Data Augmentation Strategies in Knowledge Distillation for Wearable Sensor Data | Code | 1 |
| Confidence-Aware Multi-Teacher Knowledge Distillation | Code | 1 |
| Deep Graph-level Anomaly Detection by Glocal Knowledge Distillation | Code | 1 |
| Pixel Distillation: A New Knowledge Distillation Scheme for Low-Resolution Image Recognition | Code | 1 |
| Data Efficient Language-supervised Zero-shot Recognition with Optimal Transport Distillation | Code | 1 |
| Learning Cross-Lingual IR from an English Retriever | Code | 1 |
| A Deep Knowledge Distillation framework for EEG assisted enhancement of single-lead ECG based sleep staging | Code | 1 |

Benchmark Results

In the model names below, "T:" denotes the teacher network and "S:" the student.

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ScaleKD (T:BEiT-L S:ViT-B/14) | Top-1 accuracy (%) | 86.43 | — | Unverified |
| 2 | ScaleKD (T:Swin-L S:ViT-B/16) | Top-1 accuracy (%) | 85.53 | — | Unverified |
| 3 | ScaleKD (T:Swin-L S:ViT-S/16) | Top-1 accuracy (%) | 83.93 | — | Unverified |
| 4 | ScaleKD (T:Swin-L S:Swin-T) | Top-1 accuracy (%) | 83.8 | — | Unverified |
| 5 | KD++ (T: regnety-16GF S:ViT-B) | Top-1 accuracy (%) | 83.6 | — | Unverified |
| 6 | VkD (T:RegNety 160 S:DeiT-S) | Top-1 accuracy (%) | 82.9 | — | Unverified |
| 7 | SpectralKD (T:Swin-S S:Swin-T) | Top-1 accuracy (%) | 82.7 | — | Unverified |
| 8 | ScaleKD (T:Swin-L S:ResNet-50) | Top-1 accuracy (%) | 82.55 | — | Unverified |
| 9 | DiffKD (T:Swin-L S: Swin-T) | Top-1 accuracy (%) | 82.5 | — | Unverified |
| 10 | DIST (T: Swin-L S: Swin-T) | Top-1 accuracy (%) | 82.3 | — | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SRD (T:resnet-32x4, S:shufflenet-v2) | Top-1 Accuracy (%) | 79.86 | — | Unverified |
| 2 | shufflenet-v2 (T:resnet-32x4, S:shufflenet-v2) | Top-1 Accuracy (%) | 78.76 | — | Unverified |
| 3 | MV-MR (T: CLIP/ViT-B-16 S: resnet50) | Top-1 Accuracy (%) | 78.6 | — | Unverified |
| 4 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 Accuracy (%) | 78.28 | — | Unverified |
| 5 | resnet8x4 (T: resnet32x4 S: resnet8x4 [modified]) | Top-1 Accuracy (%) | 78.08 | — | Unverified |
| 6 | ReviewKD++ (T:resnet-32x4, S:shufflenet-v2) | Top-1 Accuracy (%) | 77.93 | — | Unverified |
| 7 | ReviewKD++ (T:resnet-32x4, S:shufflenet-v1) | Top-1 Accuracy (%) | 77.68 | — | Unverified |
| 8 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 Accuracy (%) | 77.5 | — | Unverified |
| 9 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 Accuracy (%) | 76.68 | — | Unverified |
| 10 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 Accuracy (%) | 76.31 | — | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | LSHFM (T: ResNet101 S: ResNet50) | mAP | 93.17 | — | Unverified |
| 2 | LSHFM (T: ResNet101 S: MobileNetV2) | mAP | 90.14 | — | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | TIE-KD (T: Adabins S: MobileNetV2) | RMSE | 2.43 | — | Unverified |