
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have greater knowledge capacity than small models, that capacity may not be fully utilized; a well-trained small model can therefore often recover much of the large model's performance at a fraction of the computational cost.
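
As a concrete reference, below is a minimal sketch of the classic soft-target distillation objective from Hinton et al. (2015), in which the student is trained to match the teacher's temperature-softened output distribution in addition to the ground-truth labels. This is a generic illustration, not the method of any particular paper listed here; the temperature T and weight alpha are illustrative hyperparameters.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Soft-target distillation loss in the style of Hinton et al. (2015).

    Blends KL divergence between the temperature-softened teacher and
    student distributions with ordinary cross-entropy on the hard labels.
    """
    # Soft targets: the student matches the teacher's softened distribution.
    # Scaling by T*T keeps soft-target gradients comparable across temperatures.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    # Hard targets: standard supervised cross-entropy.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard
```

Higher temperatures expose more of the teacher's relative probabilities over the incorrect classes, which is where much of the transferred "knowledge" lives.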

Papers

Showing 2051–2075 of 4240 papers

Title | Status | Hype
Wisdom of Committee: Distilling from Foundation Model to Specialized Application Model | – | 0
Unsupervised Text Style Transfer via LLMs and Attention Masking with Multi-way Interactions | – | 0
FGAD: Self-boosted Knowledge Distillation for An Effective Federated Graph Anomaly Detection Framework | – | 0
ELAD: Explanation-Guided Large Language Models Active Distillation | – | 0
PIRB: A Comprehensive Benchmark of Polish Dense and Hybrid Text Retrieval Methods | – | 0
Induced Model Matching: How Restricted Models Can Help Larger Ones | Code | 0
On the Byzantine-Resilience of Distillation-Based Federated Learning | Code | 0
Revisiting Knowledge Distillation for Autoregressive Language Models | Code | 0
Teacher as a Lenient Expert: Teacher-Agnostic Data-Free Knowledge Distillation | Code | 0
On Good Practices for Task-Specific Distillation of Large Pretrained Visual Models | – | 0
FedD2S: Personalized Data-Free Federated Knowledge Distillation | – | 0
Cultural Commonsense Knowledge for Intercultural Dialogues | – | 0
NutePrune: Efficient Progressive Pruning with Numerous Teachers for Large Language Models | Code | 0
Model Compression and Efficient Inference for Large Language Models: A Survey | – | 0
Walsh-domain Neural Network for Power Amplifier Behavioral Modelling and Digital Predistortion | – | 0
Distilled Gradual Pruning with Pruned Fine-tuning | Code | 0
Integrating ChatGPT into Secure Hospital Networks: A Case Study on Improving Radiology Report Analysis | – | 0
FedSiKD: Clients Similarity and Knowledge Distillation: Addressing Non-i.i.d. and Constraints in Federated Learning | Code | 0
Leveraging Large Language Models for Enhanced NLP Task Performance through Knowledge Distillation and Optimized Training Strategies | – | 0
APALU: A Trainable, Adaptive Activation Function for Deep Learning Networks | – | 0
Two-Stage Multi-task Self-Supervised Learning for Medical Image Segmentation | – | 0
Domain Adaptable Fine-Tune Distillation Framework For Advancing Farm Surveillance | Code | 0
Embedding Compression for Teacher-to-Student Knowledge Transfer | – | 0
Multi-source-free Domain Adaptation via Uncertainty-aware Adaptive Distillation | Code | 0
Large Language Model Meets Graph Neural Network in Knowledge Distillation | – | 0
Page 83 of 170

Benchmark Results

Each entry lists the teacher (T:) and student (S:) models; "–" marks an empty Verified value.

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | – | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | – | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | – | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | – | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | – | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | – | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | – | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | – | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | – | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | – | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | – | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | – | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | – | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | – | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | – | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | – | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | – | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | – | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | – | Unverified
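
Both leaderboards above rank methods by Top-1 accuracy. For reference, a minimal sketch of how this metric is conventionally computed; the model and loader objects are placeholders, not part of any listed submission.

```python
import torch

@torch.no_grad()
def top1_accuracy(model, loader, device="cuda"):
    """Percentage of samples whose argmax prediction matches the label."""
    model.eval()
    correct = total = 0
    for images, labels in loader:
        images, labels = images.to(device), labels.to(device)
        preds = model(images).argmax(dim=-1)  # highest-scoring class per sample
        correct += (preds == labels).sum().item()
        total += labels.numel()
    return 100.0 * correct / total  # reported as a percentage, as in the tables
```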

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | – | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: AdaBins, S: MobileNetV2) | RMSE | 2.43 | – | Unverified
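
The last entry reports RMSE (root-mean-square error), where lower is better. A minimal sketch of the metric; the tensor names are purely illustrative.

```python
import torch

def rmse(pred, target):
    """Root-mean-square error between predicted and ground-truth depth maps."""
    return torch.sqrt(torch.mean((pred - target) ** 2)).item()
```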