Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1076–1100 of 4240 papers

Title	Date	Tasks	Status	Hype
Attend, Distill, Detect: Attention-aware Entropy Distillation for Anomaly Detection	May 10, 2024	Anomaly DetectionKnowledge Distillation	CodeCode Available	0
For the Misgendered Chinese in Gender Bias Research: Multi-Task Learning with Knowledge Distillation for Pinyin Name-Gender Prediction	May 10, 2024	Gender PredictionKnowledge Distillation	—Unverified	0
MH-pFLID: Model Heterogeneous personalized Federated Learning via Injection and Distillation for Medical Data Analysis	May 10, 2024	Federated LearningKnowledge Distillation	—Unverified	0
From Algorithm to Hardware: A Survey on Efficient and Safe Deployment of Deep Neural Networks	May 9, 2024	Knowledge DistillationModel Compression	—Unverified	0
Less-supervised learning with knowledge distillation for sperm morphology analysis	May 8, 2024	Anomaly DetectionKnowledge Distillation	CodeCode Available	0
CourseGPT-zh: an Educational Large Language Model Based on Knowledge Distillation Incorporating Prompt Optimization	May 8, 2024	DiversityKnowledge Distillation	—Unverified	0
Markowitz Meets Bellman: Knowledge-distilled Reinforcement Learning for Portfolio Management	May 8, 2024	Knowledge DistillationManagement	—Unverified	0
A Review on Discriminative Self-supervised Learning Methods in Computer Vision	May 8, 2024	ClusteringKnowledge Distillation	—Unverified	0
ELiTe: Efficient Image-to-LiDAR Knowledge Transfer for Semantic Segmentation	May 7, 2024	Knowledge DistillationLIDAR Semantic Segmentation	—Unverified	0
GOVERN: Gradient Orientation Vote Ensemble for Multi-Teacher Reinforced Distillation	May 6, 2024	Knowledge DistillationQuestion Answering	—Unverified	0
Mind the Gap Between Synthetic and Real: Utilizing Transfer Learning to Probe the Boundaries of Stable Diffusion Generated Data	May 6, 2024	Data-free Knowledge DistillationKnowledge Distillation	—Unverified	0
Sub-goal Distillation: A Method to Improve Small Language Agents	May 4, 2024	Imitation LearningKnowledge Distillation	CodeCode Available	0
Exploring Extreme Quantization in Spiking Language Models	May 4, 2024	Knowledge DistillationLanguage Modeling	—Unverified	0
Semantic Objective Functions: A distribution-aware method for adding logical constraints in deep learning	May 3, 2024	Knowledge Distillation	—Unverified	0
Advancing Pre-trained Teacher: Towards Robust Feature Discrepancy for Anomaly Detection	May 3, 2024	Anomaly DetectionAttribute	CodeCode Available	1
Efficient Compression of Multitask Multilingual Speech Models	May 2, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
Error Exponent in Agnostic PAC Learning	May 1, 2024	Binary ClassificationKnowledge Distillation	—Unverified	0
Wake Vision: A Tailored Dataset and Benchmark Suite for TinyML Computer Vision Applications	May 1, 2024	Human DetectionKnowledge Distillation	—Unverified	0
CrossMatch: Enhance Semi-Supervised Medical Image Segmentation with Perturbation Strategies and Knowledge Distillation	May 1, 2024	Image SegmentationKnowledge Distillation	CodeCode Available	1
Distillation Matters: Empowering Sequential Recommenders to Match the Performance of Large Language Model	May 1, 2024	Knowledge DistillationLanguage Modeling	CodeCode Available	1
Why does Knowledge Distillation Work? Rethink its Attention and Fidelity Mechanism	Apr 30, 2024	Data AugmentationDiversity	CodeCode Available	0
Knowledge Distillation vs. Pretraining from Scratch under a Fixed (Computation) Budget	Apr 30, 2024	Knowledge DistillationLanguage Modeling	—Unverified	0
Control Policy Correction Framework for Reinforcement Learning-based Energy Arbitrage Strategies	Apr 29, 2024	Knowledge Distillationreinforcement-learning	—Unverified	0
Revealing the Two Sides of Data Augmentation: An Asymmetric Distillation-based Win-Win Solution for Open-Set Recognition	Apr 28, 2024	Data AugmentationKnowledge Distillation	—Unverified	0
Retrieval-Oriented Knowledge for Click-Through Rate Prediction	Apr 28, 2024	Click-Through Rate PredictionContrastive Learning	CodeCode Available	1

Show:10 25 50

← PrevPage 44 of 170Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified