SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity may not be fully utilized, so a compact student model can often be trained to approximate the teacher's behavior at a fraction of the inference cost.
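
In the classic logit-matching formulation (Hinton et al., 2015), the student minimizes a weighted sum of the usual cross-entropy on ground-truth labels and a KL-divergence term that pulls its temperature-softened predictions toward the teacher's. Below is a minimal PyTorch sketch; the temperature `T` and mixing weight `alpha` are illustrative hyperparameters, not values taken from any paper listed on this page:

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Classic logit-based knowledge-distillation loss (Hinton et al., 2015)."""
    # Soften both distributions with temperature T; the T^2 factor keeps the
    # gradient magnitude of the soft term comparable to the hard-label term.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    # Ordinary cross-entropy against the ground-truth labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard

# Toy usage: a batch of 8 samples over 100 classes.
student_logits = torch.randn(8, 100)            # from the small student
teacher_logits = torch.randn(8, 100).detach()   # from the frozen teacher
labels = torch.randint(0, 100, (8,))
loss = distillation_loss(student_logits, teacher_logits, labels)
```

Many of the papers below replace or augment this objective (for example with feature-level or alternative-divergence variants), but the teacher/student (T/S) pairings in the benchmark tables follow the same pattern.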

Papers

Showing 1151–1175 of 4240 papers

| Title | Status | Hype |
| --- | --- | --- |
| On the Surprising Efficacy of Distillation as an Alternative to Pre-Training Small Models | Code | 0 |
| Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought | | 0 |
| Knowledge Distillation-Based Model Extraction Attack using GAN-based Private Counterfactual Explanations | Code | 0 |
| Improve Knowledge Distillation via Label Revision and Data Selection | | 0 |
| Rethinking Kullback-Leibler Divergence in Knowledge Distillation for Large Language Models | Code | 1 |
| Adaptive Affinity-Based Generalization For MRI Imaging Segmentation Across Resource-Limited Settings | | 0 |
| Knowledge Distillation with Multi-granularity Mixture of Priors for Image Super-Resolution | | 0 |
| Foundation Models for Structural Health Monitoring | Code | 0 |
| Rethinking Pruning for Vision-Language Models: Strategies for Effective Sparsity and Performance Restoration | Code | 1 |
| Federated Distillation: A Survey | | 0 |
| TSCM: A Teacher-Student Model for Vision Place Recognition Using Cross-Metric Knowledge Distillation | Code | 1 |
| Class-Incremental Few-Shot Event Detection | | 0 |
| Task Integration Distillation for Object Detectors | | 0 |
| Pre-trained Vision and Language Transformers Are Few-Shot Incremental Learners | Code | 2 |
| Towards Scalable & Efficient Interaction-Aware Planning in Autonomous Vehicles using Knowledge Distillation | | 0 |
| A Comprehensive Review of Knowledge Distillation in Computer Vision | | 0 |
| SUGAR: Pre-training 3D Visual Representations for Robotics | | 0 |
| PDF: A Probability-Driven Framework for Open World 3D Point Cloud Semantic Segmentation | Code | 1 |
| LLM-RadJudge: Achieving Radiologist-Level Evaluation for X-Ray Report Generation | | 0 |
| Weak-to-Strong 3D Object Detection with X-Ray Distillation | Code | 0 |
| DMSSN: Distilled Mixed Spectral-Spatial Network for Hyperspectral Salient Object Detection | Code | 0 |
| Orchestrate Latent Expertise: Advancing Online Continual Learning with Multi-Level Supervision and Reverse Self-Distillation | Code | 1 |
| ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning | Code | 2 |
| GOLD: Generalized Knowledge Distillation via Out-of-Distribution-Guided Language Data Generation | | 0 |
| De-confounded Data-free Knowledge Distillation for Handling Distribution Shifts | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | | Unverified |
| 2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | | Unverified |
| 3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | | Unverified |
| 4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | | Unverified |
| 5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | | Unverified |
| 6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | | Unverified |
| 7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | | Unverified |
| 8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | | Unverified |
| 9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | | Unverified |
| 10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 79.86 | | Unverified |
| 2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 78.76 | | Unverified |
| 3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 Accuracy (%) | 78.6 | | Unverified |
| 4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 78.28 | | Unverified |
| 5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 Accuracy (%) | 78.08 | | Unverified |
| 6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 77.93 | | Unverified |
| 7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 Accuracy (%) | 77.68 | | Unverified |
| 8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 77.5 | | Unverified |
| 9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.68 | | Unverified |
| 10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.31 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | | Unverified |
| 2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | | Unverified |