Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1126–1150 of 4240 papers

Title	Date	Tasks	Status	Hype
Camera clustering for scalable stream-based active distillation	Apr 16, 2024	ClusteringKnowledge Distillation	CodeCode Available	1
Digging into contrastive learning for robust depth estimation with diffusion models	Apr 15, 2024	Contrastive LearningDenoising	CodeCode Available	1
ReffAKD: Resource-efficient Autoencoder-based Knowledge Distillation	Apr 15, 2024	Knowledge Distillation	CodeCode Available	0
AI-KD: Towards Alignment Invariant Face Image Quality Assessment Using Knowledge Distillation	Apr 15, 2024	Face AlignmentFace Image Quality	CodeCode Available	0
MTKD: Multi-Teacher Knowledge Distillation for Image Super-Resolution	Apr 15, 2024	Image Super-ResolutionKnowledge Distillation	—Unverified	0
Weight Copy and Low-Rank Adaptation for Few-Shot Distillation of Vision Transformers	Apr 14, 2024	Knowledge Distillation	CodeCode Available	0
Navigating the Landscape of Large Language Models: A Comprehensive Review and Analysis of Paradigms and Fine-Tuning Strategies	Apr 13, 2024	Few-Shot LearningKnowledge Distillation	CodeCode Available	0
Edge-Efficient Deep Learning Models for Automatic Modulation Classification: A Performance Analysis	Apr 11, 2024	Knowledge DistillationModel Optimization	—Unverified	0
Adversarial Robustness of Distilled and Pruned Deep Learning-based Wireless Classifiers	Apr 11, 2024	Adversarial RobustnessKnowledge Distillation	—Unverified	0
Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation	Apr 11, 2024	Depth EstimationDepth Prediction	—Unverified	0
Rethinking Transformer-Based Blind-Spot Network for Self-Supervised Image Denoising	Apr 11, 2024	Computational EfficiencyDenoising	CodeCode Available	2
Remembering Transformer for Continual Learning	Apr 11, 2024	Continual LearningKnowledge Distillation	—Unverified	0
A predictive machine learning force field framework for liquid electrolyte development	Apr 10, 2024	Knowledge Distillation	—Unverified	0
Optimization Methods for Personalizing Large Language Models through Retrieval Augmentation	Apr 9, 2024	Knowledge DistillationLanguage Modeling	CodeCode Available	2
Improving Facial Landmark Detection Accuracy and Efficiency with Knowledge Distillation	Apr 9, 2024	Emotion RecognitionFacial Landmark Detection	—Unverified	0
Robust feature knowledge distillation for enhanced performance of lightweight crack segmentation models	Apr 9, 2024	Crack SegmentationKnowledge Distillation	—Unverified	0
CLIP-Embed-KD: Computationally Efficient Knowledge Distillation Using Embeddings as Teachers	Apr 9, 2024	Knowledge DistillationZero-shot Generalization	CodeCode Available	1
GHOST: Grounded Human Motion Generation with Open Vocabulary Scene-and-Text Contexts	Apr 8, 2024	DescriptiveImage Segmentation	—Unverified	0
Bootstrapping Chest CT Image Understanding by Distilling Knowledge from X-ray Expert Models	Apr 7, 2024	Contrastive LearningDiagnostic	—Unverified	0
MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object Detection	Apr 7, 2024	3D Object DetectionAutonomous Driving	CodeCode Available	1
Diffusion Time-step Curriculum for One Image to 3D Generation	Apr 6, 2024	3D GenerationImage to 3D	CodeCode Available	2
What Happens When Small Is Made Smaller? Exploring the Impact of Compression on Small Data Pretrained Language Models	Apr 6, 2024	Knowledge DistillationLanguage Modeling	—Unverified	0
Do We Really Need a Complex Agent System? Distill Embodied Agent into a Single Model	Apr 6, 2024	Knowledge Distillation	—Unverified	0
Knowledge Distillation-Based Model Extraction Attack using GAN-based Private Counterfactual Explanations	Apr 4, 2024	counterfactualKnowledge Distillation	CodeCode Available	0
On the Surprising Efficacy of Distillation as an Alternative to Pre-Training Small Models	Apr 4, 2024	Contrastive LearningKnowledge Distillation	CodeCode Available	0

Show:10 25 50

← PrevPage 46 of 170Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified