Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 826–850 of 4240 papers

Title	Date	Tasks	Status	Hype
Do You Remember . . . the Future? Weak-to-Strong generalization in 3D Object Detection	Aug 3, 2024	3D Object DetectionKnowledge Distillation	CodeCode Available	0
Exploiting the Semantic Knowledge of Pre-trained Text-Encoders for Continual Learning	Aug 2, 2024	Continual LearningKnowledge Distillation	CodeCode Available	0
DistillGrasp: Integrating Features Correlation with Knowledge Distillation for Depth Completion of Transparent Objects	Aug 1, 2024	Depth CompletionFeature Correlation	—Unverified	0
Sentence-wise Speech Summarization: Task, Datasets, and End-to-End Modeling with LM Knowledge Distillation	Aug 1, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
StyleRF-VolVis: Style Transfer of Neural Radiance Fields for Expressive Volume Visualization	Jul 31, 2024	Knowledge DistillationNeRF	—Unverified	0
Gemma 2: Improving Open Language Models at a Practical Size	Jul 31, 2024	Knowledge Distillation	—Unverified	0
Lifelong Person Search	Jul 31, 2024	Knowledge DistillationPerson Search	—Unverified	0
Dynamic Object Queries for Transformer-based Incremental Object Detection	Jul 31, 2024	Knowledge DistillationObject	—Unverified	0
VIPeR: Visual Incremental Place Recognition with Adaptive Mining and Continual Learning	Jul 31, 2024	Continual LearningKnowledge Distillation	—Unverified	0
Learning Effective Representations for Retrieval Using Self-Distillation with Adaptive Relevance Margins	Jul 31, 2024	Knowledge DistillationLanguage Modeling	—Unverified	0
Pruning Large Language Models with Semi-Structural Adaptive Sparse Training	Jul 30, 2024	GPUKnowledge Distillation	CodeCode Available	1
SalNAS: Efficient Saliency-prediction Neural Architecture Search with self-knowledge distillation	Jul 29, 2024	DecoderKnowledge Distillation	CodeCode Available	0
ActivityCLIP: Enhancing Group Activity Recognition by Mining Complementary Information from Text to Supplement Image Modality	Jul 29, 2024	Activity RecognitionGroup Activity Recognition	—Unverified	0
Overcoming Uncertain Incompleteness for Robust Multimodal Sequential Diagnosis Prediction via Curriculum Data Erasing Guided Knowledge Distillation	Jul 28, 2024	Knowledge DistillationSequential Diagnosis	CodeCode Available	0
Mixture of Modular Experts: Distilling Knowledge from a Multilingual Teacher into Specialized Modular Language Models	Jul 28, 2024	Knowledge DistillationMixture-of-Experts	CodeCode Available	0
LLAVADI: What Matters For Multimodal Large Language Models Distillation	Jul 28, 2024	Knowledge Distillation	—Unverified	0
Logic Distillation: Learning from Code Function by Function for Planning and Decision-making	Jul 28, 2024	Decision MakingKnowledge Distillation	—Unverified	0
Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network	Jul 27, 2024	Computational EfficiencyImage Super-Resolution	—Unverified	0
Modality-Balanced Learning for Multimedia Recommendation	Jul 26, 2024	Collaborative Filteringcounterfactual	CodeCode Available	1
Boosting Cross-Domain Point Classification via Distilling Relational Priors from 2D Transformers	Jul 26, 2024	Domain AdaptationDomain Generalization	CodeCode Available	0
Towards A Generalizable Pathology Foundation Model via Unified Knowledge Distillation	Jul 26, 2024	Knowledge DistillationQuestion Answering	CodeCode Available	2
FedUD: Exploiting Unaligned Data for Cross-Platform Federated Click-Through Rate Prediction	Jul 26, 2024	Click-Through Rate PredictionFederated Learning	—Unverified	0
Leveraging Foundation Models via Knowledge Distillation in Multi-Object Tracking: Distilling DINOv2 Features to FairMOT	Jul 25, 2024	Knowledge DistillationMulti-Object Tracking	CodeCode Available	0
How to Train the Teacher Model for Effective Knowledge Distillation	Jul 25, 2024	Knowledge Distillation	CodeCode Available	0
NC-NCD: Novel Class Discovery for Node Classification	Jul 25, 2024	ClassificationKnowledge Distillation	CodeCode Available	0

Show:10 25 50

← PrevPage 34 of 170Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified