
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity may not be fully utilized; a smaller "student" model can therefore often be trained to match the behavior of a large "teacher" model at a fraction of the inference cost.
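
Most methods listed below build on the classic soft-target formulation of Hinton et al. (2015), where the student is trained against the teacher's temperature-softened output distribution in addition to the ground-truth labels. A minimal sketch in PyTorch, assuming standard logits-and-labels inputs (the function name and hyperparameter values here are illustrative, not taken from any paper on this page):

```python
# Minimal sketch of the classic soft-target distillation loss
# (Hinton et al., 2015). Illustrative only.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.5):
    """Blend a soft-target KL term with ordinary cross-entropy.

    `temperature` softens both distributions so the student can learn
    from the teacher's relative class probabilities; `alpha` balances
    imitating the teacher against fitting the hard labels. The T^2
    factor keeps the soft-target gradients on the same scale as the
    hard-label gradients, as recommended in the original paper.
    """
    soft_targets = F.softmax(teacher_logits / temperature, dim=-1)
    log_student = F.log_softmax(student_logits / temperature, dim=-1)
    kd = F.kl_div(log_student, soft_targets,
                  reduction="batchmean") * temperature ** 2
    ce = F.cross_entropy(student_logits, labels)
    return alpha * kd + (1.0 - alpha) * ce

# Toy usage: random logits for a batch of 8 examples over 10 classes.
student_logits = torch.randn(8, 10, requires_grad=True)
teacher_logits = torch.randn(8, 10)  # would come from the frozen teacher
labels = torch.randint(0, 10, (8,))
loss = distillation_loss(student_logits, teacher_logits, labels)
loss.backward()
```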

Papers

Showing 651–675 of 4240 papers

Title | Status | Hype
Efficient and Robust Knowledge Distillation from A Stronger Teacher Based on Correlation Matching | — | 0
KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from Server | Code | 0
Progressive distillation induces an implicit curriculum | — | 0
ReasoningRank: Teaching Student Models to Rank through Reasoning-Based Knowledge Distillation | — | 0
DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs | Code | 0
CAPEEN: Image Captioning with Early Exits and Knowledge Distillation | Code | 0
DiDOTS: Knowledge Distillation from Large-Language-Models for Dementia Obfuscation in Transcribed Speech | — | 0
Gap Preserving Distillation by Building Bidirectional Mappings with A Dynamic Teacher | — | 0
Accelerating Diffusion Models with One-to-Many Knowledge Distillation | — | 0
Distillation-Free One-Step Diffusion for Real-World Image Super-Resolution | Code | 2
Self-Supervised Keypoint Detection with Distilled Depth Keypoint Representation | — | 0
DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models | — | 0
Learning from Committee: Reasoning Distillation from a Mixture of Teachers with Peer-Review | Code | 2
Dataset Distillation via Knowledge Distillation: Towards Efficient Self-Supervised Pre-Training of Deep Networks | Code | 0
BLEND: Behavior-guided Neural Population Dynamics Modeling via Privileged Knowledge Distillation | Code | 0
PairDistill: Pairwise Relevance Distillation for Dense Retrieval | Code | 1
"No Matter What You Do": Purifying GNN Models via Backdoor Unlearning | Code | 0
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models | Code | 1
Foldable SuperNets: Scalable Merging of Transformers with Different Initializations and Tasks | Code | 0
PHI-S: Distribution Balancing for Label-Free Multi-Teacher Distillation | — | 0
Self-Updatable Large Language Models with Parameter Integration | — | 0
AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge Distillation for Large Language Models in Code Generation | Code | 0
Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging | — | 0
Advancing Medical Radiograph Representation Learning: A Hybrid Pre-training Paradigm with Multilevel Semantic Granularity | — | 0
Efficient Technical Term Translation: A Knowledge Distillation Approach for Parenthetical Terminology Translation | — | 0
Page 27 of 170

Benchmark Results

Model entries name the teacher (T) and student (S) architectures; the Verified column is blank where a claimed result has not yet been independently reproduced.

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | — | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | — | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | — | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | — | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | — | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | — | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | — | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | — | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | — | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | — | Unverified
2 | shufflenet-v2 (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | — | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | — | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | — | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | — | Unverified
6 | ReviewKD++ (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | — | Unverified
7 | ReviewKD++ (T: resnet32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | — | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | — | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | — | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | — | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: AdaBins, S: MobileNetV2) | RMSE | 2.43 | — | Unverified