SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model (the teacher) to a smaller one (the student). While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity may not be fully utilized; a smaller student trained to match the teacher's output distribution can often recover much of the teacher's accuracy at a fraction of the inference cost.
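The classic formulation (Hinton et al., 2015) trains the student against temperature-softened teacher probabilities. Below is a minimal NumPy sketch of that soft-target loss; the function names and the temperature value are illustrative, not taken from any paper listed on this page:

```python
import numpy as np

def softmax(logits, T=1.0):
    """Temperature-scaled softmax; higher T gives softer probabilities."""
    z = np.asarray(logits, dtype=float) / T
    z = z - z.max()                      # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum()

def distillation_loss(student_logits, teacher_logits, T=4.0):
    """KL(teacher || student) on T-softened distributions, scaled by T^2
    so its gradient magnitude stays comparable to a hard-label loss."""
    p = softmax(teacher_logits, T)       # soft targets from the teacher
    q = softmax(student_logits, T)       # student's softened prediction
    return float(T * T * np.sum(p * (np.log(p) - np.log(q))))

# A student that matches the teacher exactly incurs zero loss:
teacher = [8.0, 2.0, -1.0]
print(distillation_loss(teacher, teacher))              # 0.0
print(distillation_loss([1.0, 1.0, 1.0], teacher) > 0)  # True
```

In practice this term is usually blended with an ordinary cross-entropy loss on the ground-truth labels, with a weighting hyperparameter controlling how much the student imitates the teacher versus the data.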

Papers

Showing 501–525 of 4240 papers

| Title | Status | Hype |
|---|---|---|
| FedDW: Distilling Weights through Consistency Optimization in Heterogeneous Federated Learning | Code | 0 |
| Diffusion-Augmented Coreset Expansion for Scalable Dataset Distillation | — | 0 |
| Expanding Deep Learning-based Sensing Systems with Multi-Source Knowledge Transfer | — | 0 |
| Multi-Branch Mutual-Distillation Transformer for EEG-Based Seizure Subtype Classification | — | 0 |
| Distillation of Diffusion Features for Semantic Correspondence | — | 0 |
| Enhancing CLIP Conceptual Embedding through Knowledge Distillation | — | 0 |
| Mutli-View 3D Reconstruction using Knowledge Distillation | Code | 0 |
| Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Model | Code | 1 |
| QABISAR: Query-Article Bipartite Interactions for Statutory Article Retrieval | — | 0 |
| Local vs. Global: Local Land-Use and Land-Cover Models Deliver Higher Quality Maps | — | 0 |
| Continuous Concepts Removal in Text-to-image Diffusion Models | — | 0 |
| Toward Fair Graph Neural Networks Via Dual-Teacher Knowledge Distillation | — | 0 |
| Reverse Thinking Makes LLMs Stronger Reasoners | — | 0 |
| Zero-shot Slot Filling in the Age of LLMs for Dialogue Systems | — | 0 |
| Puzzle: Distillation-Based NAS for Inference-Optimized LLMs | — | 0 |
| Headache to Overstock? Promoting Long-tail Items through Debiased Product Bundling | — | 0 |
| Pre-Training Graph Contrastive Masked Autoencoders are Strong Distillers for EEG | — | 0 |
| Active Data Curation Effectively Distills Large-Scale Multimodal Models | — | 0 |
| Vision Mamba Distillation for Low-resolution Fine-grained Image Classification | Code | 1 |
| Improved implicit diffusion model with knowledge distillation to estimate the spatial distribution density of carbon stock in remote sensing imagery | — | 0 |
| Large-Scale Data-Free Knowledge Distillation for ImageNet via Multi-Resolution Data Generation | Code | 0 |
| Words Matter: Leveraging Individual Text Embeddings for Code Generation in CLIP Test-Time Adaptation | Code | 0 |
| Leveraging Foundation Models To learn the shape of semi-fluid deformable objects | — | 0 |
| Dynamic Self-Distillation via Previous Mini-batches for Fine-tuning Small Language Models | — | 0 |
| Ensemble Learning via Knowledge Transfer for CTR Prediction | Code | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | — | Unverified |
| 2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | — | Unverified |
| 3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | — | Unverified |
| 4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | — | Unverified |
| 5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | — | Unverified |
| 6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | — | Unverified |
| 7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | — | Unverified |
| 8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | — | Unverified |
| 9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | — | Unverified |
| 10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | — | Unverified |
| 2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | — | Unverified |
| 3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | — | Unverified |
| 4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | — | Unverified |
| 5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | — | Unverified |
| 6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | — | Unverified |
| 7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | — | Unverified |
| 8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | — | Unverified |
| 9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | — | Unverified |
| 10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | — | Unverified |
| 2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | — | Unverified |