SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized. Distillation trains the small "student" model to reproduce the outputs of the large "teacher" (typically its softened class probabilities), often retaining much of the teacher's accuracy at a fraction of the inference cost.
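The classic distillation objective underlying much of this work can be sketched as below. This is a minimal illustration, not code from any listed paper; the function names, the temperature of 4.0, and the mixing weight alpha = 0.5 are illustrative choices.

```python
import math

def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits.
    Higher temperature produces a softer (more uniform) distribution."""
    m = max(l / temperature for l in logits)  # subtract max for numerical stability
    exps = [math.exp(l / temperature - m) for l in logits]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_logits, true_label,
                      temperature=4.0, alpha=0.5):
    """Hinton-style knowledge distillation loss:
    alpha * cross-entropy(student, hard label)
    + (1 - alpha) * T^2 * KL(teacher_soft || student_soft).
    The T^2 factor keeps gradient magnitudes comparable across temperatures."""
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    kl = sum(pt * math.log(pt / ps)
             for pt, ps in zip(p_teacher, p_student) if pt > 0)
    hard_ce = -math.log(softmax(student_logits)[true_label])
    return alpha * hard_ce + (1 - alpha) * temperature ** 2 * kl
```

In practice the soft targets carry the teacher's "dark knowledge" about inter-class similarity, which is what lets a small student outperform the same architecture trained on hard labels alone.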

Papers

Showing 526–550 of 4240 papers

Title | Status | Hype
Beyond Task Vectors: Selective Task Arithmetic Based on Importance Metrics | — | 0
O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? | Code | 7
When Babies Teach Babies: Can student knowledge sharing outperform Teacher-Guided Distillation on small datasets? | Code | 0
Learn from Foundation Model: Fruit Detection Model without Manual Annotation | Code | 1
TransFair: Transferring Fairness from Ocular Disease Classification to Progression Prediction | — | 0
Efficient Ternary Weight Embedding Model: Bridging Scalability and Performance | Code | 0
Partial Knowledge Distillation for Alleviating the Inherent Inter-Class Discrepancy in Federated Learning | — | 0
Faithful Label-free Knowledge Distillation | Code | 0
BanglaEmbed: Efficient Sentence Embedding Models for a Low-Resource Language Using Cross-Lingual Distillation Techniques | — | 0
Adversarial Prompt Distillation for Vision-Language Models | — | 0
RankByGene: Gene-Guided Histopathology Representation Learning Through Cross-Modal Ranking Consistency | — | 0
Simplifying CLIP: Unleashing the Power of Large-Scale Models on Consumer-level Computers | — | 0
Adaptive Group Robust Ensemble Knowledge Distillation | — | 0
Improving Mathematical Reasoning Capabilities of Small Language Models via Feedback-Driven Distillation | — | 0
Information Extraction from Heterogeneous Documents without Ground Truth Labels using Synthetic Label Generation and Knowledge Distillation | — | 0
BiomedCoOp: Learning to Prompt for Biomedical Vision-Language Models | Code | 2
WARLearn: Weather-Adaptive Representation Learning | Code | 0
Teaching MLPs to Master Heterogeneous Graph-Structured Knowledge for Efficient and Accurate Inference | Code | 0
CLFace: A Scalable and Resource-Efficient Continual Learning Framework for Lifelong Face Recognition | — | 0
Explainable LLM-driven Multi-dimensional Distillation for E-Commerce Relevance Learning | — | 0
RTSR: A Real-Time Super-Resolution Model for AV1 Compressed Content | — | 0
What Makes a Good Dataset for Knowledge Distillation? | — | 0
Just KIDDIN: Knowledge Infusion and Distillation for Detection of INdecent Memes | — | 0
Reward Modeling with Ordinal Feedback: Wisdom of the Crowd | — | 0
KDC-MAE: Knowledge Distilled Contrastive Mask Auto-Encoder | — | 0
Page 22 of 170

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | — | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | — | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | — | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | — | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | — | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | — | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | — | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | — | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | — | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | — | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | — | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | — | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | — | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | — | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | — | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | — | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | — | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | — | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | — | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | — | Unverified