Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4151–4175 of 4240 papers

Title	Date	Tasks	Status
PaCKD: Pattern-Clustered Knowledge Distillation for Compressing Memory Access Prediction Models	Feb 21, 2024	image-classificationImage Classification	CodeCode Available
ERNIE-Tiny : A Progressive Distillation Framework for Pretrained Transformer Compression	Jun 4, 2021	Knowledge Distillation	CodeCode Available
Towards Understanding and Improving Knowledge Distillation for Neural Machine Translation	May 14, 2023	Knowledge DistillationMachine Translation	CodeCode Available
ERNIE 3.0 Tiny: Frustratingly Simple Method to Improve Task-Agnostic Distillation Generalization	Jan 9, 2023	Knowledge DistillationLanguage Modelling	CodeCode Available
Data-Free Adversarial Distillation	Dec 23, 2019	Knowledge DistillationModel Compression	CodeCode Available
ACT-Net: Asymmetric Co-Teacher Network for Semi-supervised Memory-efficient Medical Image Segmentation	Jul 5, 2022	Image SegmentationKnowledge Distillation	CodeCode Available
Teach Harder, Learn Poorer: Rethinking Hard Sample Distillation for GNN-to-MLP Knowledge Distillation	Jul 20, 2024	Knowledge Distillation	CodeCode Available
Ensemble Modeling with Contrastive Knowledge Distillation for Sequential Recommendation	Apr 28, 2023	AttributeContrastive Learning	CodeCode Available
Data exploitation: multi-task learning of object detection and semantic segmentation on partially annotated data	Nov 7, 2023	Knowledge DistillationMulti-Task Learning	CodeCode Available
Parallel Blockwise Knowledge Distillation for Deep Neural Network Compression	Dec 5, 2020	Knowledge DistillationNeural Network Compression	CodeCode Available
Align-to-Distill: Trainable Attention Alignment for Knowledge Distillation in Neural Machine Translation	Mar 3, 2024	Knowledge DistillationMachine Translation	CodeCode Available
DASK: Distribution Rehearsing via Adaptive Style Kernel Learning for Exemplar-Free Lifelong Person Re-Identification	Dec 12, 2024	Exemplar-FreeKnowledge Distillation	CodeCode Available
DAD++: Improved Data-free Test Time Adversarial Defense	Sep 10, 2023	Adversarial DefenseAdversarial Robustness	CodeCode Available
Ensemble Learning via Knowledge Transfer for CTR Prediction	Nov 25, 2024	Click-Through Rate PredictionEnsemble Learning	CodeCode Available
Aligning (Medical) LLMs for (Counterfactual) Fairness	Aug 22, 2024	counterfactualFairness	CodeCode Available
A Tailored Pre-Training Model for Task-Oriented Dialog Generation	Apr 24, 2020	Knowledge DistillationLanguage Modeling	CodeCode Available
Ensemble Knowledge Distillation for Learning Improved and Efficient Networks	Sep 17, 2019	Ensemble LearningGeneral Classification	CodeCode Available
Ensemble diverse hypotheses and knowledge distillation for unsupervised cross-subject adaptation	Apr 15, 2022	Activity RecognitionDomain Adaptation	CodeCode Available
Patient Knowledge Distillation for BERT Model Compression	Aug 25, 2019	Knowledge Distillationmodel	CodeCode Available
Ensemble Distillation for Robust Model Fusion in Federated Learning	Jun 12, 2020	BIG-bench Machine LearningFederated Learning	CodeCode Available
Enhancing Weakly-Supervised Histopathology Image Segmentation with Knowledge Distillation on MIL-Based Pseudo-Labels	Jul 14, 2024	Image SegmentationKnowledge Distillation	CodeCode Available
Enhancing TinyBERT for Financial Sentiment Analysis Using GPT-Augmented FinBERT Distillation	Sep 19, 2024	Data AugmentationEdge-computing	CodeCode Available
DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs	Oct 6, 2024	Domain AdaptationKnowledge Distillation	CodeCode Available
Self-supervised Knowledge Distillation Using Singular Value Decomposition	Jul 18, 2018	Knowledge DistillationTransfer Learning	CodeCode Available
Enhancing Scene Classification in Cloudy Image Scenarios: A Collaborative Transfer Method with Information Regulation Mechanism using Optical Cloud-Covered and SAR Remote Sensing Images	Jan 8, 2025	Cloud RemovalKnowledge Distillation	CodeCode Available

Show:10 25 50

← PrevPage 167 of 170Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified