Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.
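As a rough illustration of how this transfer is often set up, the sketch below computes one common distillation objective: a weighted sum of the usual hard-label cross-entropy and a KL-divergence term that pulls the student's temperature-softened outputs toward the teacher's. This is an assumption about a typical formulation, not something the listing above prescribes. The function names and the `temperature` and `alpha` parameters are hypothetical choices for the example.

```python
import math

def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_logits, label,
                      temperature=4.0, alpha=0.5):
    """Weighted sum of hard-label cross-entropy and soft-target KL divergence.

    `temperature` and `alpha` are illustrative defaults (assumptions).
    """
    # Hard-label term: cross-entropy of the student at temperature 1.
    p_student = softmax(student_logits)
    hard = -math.log(p_student[label])

    # Soft-target term: KL(teacher || student) on softened distributions.
    q_teacher = softmax(teacher_logits, temperature)
    q_student = softmax(student_logits, temperature)
    soft = sum(t * math.log(t / s)
               for t, s in zip(q_teacher, q_student) if t > 0)

    # The T^2 factor keeps the soft term on a scale comparable to the hard term.
    return alpha * hard + (1 - alpha) * temperature ** 2 * soft
```

In this sketch a higher `temperature` spreads the teacher's probability mass over more classes, so the student also learns from the relative scores of the wrong classes. When the student's logits match the teacher's, the soft term is zero.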

Papers


Showing 1801–1825 of 4240 papers

Title	Date	Tasks	Status
Enhancing Single-Slice Segmentation with 3D-to-2D Unpaired Scan Distillation	Jun 18, 2024	Computed Tomography (CT), Knowledge Distillation	Unverified
Graph Knowledge Distillation to Mixture of Experts	Jun 17, 2024	Knowledge Distillation, Mixture-of-Experts	Code Available
NLDF: Neural Light Dynamic Fields for Efficient 3D Talking Head Generation	Jun 17, 2024	Knowledge Distillation, NeRF	Unverified
STEVE Series: Step-by-Step Construction of Agent Systems in Minecraft	Jun 17, 2024	Knowledge Distillation, Language Modeling	Unverified
Mutual Learning for Finetuning Click-Through Rate Prediction Models	Jun 17, 2024	Click-Through Rate Prediction, Knowledge Distillation	Unverified
Knowledge Distillation in Federated Learning: a Survey on Long Lasting Challenges and New Solutions	Jun 16, 2024	Federated Learning, Knowledge Distillation	Unverified
Self-Knowledge Distillation for Learning Ambiguity	Jun 14, 2024	Knowledge Distillation, Natural Language Understanding	Unverified
PC-LoRA: Low-Rank Adaptation for Progressive Model Compression with Knowledge Distillation	Jun 13, 2024	Knowledge Distillation, Model Compression	Unverified
Contextual Distillation Model for Diversified Recommendation	Jun 13, 2024	Diversity, Knowledge Distillation	Unverified
DistilDoc: Knowledge Distillation for Visually-Rich Document Applications	Jun 12, 2024	document-image-classification, Document Image Classification	Unverified
Adaptive Teaching with Shared Classifier for Knowledge Distillation	Jun 12, 2024	Knowledge Distillation	Code Available
Guiding Frame-Level CTC Alignments Using Self-knowledge Distillation	Jun 12, 2024	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available
Unveiling Incomplete Modality Brain Tumor Segmentation: Leveraging Masked Predicted Auto-Encoder and Divergence Learning	Jun 12, 2024	Brain Tumor Segmentation, Knowledge Distillation	Unverified
GenDistiller: Distilling Pre-trained Language Models based on an Autoregressive Generative Model	Jun 12, 2024	Knowledge Distillation, Self-Supervised Learning	Unverified
Low-Complexity Acoustic Scene Classification Using Parallel Attention-Convolution Network	Jun 12, 2024	Acoustic Scene Classification, Data Augmentation	Code Available
Self-Distillation Learning Based on Temporal-Spatial Consistency for Spiking Neural Networks	Jun 12, 2024	Knowledge Distillation	Unverified
FastAST: Accelerating Audio Spectrogram Transformer via Token Merging and Cross-Model Knowledge Distillation	Jun 11, 2024	Audio Classification, Knowledge Distillation	Code Available
TernaryLLM: Ternarized Large Language Model	Jun 11, 2024	Knowledge Distillation, Language Modeling	Unverified
Teaching with Uncertainty: Unleashing the Potential of Knowledge Distillation in Object Detection	Jun 11, 2024	Knowledge Distillation, object-detection	Unverified
Weighted KL-Divergence for Document Ranking Model Refinement	Jun 10, 2024	Contrastive Learning, Document Ranking	Unverified
BS-PLCNet 2: Two-stage Band-split Packet Loss Concealment Network with Intra-model Knowledge Distillation	Jun 10, 2024	Knowledge Distillation, Packet Loss Concealment	Unverified
Online Policy Distillation with Decision-Attention	Jun 8, 2024	Deep Reinforcement Learning, Knowledge Distillation	Unverified
Teaching-Assistant-in-the-Loop: Improving Knowledge Distillation from Imperfect Teacher Models in Low-Budget Scenarios	Jun 8, 2024	Knowledge Distillation	Unverified
Data-Free Generative Replay for Class-Incremental Learning on Imbalanced Data	Jun 7, 2024	class-incremental learning, Class Incremental Learning	Code Available
IOR: Inversed Objects Replay for Incremental Object Detection	Jun 7, 2024	Knowledge Distillation, Object	Unverified


Datasets: ImageNet, CIFAR-100, COCO (Common Objects in Context), COCO 2017 val, PASCAL VOC, KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified