Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3751–3800 of 4240 papers

Title	Date	Tasks	Status
Just KIDDIN: Knowledge Infusion and Distillation for Detection of INdecent Memes	Nov 19, 2024	Knowledge DistillationKnowledge Graphs	—Unverified
K-AID: Enhancing Pre-trained Language Models with Domain Knowledge for Question Answering	Sep 22, 2021	CPUKnowledge Distillation	—Unverified
KAT-V1: Kwai-AutoThink Technical Report	Jul 11, 2025	Knowledge DistillationLarge Language Model	—Unverified
KD^2M: An unifying framework for feature knowledge distillation	Apr 2, 2025	Knowledge Distillation	—Unverified
KDC-MAE: Knowledge Distilled Contrastive Mask Auto-Encoder	Nov 19, 2024	Contrastive LearningKnowledge Distillation	—Unverified
KDCTime: Knowledge Distillation with Calibration on InceptionTime for Time-series Classification	Dec 4, 2021	Knowledge DistillationTime Series	—Unverified
KD-DETR: Knowledge Distillation for Detection Transformer with Consistent Distillation Points Sampling	Jan 1, 2024	General KnowledgeKnowledge Distillation	—Unverified
KD-DLGAN: Data Limited Image Generation via Knowledge Distillation	Mar 30, 2023	DiversityImage Generation	—Unverified
KDExplainer: A Task-oriented Attention Model for Explaining Knowledge Distillation	May 10, 2021	Knowledge DistillationMixture-of-Experts	—Unverified
KD-FixMatch: Knowledge Distillation Siamese Neural Networks	Sep 11, 2023	Knowledge Distillation	—Unverified
KDGAN: Knowledge Distillation with Generative Adversarial Networks	Dec 1, 2018	Knowledge DistillationMulti-Label Learning	—Unverified
KDH-MLTC: Knowledge Distillation for Healthcare Multi-Label Text Classification	May 12, 2025	ClassificationHyperparameter Optimization	—Unverified
KDk: A Defense Mechanism Against Label Inference Attacks in Vertical Federated Learning	Apr 18, 2024	Federated LearningKnowledge Distillation	—Unverified
KDLSQ-BERT: A Quantized Bert Combining Knowledge Distillation with Learned Step Size Quantization	Jan 15, 2021	Knowledge DistillationLanguage Modelling	—Unverified
KDRL: Post-Training Reasoning LLMs via Unified Knowledge Distillation and Reinforcement Learning	Jun 2, 2025	Knowledge DistillationLarge Language Model	—Unverified
KDSM: An uplift modeling framework based on knowledge distillation and sample matching	Mar 6, 2023	counterfactualKnowledge Distillation	—Unverified
KDSTM: Neural Semi-supervised Topic Modeling with Knowledge Distillation	Jul 4, 2023	ClassificationKnowledge Distillation	—Unverified
KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation	Jan 16, 2022	cross-modal alignmentKnowledge Distillation	—Unverified
Keep Decoding Parallel with Effective Knowledge Distillation from Language Models to End-to-end Speech Recognisers	Jan 22, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Kendall's τ Coefficient for Logits Distillation	Sep 26, 2024	Knowledge Distillation	—Unverified
Kernel Based Progressive Distillation for Adder Neural Networks	Sep 28, 2020	Knowledge Distillation	—Unverified
Kernel Methods in Hyperbolic Spaces	Jan 1, 2021	Few-Shot Learningimage-classification	—Unverified
KEYword based Sampling (KEYS) for Large Language Models	May 30, 2023	Knowledge DistillationLanguage Modeling	—Unverified
KGEx: Explaining Knowledge Graph Embeddings via Subgraph Sampling and Knowledge Distillation	Oct 2, 2023	Knowledge DistillationKnowledge Graph Embeddings	—Unverified
Enhancing CLIP Conceptual Embedding through Knowledge Distillation	Dec 4, 2024	Contrastive LearningKnowledge Distillation	—Unverified
KnFu: Effective Knowledge Fusion	Mar 18, 2024	Federated LearningKnowledge Distillation	—Unverified
KNIFE: Distilling Reasoning Knowledge From Free-Text Rationales	Dec 19, 2022	Knowledge DistillationLanguage Modelling	—Unverified
Knowledge Adaptation for Efficient Semantic Segmentation	Mar 12, 2019	Knowledge DistillationSegmentation	—Unverified
Knowledge Adaptation: Teaching to Adapt	Feb 7, 2017	Domain AdaptationKnowledge Distillation	—Unverified
Knowledge as Priors: Cross-Modal Knowledge Generalization for Datasets without Superior Knowledge	Apr 1, 2020	3D Hand Pose EstimationHand Pose Estimation	—Unverified
Knowledge Concentration: Learning 100K Object Classifiers in a Single CNN	Nov 21, 2017	General Classificationimage-classification	—Unverified
Knowledge Cross-Distillation for Membership Privacy	Nov 2, 2021	Inference AttackKnowledge Distillation	—Unverified
Knowledge Distillation and Data Selection for Semi-Supervised Learning in CTC Acoustic Models	Aug 10, 2020	Knowledge Distillationspeech-recognition	—Unverified
Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions	Apr 20, 2025	Dataset DistillationDiversity	—Unverified
Knowledge Distillation and Enhanced Subdomain Adaptation Using Graph Convolutional Network for Resource-Constrained Bearing Fault Diagnosis	Jan 13, 2025	DiagnosticFault Diagnosis	—Unverified
Knowledge Distillation Applied to Optical Channel Equalization: Solving the Parallelization Problem of Recurrent Connection	Dec 8, 2022	Knowledge Distillation	—Unverified
Knowledge Distillation Label Smoothing: Fact or Fallacy?	Jan 30, 2023	Knowledge Distillationtext-classification	—Unverified
Knowledge Distillation as Self-Supervised Learning	Jan 17, 2022	Knowledge DistillationSelf-Supervised Learning	—Unverified
Knowledge Distillation: A Survey	Jun 9, 2020	Knowledge DistillationModel Compression	—Unverified
Knowledge Distillation: Bad Models Can Be Good Role Models	Mar 28, 2022	Knowledge DistillationLearning Theory	—Unverified
Knowledge Distillation based Contextual Relevance Matching for E-commerce Product Search	Oct 4, 2022	Knowledge Distillation	—Unverified
Knowledge Distillation based Ensemble Learning for Neural Machine Translation	Jan 1, 2021	Ensemble LearningKnowledge Distillation	—Unverified
Knowledge Distillation-based Information Sharing for Online Process Monitoring in Decentralized Manufacturing System	Feb 8, 2023	Knowledge Distillation	—Unverified
Knowledge Distillation Based Semantic Communications For Multiple Users	Nov 23, 2023	DecoderKnowledge Distillation	—Unverified
Knowledge Distillation Beyond Model Compression	Jul 3, 2020	Knowledge Distillationmodel	—Unverified
Knowledge Distillation Circumvents Nonlinearity for Optical Convolutional Neural Networks	Feb 26, 2021	Computational EfficiencyKnowledge Distillation	—Unverified
Knowledge Distillation-Empowered Digital Twin for Anomaly Detection	Sep 8, 2023	Anomaly DetectionKnowledge Distillation	—Unverified
Knowledge Distillation for 6D Pose Estimation by Aligning Distributions of Local Predictions	May 30, 2022	6D Pose Estimation6D Pose Estimation using RGB	—Unverified
Knowledge Distillation for Action Anticipation via Label Smoothing	Apr 16, 2020	Action AnticipationAutonomous Driving	—Unverified
Knowledge Distillation for Adaptive MRI Prostate Segmentation Based on Limit-Trained Multi-Teacher Models	Mar 16, 2023	Knowledge DistillationMRI segmentation	—Unverified

Show:10 25 50

← PrevPage 76 of 85Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified