
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity may not be fully utilized; a small model trained to mimic the large model's outputs can therefore often recover much of its performance at a fraction of the inference cost.
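
As a concrete illustration of the teacher-to-student transfer described above, here is a minimal sketch of the classic soft-label distillation loss (Hinton et al., 2015), written in PyTorch. The student matches the teacher's temperature-softened output distribution while still fitting the ground-truth labels. This is a generic sketch, not the method of any specific paper listed below; the temperature `T` and mixing weight `alpha` are illustrative hyperparameters.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Soft-label KD loss: alpha * soft (teacher-matching) + (1 - alpha) * hard (labels)."""
    # Soften both distributions with temperature T. The T**2 factor keeps
    # gradient magnitudes roughly constant as T varies (Hinton et al., 2015).
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    # Standard cross-entropy against the hard ground-truth labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard

# Typical training step: the frozen teacher provides targets, and only the
# student receives gradients.
# teacher.eval()
# with torch.no_grad():
#     teacher_logits = teacher(x)
# loss = distillation_loss(student(x), teacher_logits, y)
# loss.backward()
```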

Papers

Showing 2851–2900 of 4240 papers (page 58 of 85)

Every paper on this page currently has an empty Status and a Hype score of 0.

C3R: Channel Conditioned Cell Representations for unified evaluation in microscopy imaging
CAE-DFKD: Bridging the Transferability Gap in Data-Free Knowledge Distillation
CAKD: A Correlation-Aware Knowledge Distillation Framework Based on Decoupling Kullback-Leibler Divergence
CAMeMBERT: Cascading Assistant-Mediated Multilingual BERT
Can a student Large Language Model perform as well as it's teacher?
Can Current Explainability Help Provide References in Clinical Notes to Support Humans Annotate Medical Codes?
Can LLMs Revolutionize the Design of Explainable and Efficient TinyML Models?
Can Low-Rank Knowledge Distillation in LLMs be Useful for Microelectronic Reasoning?
Can Model Compression Improve NLP Fairness
Can Small Language Models be Good Reasoners for Sequential Recommendation?
Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought
Can Students Beyond The Teacher? Distilling Knowledge from Teacher's Bias
Can Students Outperform Teachers in Knowledge Distillation based Model Compression?
Can We Use Probing to Better Understand Fine-tuning and Knowledge Distillation of the BERT NLU?
CAP-GAN: Towards Adversarial Robustness with Cycle-consistent Attentional Purification
CapsuleRRT: Relationships-Aware Regression Tracking via Capsules
Capturing Rich Behavior Representations: A Dynamic Action Semantic-Aware Graph Transformer for Video Captioning
Cascaded channel pruning using hierarchical self-distillation
CASIA's System for IWSLT 2020 Open Domain Translation
CAST: Contrastive Adaptation and Distillation for Semi-Supervised Instance Segmentation
Categories of Response-Based, Feature-Based, and Relation-Based Knowledge Distillation
Causality Enhanced Origin-Destination Flow Prediction in Data-Scarce Cities
Causal Self-supervised Pretrained Frontend with Predictive Code for Speech Separation
Causes of Catastrophic Forgetting in Class-Incremental Semantic Segmentation
CBNN: 3-Party Secure Framework for Customized Binary Neural Networks Inference
CCFace: Classification Consistency for Low-Resolution Face Recognition
CCS: Continuous Learning for Customized Incremental Wireless Sensing Services
CDKT-FL: Cross-Device Knowledge Transfer using Proxy Dataset in Federated Learning
CEKD: Cross Ensemble Knowledge Distillation for Augmented Fine-grained Data
Centerness-based Instance-aware Knowledge Distillation with Task-wise Mutual Lifting for Object Detection on Drone Imagery
CES-KD: Curriculum-based Expert Selection for Guided Knowledge Distillation
Order of Compression: A Systematic and Optimal Sequence to Combinationally Compress CNN
Channel Fingerprint Construction for Massive MIMO: A Deep Conditional Generative Approach
Channel Planting for Deep Neural Networks using Knowledge Distillation
Channel Self-Supervision for Online Knowledge Distillation
CILDA: Contrastive Data Augmentation using Intermediate Layer Knowledge Distillation
Improving Acoustic Scene Classification with City Features
CK4Gen: A Knowledge Distillation Framework for Generating High-Utility Synthetic Survival Datasets in Healthcare
Claim Matching Beyond English to Scale Global Fact-Checking
Class-aware Information for Logit-based Knowledge Distillation
CLASSIC: Continual and Contrastive Learning of Aspect Sentiment Classification Tasks
Classification of Diabetic Retinopathy Using Unlabeled Data and Knowledge Distillation
Classification Under Misspecification: Halfspaces, Generalized Linear Models, and Evolvability
Class-Incremental Continual Learning into the eXtended DER-verse
Class-Incremental Few-Shot Event Detection
Class-Incremental Few-Shot Object Detection
Class-Incremental Learning for Action Recognition in Videos
Class-Incremental Learning of Plant and Disease Detection: Growing Branches with Knowledge Distillation
Class Incremental Learning with Self-Supervised Pre-Training and Prototype Learning
Class Incremental Online Streaming Learning

Benchmark Results

In the Model column, T: denotes the teacher and S: the student. None of the claimed results below have been independently verified.

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | n/a | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | n/a | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | n/a | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | n/a | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | n/a | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | n/a | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | n/a | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | n/a | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | n/a | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | n/a | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | n/a | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | n/a | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | n/a | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | n/a | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | n/a | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | n/a | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | n/a | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | n/a | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | n/a | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | n/a | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | n/a | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | n/a | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: AdaBins, S: MobileNetV2) | RMSE | 2.43 | n/a | Unverified