
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have greater knowledge capacity than small models, that capacity is often not fully utilized; distillation trains a compact student model to mimic the teacher's behavior, so much of the teacher's accuracy can be retained at a fraction of the size and inference cost.
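
For readers new to the topic, the sketch below shows the classic soft-target distillation loss of Hinton et al. (2015), which most of the papers listed here build on: the student is trained against a blend of the hard labels and the teacher's temperature-softened output distribution. The temperature, weighting, and tensor shapes are illustrative assumptions, not settings taken from any paper or benchmark on this page.

```python
# Minimal sketch of soft-target knowledge distillation (Hinton et al., 2015).
# Hyperparameters and shapes below are illustrative, not from any listed paper.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.5):
    """Blend cross-entropy on hard labels with KL divergence to the teacher."""
    soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    # The T^2 factor keeps soft-target gradients on the same scale as the hard-label term.
    kd = F.kl_div(soft_student, soft_teacher, reduction="batchmean") * temperature ** 2
    ce = F.cross_entropy(student_logits, labels)
    return alpha * kd + (1.0 - alpha) * ce

# Toy usage: random tensors stand in for teacher/student outputs on a batch of 8.
student_logits = torch.randn(8, 100)
teacher_logits = torch.randn(8, 100)   # would come from the frozen teacher in practice
labels = torch.randint(0, 100, (8,))
loss = distillation_loss(student_logits, teacher_logits, labels)
```

In practice the teacher's logits are computed under torch.no_grad() and only the student's parameters are updated.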

Papers

Showing 2401–2450 of 4240 papers

Title | Status | Hype
Population-Based Evolutionary Gaming for Unsupervised Person Re-identification | | 0
Regularized Evolutionary Population-Based Training | | 0
Pose-Guided Feature Learning with Knowledge Distillation for Occluded Person Re-Identification | | 0
Pose Uncertainty Aware Movement Synchrony Estimation via Spatial-Temporal Graph Transformer | | 0
Positive-Unlabeled Data Purification in the Wild for Object Detection | | 0
Poster: Self-Supervised Quantization-Aware Knowledge Distillation | | 0
PPC-GPT: Federated Task-Specific Compression of Large Language Models via Pruning and Chain-of-Thought Distillation | | 0
PP-StructureV2: A Stronger Document Analysis System | | 0
PQDAST: Depth-Aware Arbitrary Style Transfer for Games via Perceptual Quality-Guided Distillation | | 0
PQK: Model Compression via Pruning, Quantization, and Knowledge Distillation | | 0
Practical Insights into Knowledge Distillation for Pre-Trained Models | | 0
Practical Knowledge Distillation: Using DNNs to Beat DNNs | | 0
PRAL: A Tailored Pre-Training Model for Task-Oriented Dialog Generation | | 0
Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation | | 0
Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison | | 0
Preserving Node Distinctness in Graph Autoencoders via Similarity Distillation | | 0
Preserving Privacy in Federated Learning with Ensemble Cross-Domain Knowledge Distillation | | 0
Pre-trained Language Model and Knowledge Distillation for Lightweight Sequential Recommendation | | 0
Pre-trained Model Guided Mixture Knowledge Distillation for Adversarial Federated Learning | | 0
Pre-trained Model Representations and their Robustness against Noise for Speech Emotion Analysis | | 0
Pre-Trained Vision-Language Models as Partial Annotators | | 0
Pre-training Distillation for Large Language Models: A Design Space Exploration | | 0
Pre-Training Graph Contrastive Masked Autoencoders are Strong Distillers for EEG | | 0
Preventing Catastrophic Forgetting and Distribution Mismatch in Knowledge Distillation via Synthetic Data | | 0
Preventing Distillation-based Attacks on Neural Network IP | | 0
Preview-based Category Contrastive Learning for Knowledge Distillation | | 0
Prime-Aware Adaptive Distillation | | 0
Prior knowledge distillation based on financial time series | | 0
Prior Knowledge Distillation Network for Face Super-Resolution | | 0
Prior Knowledge Guided Network for Video Anomaly Detection | | 0
Privacy Distillation: Reducing Re-identification Risk of Multimodal Diffusion Models | | 0
Privacy-Preserving Federated Learning with Consistency via Knowledge Distillation Using Conditional Generator | | 0
Privacy-preserving Fine-tuning of Large Language Models through Flatness | | 0
Private Deep Learning with Teacher Ensembles | | 0
Private Model Compression via Knowledge Distillation | | 0
Privileged Knowledge Distillation for Online Action Detection | | 0
Proactive Detection and Calibration of Seasonal Advertisements with Multimodal Large Language Models | | 0
Proactive Guidance of Multi-Turn Conversation in Industrial Search | | 0
Proactive Sequence Generator via Knowledge Acquisition | | 0
Probabilistic Integration of Object Level Annotations in Chest X-ray Classification | | 0
Probabilistic Knowledge Distillation of Face Ensembles | | 0
Probabilistic Self-supervised Learning via Scoring Rules Minimization | | 0
PROD: Progressive Distillation for Dense Retrieval | | 0
ProFe: Communication-Efficient Decentralized Federated Learning via Distillation and Prototypes | | 0
Progressive Class-level Distillation | | 0
Progressive Collaborative and Semantic Knowledge Fusion for Generative Recommendation | | 0
Progressive Cross-modal Knowledge Distillation for Human Action Recognition | | 0
Progressive distillation induces an implicit curriculum | | 0
Progressive Label Distillation: Learning Input-Efficient Deep Neural Networks | | 0
ProKD: An Unsupervised Prototypical Knowledge Distillation Network for Zero-Resource Cross-Lingual Named Entity Recognition | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 79.86 | | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 78.76 | | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 Accuracy (%) | 78.6 | | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 78.28 | | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 Accuracy (%) | 78.08 | | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 77.93 | | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 Accuracy (%) | 77.68 | | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 77.5 | | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.68 | | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.31 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | | Unverified