SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity may not be fully utilized, so a well-trained small "student" model can often recover much of a large "teacher" model's accuracy at a fraction of the inference cost. In the standard formulation, the student is trained to match the teacher's softened output distribution in addition to the ground-truth labels.
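
The following is a minimal sketch of that standard logit-matching loss (Hinton et al., 2015), assuming PyTorch; the temperature and weighting values are illustrative hyperparameters, not taken from any listed paper.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.9):
    """Weighted sum of a softened-softmax KL term and hard-label cross-entropy."""
    # Soft targets: KL divergence between the student's and teacher's
    # temperature-softened distributions; the T^2 factor rescales the
    # gradients back to the same magnitude as the hard-label term.
    soft = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * temperature ** 2
    # Hard targets: ordinary cross-entropy against the ground-truth labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard

if __name__ == "__main__":
    # Dummy logits stand in for teacher/student forward passes.
    student_logits = torch.randn(8, 100, requires_grad=True)
    teacher_logits = torch.randn(8, 100)
    labels = torch.randint(0, 100, (8,))
    loss = distillation_loss(student_logits, teacher_logits, labels)
    loss.backward()
    print(f"distillation loss: {loss.item():.4f}")
```

In practice the teacher is frozen (`teacher.eval()` with `torch.no_grad()` around its forward pass) so only the student receives gradients; many of the papers listed below replace or augment the KL term with feature-level, multi-teacher, or task-specific objectives.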

Papers

Showing papers 2351–2400 of 4240 (page 48 of 85)

Title | Status | Hype
FEED: Feature-level Ensemble for Knowledge Distillation | – | 0
Few-shot 3D LiDAR Semantic Segmentation for Autonomous Driving | – | 0
Few-shot Face Image Translation via GAN Prior Distillation | – | 0
Few-shot learning of neural networks from scratch by pseudo example optimization | – | 0
Few-Shot Object Detection by Knowledge Distillation Using Bag-of-Visual-Words Representations | – | 0
Optimizing Vision Transformers with Data-Free Knowledge Transfer | – | 0
Optimizing YOLOv5s Object Detection through Knowledge Distillation algorithm | – | 0
Oracle Teacher: Leveraging Target Information for Better Knowledge Distillation of CTC Models | – | 0
Orderly Dual-Teacher Knowledge Distillation for Lightweight Human Pose Estimation | – | 0
ORQA: A Benchmark and Foundation Model for Holistic Operating Room Modeling | – | 0
Overcoming Language Priors for Visual Question Answering Based on Knowledge Distillation | – | 0
P4: Towards private, personalized, and Peer-to-Peer learning | – | 0
Pacemaker: Intermediate Teacher Knowledge Distillation For On-The-Fly Convolutional Neural Network | – | 0
PAIR: Leveraging Passage-Centric Similarity Relation for Improving Dense Passage Retrieval | – | 0
Pan-infection Foundation Framework Enables Multiple Pathogen Prediction | – | 0
PANLP at MEDIQA 2019: Pre-trained Language Models, Transfer Learning and Knowledge Distillation | – | 0
Papago’s Submission for the WMT21 Quality Estimation Shared Task | – | 0
Paralinguistic Privacy Protection at the Edge | – | 0
Parameter-Efficient and Student-Friendly Knowledge Distillation | – | 0
Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition | – | 0
Parameter Efficient Diverse Paraphrase Generation Using Sequence-Level Knowledge Distillation | – | 0
Partial Knowledge Distillation for Alleviating the Inherent Inter-Class Discrepancy in Federated Learning | – | 0
Partial to Whole Knowledge Distillation: Progressive Distilling Decomposed Knowledge Boosts Student Better | – | 0
PC-LoRA: Low-Rank Adaptation for Progressive Model Compression with Knowledge Distillation | – | 0
PDALN: Progressive Domain Adaptation over a Pre-trained Model for Low-Resource Cross-Domain Named Entity Recognition | – | 0
Peak-Controlled Logits Poisoning Attack in Federated Distillation | – | 0
Pea-KD: Parameter-efficient and Accurate Knowledge Distillation on BERT | – | 0
Pea-KD: Parameter-efficient and accurate Knowledge Distillation | – | 0
Peak-First CTC: Reducing the Peak Latency of CTC Models by Applying Peak-First Regularization | – | 0
Peer Collaborative Learning for Polyphonic Sound Event Detection | – | 0
Learning to Maximize Speech Quality Directly Using MOS Prediction for Neural Text-to-Speech | – | 0
Performance-Aware Mutual Knowledge Distillation for Improving Neural Architecture Search | – | 0
Performance-Efficiency Trade-Offs in Adapting Language Models to Text Classification Tasks | – | 0
Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale | – | 0
Periocular Embedding Learning with Consistent Knowledge Distillation from Face | – | 0
Personalised Federated Learning: A Combinational Approach | – | 0
Personalized Decentralized Federated Learning with Knowledge Distillation | – | 0
PGX: A Multi-level GNN Explanation Framework Based on Separate Knowledge Distillation Processes | – | 0
PHI-S: Distribution Balancing for Label-Free Multi-Teacher Distillation | – | 0
PicoSAM2: Low-Latency Segmentation In-Sensor for Edge Vision Applications | – | 0
PILE: Pairwise Iterative Logits Ensemble for Multi-Teacher Labeled Distillation | – | 0
PIRB: A Comprehensive Benchmark of Polish Dense and Hybrid Text Retrieval Methods | – | 0
PISCO: Pretty Simple Compression for Retrieval-Augmented Generation | – | 0
Pixel Invisibility: Detecting Objects Invisible in Color Images | – | 0
P-KDGAN: Progressive Knowledge Distillation with GANs for One-class Novelty Detection | – | 0
PKD: General Distillation Framework for Object Detectors via Pearson Correlation Coefficient | – | 0
PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs | – | 0
PlaStIL: Plastic and Stable Memory-Free Class-Incremental Learning | – | 0
Plug-and-Play Interpretable Responsible Text-to-Image Generation via Dual-Space Multi-facet Concept Control | – | 0
Point Adversarial Self Mining: A Simple Method for Facial Expression Recognition | – | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T:BEiT-L S:ViT-B/14) | Top-1 accuracy (%) | 86.43 | – | Unverified
2 | ScaleKD (T:Swin-L S:ViT-B/16) | Top-1 accuracy (%) | 85.53 | – | Unverified
3 | ScaleKD (T:Swin-L S:ViT-S/16) | Top-1 accuracy (%) | 83.93 | – | Unverified
4 | ScaleKD (T:Swin-L S:Swin-T) | Top-1 accuracy (%) | 83.8 | – | Unverified
5 | KD++ (T:regnety-16GF S:ViT-B) | Top-1 accuracy (%) | 83.6 | – | Unverified
6 | VkD (T:RegNety 160 S:DeiT-S) | Top-1 accuracy (%) | 82.9 | – | Unverified
7 | SpectralKD (T:Swin-S S:Swin-T) | Top-1 accuracy (%) | 82.7 | – | Unverified
8 | ScaleKD (T:Swin-L S:ResNet-50) | Top-1 accuracy (%) | 82.55 | – | Unverified
9 | DiffKD (T:Swin-L S:Swin-T) | Top-1 accuracy (%) | 82.5 | – | Unverified
10 | DIST (T:Swin-L S:Swin-T) | Top-1 accuracy (%) | 82.3 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T:resnet-32x4 S:shufflenet-v2) | Top-1 accuracy (%) | 79.86 | – | Unverified
2 | shufflenet-v2 (T:resnet-32x4 S:shufflenet-v2) | Top-1 accuracy (%) | 78.76 | – | Unverified
3 | MV-MR (T:CLIP/ViT-B-16 S:resnet50) | Top-1 accuracy (%) | 78.6 | – | Unverified
4 | resnet8x4 (T:resnet32x4 S:resnet8x4) | Top-1 accuracy (%) | 78.28 | – | Unverified
5 | resnet8x4 (T:resnet32x4 S:resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | – | Unverified
6 | ReviewKD++ (T:resnet-32x4 S:shufflenet-v2) | Top-1 accuracy (%) | 77.93 | – | Unverified
7 | ReviewKD++ (T:resnet-32x4 S:shufflenet-v1) | Top-1 accuracy (%) | 77.68 | – | Unverified
8 | resnet8x4 (T:resnet32x4 S:resnet8x4) | Top-1 accuracy (%) | 77.5 | – | Unverified
9 | resnet8x4 (T:resnet32x4 S:resnet8x4) | Top-1 accuracy (%) | 76.68 | – | Unverified
10 | resnet8x4 (T:resnet32x4 S:resnet8x4) | Top-1 accuracy (%) | 76.31 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T:ResNet101 S:ResNet50) | mAP | 93.17 | – | Unverified
2 | LSHFM (T:ResNet101 S:MobileNetV2) | mAP | 90.14 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T:Adabins S:MobileNetV2) | RMSE | 2.43 | – | Unverified