SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.
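
The classic recipe (Hinton et al., 2015) trains the smaller "student" model to match temperature-softened output probabilities of the larger "teacher" alongside the usual hard-label loss. The sketch below shows that combined loss in PyTorch; it is a minimal illustration only, and the `temperature` and `alpha` defaults are assumed values for demonstration, not settings from any paper listed on this page.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, targets,
                      temperature=4.0, alpha=0.5):
    """Classic soft-target distillation loss (Hinton et al., 2015).

    Combines standard cross-entropy on the hard labels with a KL divergence
    between temperature-softened teacher and student distributions.
    `temperature` and `alpha` are illustrative defaults, not tuned values.
    """
    # Hard-label term: ordinary cross-entropy against the ground-truth labels.
    hard_loss = F.cross_entropy(student_logits, targets)

    # Soft-label term: KL divergence between softened distributions.
    # The T^2 factor keeps gradient magnitudes comparable across temperatures.
    soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    soft_loss = F.kl_div(soft_student, soft_teacher,
                         reduction="batchmean") * temperature ** 2

    # Weighted sum of the two terms.
    return alpha * hard_loss + (1.0 - alpha) * soft_loss
```

In a full training loop the teacher is kept frozen (evaluation mode, no gradients) and only the student's parameters are updated with this combined loss.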

Papers

Showing papers 3601–3650 of 4240

Title | Status | Hype
High Performance Natural Language Processing |  | 0
Hint-dynamic Knowledge Distillation |  | 0
HIRE: Distilling High-order Relational Knowledge From Heterogeneous Graph Neural Networks |  | 0
HKD4VLM: A Progressive Hybrid Knowledge Distillation Framework for Robust Multimodal Hallucination and Factuality Detection in VLMs |  | 0
Holistic Approach to Measure Sample-level Adversarial Vulnerability and its Utility in Building Trustworthy Systems |  | 0
HomoDistil: Homotopic Task-Agnostic Distillation of Pre-trained Transformers |  | 0
Homogenizing Non-IID datasets via In-Distribution Knowledge Distillation for Decentralized Learning |  | 0
HoverFast: an accurate, high-throughput, clinically deployable nuclear segmentation tool for brightfield digital pathology images |  | 0
How and When Adversarial Robustness Transfers in Knowledge Distillation? |  | 0
How Does Distilled Data Complexity Impact the Quality and Confidence of Non-Autoregressive Machine Translation? |  | 0
How many Observations are Enough? Knowledge Distillation for Trajectory Forecasting |  | 0
How Redundant Is the Transformer Stack in Speech Representation Models? |  | 0
How to Backdoor the Knowledge Distillation |  | 0
How to Prune Your Language Model: Recovering Accuracy on the "Sparsity May Cry" Benchmark |  | 0
How to Select One Among All? An Empirical Study Towards the Robustness of Knowledge Distillation in Natural Language Understanding |  | 0
HRPose: Real-Time High-Resolution 6D Pose Estimation Network Using Knowledge Distillation |  | 0
Human-Centered Prior-Guided and Task-Dependent Multi-Task Representation Learning for Action Recognition Pre-Training |  | 0
Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference |  | 0
Human in the Latent Loop (HILL): Interactively Guiding Model Training Through Human Intuition |  | 0
Human-Like Active Learning: Machines Simulating the Human Learning Process |  | 0
HW-TSC's Participation in the WMT 2020 News Translation Shared Task |  | 0
HW-TSC's Participation in the WMT 2021 Large-Scale Multilingual Translation Task |  | 0
HW-TSC's Participation in the WMT 2021 News Translation Shared Task |  | 0
Hybrid Paradigm-based Brain-Computer Interface for Robotic Arm Control |  | 0
HYDRA-FL: Hybrid Knowledge Distillation for Robust and Accurate Federated Learning |  | 0
HyperINR: A Fast and Predictive Hypernetwork for Implicit Neural Representations via Knowledge Distillation |  | 0
Hyperspectral Image Analysis in Single-Modal and Multimodal setting using Deep Learning Techniques |  | 0
I2CKD: Intra- and Inter-Class Knowledge Distillation for Semantic Segmentation |  | 0
I2D2: Inductive Knowledge Distillation with NeuroLogic and Self-Imitation |  | 0
I^2KD-SLU: An Intra-Inter Knowledge Distillation Framework for Zero-Shot Cross-Lingual Spoken Language Understanding |  | 0
IAG: Induction-Augmented Generation Framework for Answering Reasoning Questions |  | 0
ICD-Face: Intra-class Compactness Distillation for Face Recognition |  | 0
Cross-resolution Face Recognition via Identity-Preserving Network and Knowledge Distillation |  | 0
If At First You Don't Succeed: Test Time Re-ranking for Zero-shot, Cross-domain Retrieval |  | 0
IIE's Neural Machine Translation Systems for WMT20 |  | 0
IKD+: Reliable Low Complexity Deep Models For Retinopathy Classification |  | 0
IL-NeRF: Incremental Learning for Neural Radiance Fields with Camera Pose Alignment |  | 0
Image Restoration using Feature-guidance |  | 0
Image-to-Video Re-Identification via Mutual Discriminative Knowledge Transfer |  | 0
Attention-based Knowledge Distillation in Multi-attention Tasks: The Impact of a DCT-driven Loss |  | 0
Implicit Word Reordering with Knowledge Distillation for Cross-Lingual Dependency Parsing |  | 0
Impossible Triangle: What's Next for Pre-trained Language Models? |  | 0
Improved Cross-Lingual Transfer Learning For Automatic Speech Translation |  | 0
Improved Customer Transaction Classification using Semi-Supervised Knowledge Distillation |  | 0
Improved implicit diffusion model with knowledge distillation to estimate the spatial distribution density of carbon stock in remote sensing imagery |  | 0
Improved knowledge distillation by utilizing backward pass knowledge in neural networks |  | 0
Improved Knowledge Distillation for Pre-trained Language Models via Knowledge Selection |  | 0
Improved Knowledge Distillation via Adversarial Collaboration |  | 0
Improved Methods for Model Pruning and Knowledge Distillation |  | 0
Improved Synthetic Training for Reading Comprehension |  | 0
Page 73 of 85

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T:BEiT-L S:ViT-B/14) | Top-1 accuracy (%) | 86.43 | | Unverified
2 | ScaleKD (T:Swin-L S:ViT-B/16) | Top-1 accuracy (%) | 85.53 | | Unverified
3 | ScaleKD (T:Swin-L S:ViT-S/16) | Top-1 accuracy (%) | 83.93 | | Unverified
4 | ScaleKD (T:Swin-L S:Swin-T) | Top-1 accuracy (%) | 83.8 | | Unverified
5 | KD++ (T: regnety-16GF S:ViT-B) | Top-1 accuracy (%) | 83.6 | | Unverified
6 | VkD (T:RegNety 160 S:DeiT-S) | Top-1 accuracy (%) | 82.9 | | Unverified
7 | SpectralKD (T:Swin-S S:Swin-T) | Top-1 accuracy (%) | 82.7 | | Unverified
8 | ScaleKD (T:Swin-L S:ResNet-50) | Top-1 accuracy (%) | 82.55 | | Unverified
9 | DiffKD (T:Swin-L S: Swin-T) | Top-1 accuracy (%) | 82.5 | | Unverified
10 | DIST (T: Swin-L S: Swin-T) | Top-1 accuracy (%) | 82.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T:resnet-32x4, S:shufflenet-v2) | Top-1 accuracy (%) | 79.86 | | Unverified
2 | shufflenet-v2 (T:resnet-32x4, S:shufflenet-v2) | Top-1 accuracy (%) | 78.76 | | Unverified
3 | MV-MR (T: CLIP/ViT-B-16 S: resnet50) | Top-1 accuracy (%) | 78.6 | | Unverified
4 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 accuracy (%) | 78.28 | | Unverified
5 | resnet8x4 (T: resnet32x4 S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | | Unverified
6 | ReviewKD++ (T:resnet-32x4, S:shufflenet-v2) | Top-1 accuracy (%) | 77.93 | | Unverified
7 | ReviewKD++ (T:resnet-32x4, S:shufflenet-v1) | Top-1 accuracy (%) | 77.68 | | Unverified
8 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 accuracy (%) | 77.5 | | Unverified
9 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 accuracy (%) | 76.68 | | Unverified
10 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 accuracy (%) | 76.31 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101 S: ResNet50) | mAP | 93.17 | | Unverified
2 | LSHFM (T: ResNet101 S: MobileNetV2) | mAP | 90.14 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins S: MobileNetV2) | RMSE | 2.43 | | Unverified