Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2651–2700 of 4240 papers

Title	Date	Tasks	Status
A Gift From Knowledge Distillation: Fast Optimization, Network Minimization and Transfer Learning	Jul 1, 2017	Knowledge DistillationTransfer Learning	—Unverified
A Good Student is Cooperative and Reliable: CNN-Transformer Collaborative Learning for Semantic Segmentation	Jul 24, 2023	Knowledge DistillationSemantic Segmentation	—Unverified
AI can evolve without labels: self-evolving vision transformer for chest X-ray diagnosis through knowledge distillation	Feb 13, 2022	Deep LearningDiagnostic	—Unverified
AIDE: Agentically Improve Visual Language Model with Domain Experts	Feb 13, 2025	Knowledge DistillationLanguage Modeling	—Unverified
AI-KD: Adversarial learning and Implicit regularization for self-Knowledge Distillation	Nov 20, 2022	Knowledge DistillationSelf-Knowledge Distillation	—Unverified
AirNet: Neural Network Transmission over the Air	May 24, 2021	Knowledge Distillation	—Unverified
A Joint Sequential and Relational Model for Frame-Semantic Parsing	Sep 1, 2017	Knowledge DistillationMachine Translation	—Unverified
AKD : Adversarial Knowledge Distillation For Large Language Models Alignment on Coding tasks	May 5, 2025	Code CompletionCode Generation	—Unverified
A Knowledge Distillation Approach for Sepsis Outcome Prediction from Multivariate Clinical Time Series	Nov 16, 2023	Knowledge DistillationTime Series	—Unverified
A Knowledge Distillation-Based Backdoor Attack in Federated Learning	Aug 12, 2022	Backdoor AttackFederated Learning	—Unverified
A Knowledge Distillation framework for Multi-Organ Segmentation of Medaka Fish in Tomographic Image	Feb 24, 2023	Computed Tomography (CT)Image Segmentation	—Unverified
A Light-weight Deep Learning Model for Remote Sensing Image Classification	Feb 25, 2023	image-classificationImage Classification	—Unverified
A Lightweight Domain Adversarial Neural Network Based on Knowledge Distillation for EEG-based Cross-subject Emotion Recognition	May 12, 2023	EEGElectroencephalogram (EEG)	—Unverified
A Lightweight Low-Light Image Enhancement Network via Channel Prior and Gamma Correction	Feb 28, 2024	Image EnhancementKnowledge Distillation	—Unverified
A lightweight network for photovoltaic cell defect detection in electroluminescence images based on neural architecture search and knowledge distillation	Feb 15, 2023	Data AugmentationDefect Detection	—Unverified
AligNART: Non-autoregressive Neural Machine Translation by Jointly Learning to Estimate Alignment and Translate	Sep 14, 2021	DecoderKnowledge Distillation	—Unverified
AlignCap: Aligning Speech Emotion Captioning to Human Preferences	Oct 24, 2024	Knowledge DistillationLanguage Modeling	—Unverified
Aligned Weight Regularizers for Pruning Pretrained Neural Networks	Nov 16, 2021	Knowledge DistillationLanguage Modeling	—Unverified
Aligning in a Compact Space: Contrastive Knowledge Distillation between Heterogeneous Architectures	May 28, 2024	Contrastive LearningKnowledge Distillation	—Unverified
Aligning Teacher with Student Preferences for Tailored Training Data Generation	Jun 27, 2024	In-Context LearningKnowledge Distillation	—Unverified
Alignment Knowledge Distillation for Online Streaming Attention-based Speech Recognition	Feb 28, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Alleviating Catastrophic Forgetting of Incremental Object Detection via Within-Class and Between-Class Knowledge Distillation	Jan 1, 2023	Knowledge Distillationobject-detection	—Unverified
Alleviating LLM-based Generative Retrieval Hallucination in Alipay Search	Mar 27, 2025	HallucinationKnowledge Distillation	—Unverified
All You Need in Knowledge Distillation Is a Tailored Coordinate System	Dec 12, 2024	AllFew-Shot Learning	—Unverified
ALP-KD: Attention-Based Layer Projection for Knowledge Distillation	Dec 27, 2020	Knowledge Distillation	—Unverified
Always Strengthen Your Strengths: A Drift-Aware Incremental Learning Framework for CTR Prediction	Apr 17, 2023	Click-Through Rate PredictionDiversity	—Unverified
AMD: Adaptive Masked Distillation for Object Detection	Jan 31, 2023	Knowledge DistillationModel Compression	—Unverified
AMD: Automatic Multi-step Distillation of Large-scale Vision Models	Jul 5, 2024	image-classificationImage Classification	—Unverified
A method for estimating forest carbon storage distribution density via artificial intelligence generated content model	Feb 2, 2025	Knowledge Distillation	—Unverified
A metric learning approach for endoscopic kidney stone identification	Jul 13, 2023	Few-Shot LearningKnowledge Distillation	—Unverified
AMLN: Adversarial-based Mutual Learning Network for Online Knowledge Distillation	Aug 1, 2020	Knowledge DistillationTransfer Learning	—Unverified
Amortized Noisy Channel Neural Machine Translation	Dec 16, 2021	Imitation LearningKnowledge Distillation	—Unverified
AMTSS: An Adaptive Multi-Teacher Single-Student Knowledge Distillation Framework For Multilingual Language Inference	May 13, 2023	Knowledge Distillation	—Unverified
An Active Learning Framework for Inclusive Generation by Large Language Models	Oct 17, 2024	Active LearningClustering	—Unverified
Analyzing Compression Techniques for Computer Vision	May 14, 2023	Knowledge DistillationQuantization	—Unverified
Analyzing Knowledge Distillation in Neural Machine Translation	Oct 1, 2018	Knowledge DistillationMachine Translation	—Unverified
Analyzing the Importance of Blank for CTC-Based Knowledge Distillation	Jun 2, 2025	Automatic Speech RecognitionKnowledge Distillation	—Unverified
An Effective Deep Network for Head Pose Estimation without Keypoints	Oct 25, 2022	Gaze EstimationHead Pose Estimation	—Unverified
An Efficient Active Learning Pipeline for Legal Text Classification	Nov 15, 2022	Active LearningClassification	—Unverified
An Efficient Detection and Control System for Underwater Docking using Machine Learning and Realistic Simulation: A Comprehensive Approach	Nov 2, 2023	Generative Adversarial NetworkImage-to-Image Translation	—Unverified
An Efficient Federated Distillation Learning System for Multi-task Time Series Classification	Dec 30, 2021	Knowledge DistillationTime Series	—Unverified
An Efficient Method of Training Small Models for Regression Problems with Knowledge Distillation	Feb 28, 2020	Knowledge DistillationMemorization	—Unverified
An Efficient Private GPT Never Autoregressively Decodes	May 21, 2025	Knowledge Distillation	—Unverified
An Empirical Analysis of the Impact of Data Augmentation on Knowledge Distillation	Jun 6, 2020	Data AugmentationKnowledge Distillation	—Unverified
An Empirical Investigation into the Effect of Parameter Choices in Knowledge Distillation	Jan 12, 2024	Knowledge Distillation	—Unverified
An Empirical Study of Efficient ASR Rescoring with Transformers	Oct 24, 2019	Knowledge DistillationLanguage Modeling	—Unverified
An Empirical Study of Leveraging Knowledge Distillation for Compressing Multilingual Neural Machine Translation Models	Apr 19, 2023	Knowledge DistillationMachine Translation	—Unverified
An Empirical Study of Uniform-Architecture Knowledge Distillation in Document Ranking	Feb 8, 2023	Document RankingKnowledge Distillation	—Unverified
An Enhanced Low-Resolution Image Recognition Method for Traffic Environments	Sep 28, 2023	Computational EfficiencyKnowledge Distillation	—Unverified
An Ensemble of Knowledge Sharing Models for Dynamic Hand Gesture Recognition	Aug 13, 2020	Gesture RecognitionHand Gesture Recognition	—Unverified

Show:10 25 50

← PrevPage 54 of 85Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified