Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2951–3000 of 4240 papers

Title	Date	Tasks	Status
TrustAL: Trustworthy Active Learning using Knowledge Distillation	Jan 26, 2022	Active LearningDiversity	—Unverified
TSAK: Two-Stage Semantic-Aware Knowledge Distillation for Efficient Wearable Modality and Model Optimization in Manufacturing Lines	Aug 26, 2024	Activity RecognitionHuman Activity Recognition	—Unverified
TS-HTFA: Advancing Time Series Forecasting via Hierarchical Text-Free Alignment with Large Language Models	Sep 23, 2024	Contrastive Learningcross-modal alignment	—Unverified
TT-MPD: Test Time Model Pruning and Distillation	Dec 10, 2024	Knowledge Distillationmodel	—Unverified
TTT-KD: Test-Time Training for 3D Semantic Segmentation through Knowledge Distillation from Foundation Models	Mar 18, 2024	3D Semantic SegmentationKnowledge Distillation	—Unverified
Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis	Apr 20, 2025	2kKnowledge Distillation	—Unverified
TutorNet: Towards Flexible Knowledge Distillation for End-to-End Speech Recognition	Aug 3, 2020	Knowledge DistillationModel Compression	—Unverified
Twin Network Augmentation: A Novel Training Strategy for Improved Spiking Neural Networks and Efficient Weight Quantization	Sep 24, 2024	Knowledge DistillationQuantization	—Unverified
Two-in-one Knowledge Distillation for Efficient Facial Forgery Detection	Feb 21, 2023	Knowledge DistillationVocal Bursts Valence Prediction	—Unverified
Two-Pass End-to-End ASR Model Compression	Jan 8, 2022	DecoderKnowledge Distillation	—Unverified
Two-Stage Multi-task Self-Supervised Learning for Medical Image Segmentation	Feb 11, 2024	Auxiliary LearningImage Segmentation	—Unverified
Two-Step Knowledge Distillation for Tiny Speech Enhancement	Sep 15, 2023	Knowledge DistillationModel Compression	—Unverified
UB-FineNet: Urban Building Fine-grained Classification Network for Open-access Satellite Images	Mar 4, 2024	ClassificationDenoising	—Unverified
Multi-trial Neural Architecture Search with Lottery Tickets	Mar 8, 2022	Knowledge DistillationNeural Architecture Search	—Unverified
UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation	Nov 13, 2024	DecoderFew-Shot Object Detection	—Unverified
UKD: Debiasing Conversion Rate Estimation via Uncertainty-regularized Knowledge Distillation	Jan 20, 2022	Knowledge DistillationSelection bias	—Unverified
U-Know-DiffPAN: An Uncertainty-aware Knowledge Distillation Diffusion Framework with Details Enhancement for PAN-Sharpening	Dec 9, 2024	Knowledge Distillation	—Unverified
Ultrafast Video Attention Prediction with Coupled Knowledge Distillation	Apr 9, 2019	CPUGPU	—Unverified
Uncertainty-Aware Cross-Modal Knowledge Distillation with Prototype Learning for Multimodal Brain-Computer Interfaces	Jul 17, 2025	EEGKnowledge Distillation	—Unverified
Uncertainty-Aware Knowledge Distillation for Compact and Efficient 6DoF Pose Estimation	Mar 17, 2025	Autonomous NavigationKnowledge Distillation	—Unverified
Uncertainty-Aware Multi-Expert Knowledge Distillation for Imbalanced Disease Grading	May 1, 2025	Knowledge DistillationTransfer Learning	—Unverified
Uncertainty-Aware Multi-Shot Knowledge Distillation for Image-Based Object Re-Identification	Jan 15, 2020	Knowledge DistillationObject	—Unverified
Uncertainty-Guided Never-Ending Learning to Drive	Jan 1, 2024	Autonomous DrivingContinual Learning	—Unverified
Understanding Adversarial Attacks on Autoencoders	Jan 1, 2021	Compressive SensingKnowledge Distillation	—Unverified
Understanding and Improving Knowledge Distillation	Feb 10, 2020	Knowledge DistillationModel Compression	—Unverified
Understanding and Improving Lexical Choice in Non-Autoregressive Translation	Dec 29, 2020	Knowledge DistillationTranslation	—Unverified
Understanding Knowledge Distillation	Jan 1, 2021	Knowledge Distillation	—Unverified
Understanding Knowledge Distillation in Non-autoregressive Machine Translation	Nov 7, 2019	Knowledge DistillationMachine Translation	—Unverified
Understanding the Effect of Data Augmentation on Knowledge Distillation	May 21, 2023	Data AugmentationKnowledge Distillation	—Unverified
Understanding the Gains from Repeated Self-Distillation	Jul 5, 2024	Knowledge Distillationregression	—Unverified
Understanding the Overfitting of the Episodic Meta-training	Jun 29, 2023	Knowledge Distillation	—Unverified
Understanding the Success of Knowledge Distillation -- A Data Augmentation Perspective	Sep 29, 2021	Active LearningData Augmentation	—Unverified
UNDO: Understanding Distillation as Optimization	Apr 3, 2025	Knowledge Distillation	—Unverified
UniCompress: Enhancing Multi-Data Medical Image Compression with Knowledge Distillation	May 27, 2024	Image CompressionKnowledge Distillation	—Unverified
UNIDEAL: Curriculum Knowledge Distillation Federated Learning	Sep 16, 2023	Federated LearningKnowledge Distillation	—Unverified
Unified and Effective Ensemble Knowledge Distillation	Apr 1, 2022	Knowledge DistillationTransfer Learning	—Unverified
Unified Anomaly Detection methods on Edge Device using Knowledge Distillation and Quantization	Jul 3, 2024	Anomaly DetectionCPU	—Unverified
Unified Attacks to Large Language Model Watermarks: Spoofing and Scrubbing in Unauthorized Knowledge Distillation	Apr 24, 2025	Knowledge DistillationLanguage Modeling	—Unverified
Unified Locomotion Transformer with Simultaneous Sim-to-Real Transfer for Quadrupeds	Mar 12, 2025	Deep Reinforcement LearningKnowledge Distillation	—Unverified
UniKD: Universal Knowledge Distillation for Mimicking Homogeneous or Heterogeneous Object Detectors	Jan 1, 2023	Knowledge Distillation	—Unverified
Unimodal-driven Distillation in Multimodal Emotion Recognition with Dynamic Fusion	Mar 31, 2025	Emotion RecognitionKnowledge Distillation	—Unverified
UniMS: A Unified Framework for Multimodal Summarization with Knowledge Distillation	Sep 13, 2021	Abstractive Text SummarizationDecoder	—Unverified
Uni-Retriever: Towards Learning The Unified Embedding Based Retriever in Bing Sponsored Search	Feb 13, 2022	Contrastive LearningKnowledge Distillation	—Unverified
Dual-mode ASR: Unify and Improve Streaming ASR with Full-context Modeling	Oct 12, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Universal-KD: Attention-based Output-Grounded Intermediate Layer Knowledge Distillation	Nov 1, 2021	Knowledge Distillation	—Unverified
Unlabeled Data Deployment for Classification of Diabetic Retinopathy Images Using Knowledge Transfer	Feb 9, 2020	General ClassificationKnowledge Distillation	—Unverified
Unlearning Clients, Features and Samples in Vertical Federated Learning	Jan 23, 2025	Federated LearningInference Attack	—Unverified
Unlearning via Sparse Representations	Nov 26, 2023	Knowledge Distillation	—Unverified
Unleashing the Potential of Mamba: Boosting a LiDAR 3D Sparse Detector by Using Cross-Model Knowledge Distillation	Sep 17, 2024	3D Object DetectionAutonomous Driving	—Unverified
Unlimited Knowledge Distillation for Action Recognition in the Dark	Aug 18, 2023	Action RecognitionGPU	—Unverified

Show:10 25 50

← PrevPage 60 of 85Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified