
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.
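
In the standard recipe (Hinton et al., 2015), a small student network is trained to reproduce the temperature-softened output distribution of a large teacher while still fitting the ground-truth labels. Below is a minimal PyTorch sketch of that combined loss; the teacher, student, optimizer, and data are assumed placeholders, and the temperature T and mixing weight alpha are illustrative defaults rather than values taken from any paper listed here.

import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.5):
    # Soften both output distributions with temperature T, then penalize
    # the KL divergence from teacher to student. The T*T factor keeps
    # gradient magnitudes comparable across temperatures (Hinton et al.).
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    # Ordinary cross-entropy against the hard ground-truth labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard

# One illustrative training step (teacher, student, optimizer assumed):
# teacher.eval()
# with torch.no_grad():
#     teacher_logits = teacher(images)
# loss = distillation_loss(student(images), teacher_logits, labels)
# loss.backward()
# optimizer.step()

The softened KL term is what transfers the teacher's "dark knowledge", i.e. the relative probabilities it assigns to incorrect classes, which is why a distilled student typically outperforms the same architecture trained on hard labels alone.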

Papers

Showing 2351–2400 of 4240 papers

Title (none of the papers on this page carries a status badge, and all show a hype score of 0)
Trace-of-Thought Prompting: Investigating Prompt-Based Knowledge Distillation Through Question Decomposition
Training an LLM-as-a-Judge Model: Pipeline, Insights, and Practical Lessons
Training Domain Draft Models for Speculative Decoding: Best Practices and Insights
Training Self-localization Models for Unseen Unfamiliar Places via Teacher-to-Student Data-Free Knowledge Transfer
Training Shallow and Thin Networks for Acceleration via Knowledge Distillation with Conditional Adversarial Networks
Adversarial Speaker Distillation for Countermeasure Model on Automatic Speaker Verification
TransFair: Transferring Fairness from Ocular Disease Classification to Progression Prediction
Transferable Deployment of Semantic Edge Inference Systems via Unsupervised Domain Adaption
Transfer Learning with Pre-trained Conditional Generative Models
Transferring Knowledge from Structure-aware Self-attention Language Model to Sequence-to-Sequence Semantic Parsing
Transferring Learning Trajectories of Neural Networks
Transformer-based ASR Incorporating Time-reduction Layer and Fine-tuning with Self-Knowledge Distillation
Transformer-Based Fault-Tolerant Control for Fixed-Wing UAVs Using Knowledge Distillation and In-Context Adaptation
Transforming In-Vehicle Network Intrusion Detection: VAE-based Knowledge Distillation Meets Explainable AI
TransformMix: Learning Transformation and Mixing Strategies from Data
Translate-Distill: Learning Cross-Language Dense Retrieval by Translation and Distillation
Tree Knowledge Distillation for Compressing Transformer-Based Language Models
Tree-Like Decision Distillation
TriDeNT: Triple Deep Network Training for Privileged Knowledge Distillation in Histopathology
Trigger is Not Sufficient: Exploiting Frame-aware Knowledge for Implicit Event Argument Extraction
TRILLsson: Distilled Universal Paralinguistic Speech Representations
Tripartite Weight-Space Ensemble for Few-Shot Class-Incremental Learning
TripLe: Revisiting Pretrained Model Reuse and Progressive Learning for Efficient Vision Transformer Scaling and Searching
Triplet Knowledge Distillation
Triple-View Knowledge Distillation for Semi-Supervised Semantic Segmentation
TrustAL: Trustworthy Active Learning using Knowledge Distillation
TSAK: Two-Stage Semantic-Aware Knowledge Distillation for Efficient Wearable Modality and Model Optimization in Manufacturing Lines
TS-HTFA: Advancing Time Series Forecasting via Hierarchical Text-Free Alignment with Large Language Models
TT-MPD: Test Time Model Pruning and Distillation
TTT-KD: Test-Time Training for 3D Semantic Segmentation through Knowledge Distillation from Foundation Models
Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis
TutorNet: Towards Flexible Knowledge Distillation for End-to-End Speech Recognition
Twin Network Augmentation: A Novel Training Strategy for Improved Spiking Neural Networks and Efficient Weight Quantization
Two-in-one Knowledge Distillation for Efficient Facial Forgery Detection
Two-Pass End-to-End ASR Model Compression
Two-Stage Multi-task Self-Supervised Learning for Medical Image Segmentation
Two-Step Knowledge Distillation for Tiny Speech Enhancement
UB-FineNet: Urban Building Fine-grained Classification Network for Open-access Satellite Images
Multi-trial Neural Architecture Search with Lottery Tickets
UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation
UKD: Debiasing Conversion Rate Estimation via Uncertainty-regularized Knowledge Distillation
U-Know-DiffPAN: An Uncertainty-aware Knowledge Distillation Diffusion Framework with Details Enhancement for PAN-Sharpening
Ultrafast Video Attention Prediction with Coupled Knowledge Distillation
Uncertainty-Aware Cross-Modal Knowledge Distillation with Prototype Learning for Multimodal Brain-Computer Interfaces
Uncertainty-Aware Knowledge Distillation for Compact and Efficient 6DoF Pose Estimation
Uncertainty-Aware Multi-Expert Knowledge Distillation for Imbalanced Disease Grading
Uncertainty-Aware Multi-Shot Knowledge Distillation for Image-Based Object Re-Identification
Uncertainty-Guided Never-Ending Learning to Drive
Understanding Adversarial Attacks on Autoencoders
Page 48 of 85

Benchmark Results

# | Model (T = teacher, S = student) | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | – | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | – | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | – | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | – | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | – | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | – | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | – | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | – | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | – | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: ResNet-32x4, S: ShuffleNet-v2) | Top-1 accuracy (%) | 79.86 | – | Unverified
2 | ShuffleNet-v2 (T: ResNet-32x4, S: ShuffleNet-v2) | Top-1 accuracy (%) | 78.76 | – | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: ResNet-50) | Top-1 accuracy (%) | 78.6 | – | Unverified
4 | ResNet-8x4 (T: ResNet-32x4, S: ResNet-8x4) | Top-1 accuracy (%) | 78.28 | – | Unverified
5 | ResNet-8x4 (T: ResNet-32x4, S: ResNet-8x4 [modified]) | Top-1 accuracy (%) | 78.08 | – | Unverified
6 | ReviewKD++ (T: ResNet-32x4, S: ShuffleNet-v2) | Top-1 accuracy (%) | 77.93 | – | Unverified
7 | ReviewKD++ (T: ResNet-32x4, S: ShuffleNet-v1) | Top-1 accuracy (%) | 77.68 | – | Unverified
8 | ResNet-8x4 (T: ResNet-32x4, S: ResNet-8x4) | Top-1 accuracy (%) | 77.5 | – | Unverified
9 | ResNet-8x4 (T: ResNet-32x4, S: ResNet-8x4) | Top-1 accuracy (%) | 76.68 | – | Unverified
10 | ResNet-8x4 (T: ResNet-32x4, S: ResNet-8x4) | Top-1 accuracy (%) | 76.31 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet-101, S: ResNet-50) | mAP | 93.17 | – | Unverified
2 | LSHFM (T: ResNet-101, S: MobileNetV2) | mAP | 90.14 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: AdaBins, S: MobileNetV2) | RMSE | 2.43 | – | Unverified