Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2401–2450 of 4240 papers

Title	Date	Tasks	Status
Understanding and Improving Knowledge Distillation	Feb 10, 2020	Knowledge DistillationModel Compression	—Unverified
Understanding and Improving Lexical Choice in Non-Autoregressive Translation	Dec 29, 2020	Knowledge DistillationTranslation	—Unverified
Understanding Knowledge Distillation	Jan 1, 2021	Knowledge Distillation	—Unverified
Understanding Knowledge Distillation in Non-autoregressive Machine Translation	Nov 7, 2019	Knowledge DistillationMachine Translation	—Unverified
Understanding the Effect of Data Augmentation on Knowledge Distillation	May 21, 2023	Data AugmentationKnowledge Distillation	—Unverified
Understanding the Gains from Repeated Self-Distillation	Jul 5, 2024	Knowledge Distillationregression	—Unverified
Understanding the Overfitting of the Episodic Meta-training	Jun 29, 2023	Knowledge Distillation	—Unverified
Understanding the Success of Knowledge Distillation -- A Data Augmentation Perspective	Sep 29, 2021	Active LearningData Augmentation	—Unverified
UNDO: Understanding Distillation as Optimization	Apr 3, 2025	Knowledge Distillation	—Unverified
UniCompress: Enhancing Multi-Data Medical Image Compression with Knowledge Distillation	May 27, 2024	Image CompressionKnowledge Distillation	—Unverified
UNIDEAL: Curriculum Knowledge Distillation Federated Learning	Sep 16, 2023	Federated LearningKnowledge Distillation	—Unverified
Unified and Effective Ensemble Knowledge Distillation	Apr 1, 2022	Knowledge DistillationTransfer Learning	—Unverified
Unified Anomaly Detection methods on Edge Device using Knowledge Distillation and Quantization	Jul 3, 2024	Anomaly DetectionCPU	—Unverified
Unified Attacks to Large Language Model Watermarks: Spoofing and Scrubbing in Unauthorized Knowledge Distillation	Apr 24, 2025	Knowledge DistillationLanguage Modeling	—Unverified
Unified Locomotion Transformer with Simultaneous Sim-to-Real Transfer for Quadrupeds	Mar 12, 2025	Deep Reinforcement LearningKnowledge Distillation	—Unverified
UniKD: Universal Knowledge Distillation for Mimicking Homogeneous or Heterogeneous Object Detectors	Jan 1, 2023	Knowledge Distillation	—Unverified
Unimodal-driven Distillation in Multimodal Emotion Recognition with Dynamic Fusion	Mar 31, 2025	Emotion RecognitionKnowledge Distillation	—Unverified
UniMS: A Unified Framework for Multimodal Summarization with Knowledge Distillation	Sep 13, 2021	Abstractive Text SummarizationDecoder	—Unverified
Uni-Retriever: Towards Learning The Unified Embedding Based Retriever in Bing Sponsored Search	Feb 13, 2022	Contrastive LearningKnowledge Distillation	—Unverified
Dual-mode ASR: Unify and Improve Streaming ASR with Full-context Modeling	Oct 12, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Universal-KD: Attention-based Output-Grounded Intermediate Layer Knowledge Distillation	Nov 1, 2021	Knowledge Distillation	—Unverified
Unlabeled Data Deployment for Classification of Diabetic Retinopathy Images Using Knowledge Transfer	Feb 9, 2020	General ClassificationKnowledge Distillation	—Unverified
Unlearning Clients, Features and Samples in Vertical Federated Learning	Jan 23, 2025	Federated LearningInference Attack	—Unverified
Unlearning via Sparse Representations	Nov 26, 2023	Knowledge Distillation	—Unverified
Unleashing the Potential of Mamba: Boosting a LiDAR 3D Sparse Detector by Using Cross-Model Knowledge Distillation	Sep 17, 2024	3D Object DetectionAutonomous Driving	—Unverified
Unlimited Knowledge Distillation for Action Recognition in the Dark	Aug 18, 2023	Action RecognitionGPU	—Unverified
Unlocking Real-Time Fluorescence Lifetime Imaging: Multi-Pixel Parallelism for FPGA-Accelerated Processing	Oct 9, 2024	Knowledge DistillationScheduling	—Unverified
Unlock the Power: Competitive Distillation for Multi-Modal Large Language Models	Nov 14, 2023	Knowledge DistillationTransfer Learning	—Unverified
Unpaired Learning for Deep Image Deraining With Rain Direction Regularizer	Jan 1, 2021	Knowledge DistillationRain Removal	—Unverified
Unraveling Key Factors of Knowledge Distillation	Dec 14, 2023	Knowledge DistillationMachine Translation	—Unverified
Unseen Object Instance Segmentation with Fully Test-time RGB-D Embeddings Adaptation	Apr 21, 2022	Instance SegmentationKnowledge Distillation	—Unverified
Unsupervised 3D Perception with 2D Vision-Language Distillation for Autonomous Driving	Sep 25, 2023	Autonomous DrivingKnowledge Distillation	—Unverified
Unsupervised Deep Digital Staining For Microscopic Cell Images Via Knowledge Distillation	Mar 3, 2023	ColorizationKnowledge Distillation	—Unverified
Unsupervised Domain Adaptation for Segmentation with Black-box Source Model	Aug 16, 2022	Domain AdaptationKnowledge Distillation	—Unverified
Unsupervised Learning of Neural Networks to Explain Neural Networks (extended abstract)	Jan 21, 2019	Knowledge DistillationObject	—Unverified
Unsupervised Representation Transfer for Small Networks: I Believe I Can Distill On-the-Fly	Dec 1, 2021	Knowledge DistillationLinear evaluation	—Unverified
Unsupervised Text Style Transfer via LLMs and Attention Masking with Multi-way Interactions	Feb 21, 2024	In-Context LearningKnowledge Distillation	—Unverified
Unveiling Context-Aware Criteria in Self-Assessing LLMs	Oct 28, 2024	Knowledge Distillation	—Unverified
Unveiling Incomplete Modality Brain Tumor Segmentation: Leveraging Masked Predicted Auto-Encoder and Divergence Learning	Jun 12, 2024	Brain Tumor SegmentationKnowledge Distillation	—Unverified
Unveiling the Unseen Potential of Graph Learning through MLPs: Effective Graph Learners Using Propagation-Embracing MLPs	Nov 20, 2023	Graph LearningGraph Neural Network	—Unverified
Uplifting Range-View-based 3D Semantic Segmentation in Real-Time with Multi-Sensor Fusion	Jul 12, 2024	3D Semantic SegmentationAutonomous Driving	—Unverified
Using Advanced LLMs to Enhance Smaller LLMs: An Interpretable Knowledge Distillation Approach	Aug 13, 2024	Knowledge Distillation	—Unverified
Using a GAN to Generate Adversarial Examples to Facial Image Recognition	Nov 30, 2021	Face RecognitionGenerative Adversarial Network	—Unverified
Using Explainable Boosting Machine to Compare Idiographic and Nomothetic Approaches for Ecological Momentary Assessment Data	Apr 4, 2022	Interpretable Machine LearningKnowledge Distillation	—Unverified
Using Knowledge Distillation to improve interpretable models in a retail banking context	Sep 30, 2022	Data AugmentationKnowledge Distillation	—Unverified
Using Perturbed Length-aware Positional Encoding for Non-autoregressive Neural Machine Translation	Jul 29, 2021	Knowledge DistillationMachine Translation	—Unverified
Using the Past Knowledge to Improve Sentiment Classification	Nov 1, 2020	ClassificationKnowledge Distillation	—Unverified
V2X-VLM: End-to-End V2X Cooperative Autonomous Driving Through Large Vision-Language Models	Aug 17, 2024	Autonomous DrivingContrastive Learning	—Unverified
Vanilla Feature Distillation for Improving the Accuracy-Robustness Trade-Off in Adversarial Training	Jun 5, 2022	Knowledge Distillation	—Unverified
Variational Information Distillation for Knowledge Transfer	Apr 11, 2019	Knowledge DistillationTransfer Learning	—Unverified

Show:10 25 50

← PrevPage 49 of 85Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified