Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2451–2500 of 4240 papers

Title	Date	Tasks	Status
Variational Knowledge Distillation for Disease Classification in Chest X-Rays	Mar 19, 2021	ClassificationGeneral Classification	—Unverified
Variational Student: Learning Compact and Sparser Networks in Knowledge Distillation Framework	Oct 26, 2019	Knowledge DistillationVariational Inference	—Unverified
VEM^2L: A Plug-and-play Framework for Fusing Text and Structure Knowledge on Sparse Knowledge Graph Completion	Jul 4, 2022	Knowledge DistillationKnowledge Graph Completion	—Unverified
Vernacular? I Barely Know Her: Challenges with Style Control and Stereotyping	Jun 18, 2024	Knowledge Distillation	—Unverified
VIC-KD: Variance-Invariance-Covariance Knowledge Distillation to Make Keyword Spotting More Robust Against Adversarial Attacks	Sep 22, 2023	Adversarial RobustnessKeyword Spotting	—Unverified
VideoAdviser: Video Knowledge Distillation for Multimodal Transfer Learning	Sep 27, 2023	Knowledge Distillationregression	—Unverified
Vi-LAD: Vision-Language Attention Distillation for Socially-Aware Robot Navigation in Dynamic Environments	Mar 12, 2025	Knowledge DistillationMotion Planning	—Unverified
Vision-Based Detection of Uncooperative Targets and Components on Small Satellites	Aug 22, 2024	Knowledge Distillation	—Unverified
Vision Foundation Models in Medical Image Analysis: Advances and Challenges	Feb 20, 2025	Domain AdaptationFederated Learning	—Unverified
Vision-Language Models for Edge Networks: A Comprehensive Survey	Feb 11, 2025	Autonomous VehiclesImage Captioning	—Unverified
Visualizing the embedding space to explain the effect of knowledge distillation	Oct 9, 2021	Knowledge Distillation	—Unverified
Visualizing the Emergence of Intermediate Visual Patterns in DNNs	Nov 5, 2021	Knowledge Distillation	—Unverified
Visual-Language Model Knowledge Distillation Method for Image Quality Assessment	Jul 21, 2025	Image Quality AssessmentKnowledge Distillation	—Unverified
Visual-Policy Learning through Multi-Camera View to Single-Camera View Knowledge Distillation for Robot Manipulation Tasks	Mar 13, 2023	Data AugmentationKnowledge Distillation	—Unverified
Visual Relationship Detection Based on Guided Proposals and Semantic Knowledge Distillation	May 28, 2018	Common Sense ReasoningKnowledge Distillation	—Unverified
Visual Relationship Detection with Internal and External Linguistic Knowledge Distillation	Jul 28, 2017	Knowledge DistillationRelationship Detection	—Unverified
ViTKD: Practical Guidelines for ViT feature knowledge distillation	Sep 6, 2022	Image ClassificationKnowledge Distillation	—Unverified
VL2Lite: Task-Specific Knowledge Distillation from Large Vision-Language Models to Lightweight Networks	Jan 1, 2025	Classificationimage-classification	—Unverified
VLM-Assisted Continual learning for Visual Question Answering in Self-Driving	Feb 2, 2025	Autonomous DrivingContinual Learning	—Unverified
VLM-KD: Knowledge Distillation from VLM for Long-Tail Visual Recognition	Aug 29, 2024	Knowledge DistillationLanguage Modeling	—Unverified
VPBSD:Vessel-Pattern-Based Semi-Supervised Distillation for Efficient 3D Microscopic Cerebrovascular Segmentation	Nov 14, 2024	Brain SegmentationKnowledge Distillation	—Unverified
Wakening Past Concepts without Past Data: Class-incremental Learning from Placebos	Sep 29, 2021	class-incremental learningClass Incremental Learning	—Unverified
Wakening Past Concepts without Past Data: Class-Incremental Learning from Online Placebos	Oct 24, 2023	class-incremental learningClass Incremental Learning	—Unverified
Wake Vision: A Tailored Dataset and Benchmark Suite for TinyML Computer Vision Applications	May 1, 2024	Human DetectionKnowledge Distillation	—Unverified
Walsh-domain Neural Network for Power Amplifier Behavioral Modelling and Digital Predistortion	Feb 15, 2024	Knowledge Distillation	—Unverified
Wasserstein Contrastive Representation Distillation	Dec 15, 2020	Contrastive LearningKnowledge Distillation	—Unverified
Wavelet Knowledge Distillation: Towards Efficient Image-to-Image Translation	Mar 12, 2022	Image-to-Image TranslationKnowledge Distillation	—Unverified
WAVE: Weight Template for Adaptive Initialization of Variable-sized Models	Jun 25, 2024	Knowledge DistillationTransfer Learning	—Unverified
Weakly Supervised Cross-lingual Semantic Relation Classification via Knowledge Distillation	Nov 1, 2019	ClassificationCross-Lingual Transfer	—Unverified
Weakly Supervised Dense Video Captioning via Jointly Usage of Knowledge Distillation and Cross-modal Matching	May 18, 2021	Caption GenerationCross-Modal Retrieval	—Unverified
Weakly-Supervised Domain Adaptation of Deep Regression Trackers via Reinforced Knowledge Distillation	Mar 26, 2021	Domain AdaptationKnowledge Distillation	—Unverified
Weakly-supervised HOI Detection via Prior-guided Bi-level Representation Learning	Mar 2, 2023	Human-Object Interaction DetectionKnowledge Distillation	—Unverified
Weakly Supervised Monocular 3D Detection with a Single-View Image	Feb 29, 2024	Knowledge DistillationObject Localization	—Unverified
Weakly Supervised Semantic Segmentation via Alternative Self-Dual Teaching	Dec 17, 2021	Knowledge DistillationSemantic Segmentation	—Unverified
Weak-to-Strong Backdoor Attack for Large Language Models	Sep 26, 2024	Backdoor AttackKnowledge Distillation	—Unverified
Wearable Accelerometer Foundation Models for Health via Knowledge Distillation	Dec 15, 2024	Activity Recognitioncross-modal alignment	—Unverified
WebChild 2.0 : Fine-Grained Commonsense Knowledge Distillation	Jul 1, 2017	Knowledge DistillationSemantic Parsing	—Unverified
Web Content Filtering through knowledge distillation of Large Language Models	May 8, 2023	Knowledge Distillation	—Unverified
WebUOT-1M: Advancing Deep Underwater Object Tracking with A Million-Scale Benchmark	May 30, 2024	Knowledge DistillationObject Tracking	—Unverified
WeChat Neural Machine Translation Systems for WMT20	Oct 1, 2020	Knowledge DistillationMachine Translation	—Unverified
WeChat Neural Machine Translation Systems for WMT21	Aug 5, 2021	Knowledge DistillationMachine Translation	—Unverified
WeClick: Weakly-Supervised Video Semantic Segmentation with Click Annotations	Jul 7, 2021	Knowledge DistillationModel Compression	—Unverified
Weight Averaging: A Simple Yet Effective Method to Overcome Catastrophic Forgetting in Automatic Speech Recognition	Oct 27, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Weight Decay Scheduling and Knowledge Distillation for Active Learning	Aug 1, 2020	Active LearningKnowledge Distillation	—Unverified
Weight Distillation: Transferring the Knowledge in Neural Network Parameters	Sep 19, 2020	Knowledge DistillationMachine Translation	—Unverified
Weighted KL-Divergence for Document Ranking Model Refinement	Jun 10, 2024	Contrastive LearningDocument Ranking	—Unverified
Weight Squeezing: Reparameterization for Compression and Fast Inference	May 30, 2020	Knowledge DistillationModel Compression	—Unverified
Robustness Challenges in Model Distillation and Pruning for Natural Language Understanding	Oct 16, 2021	Knowledge DistillationModel Compression	—Unverified
What do larger image classifiers memorise?	Oct 9, 2023	image-classificationImage Classification	—Unverified
What Happens When Small Is Made Smaller? Exploring the Impact of Compression on Small Data Pretrained Language Models	Apr 6, 2024	Knowledge DistillationLanguage Modeling	—Unverified

Show:10 25 50

← PrevPage 50 of 85Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified