Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2701–2750 of 4240 papers

Title	Date	Tasks	Status
A "Network Pruning Network" Approach to Deep Model Compression	Jan 15, 2020	Knowledge DistillationModel Compression	—Unverified
A New Method to Capturing Compositional Knowledge in Linguistic Space	Dec 20, 2024	Image RetrievalKnowledge Distillation	—Unverified
An Extra RMSNorm is All You Need for Fine Tuning to 1.58 Bits	May 12, 2025	AllKnowledge Distillation	—Unverified
An Interpretable Neuron Embedding for Static Knowledge Distillation	Nov 14, 2022	Knowledge Distillation	—Unverified
A Novel Algorithm for Personalized Federated Learning: Knowledge Distillation with Weighted Combination Loss	Apr 6, 2025	Federated LearningKnowledge Distillation	—Unverified
A Novel Approach To Implementing Knowledge Distillation In Tsetlin Machines	Apr 2, 2025	Knowledge Distillationtext-classification	—Unverified
A Novel Architecture Slimming Method for Network Pruning and Knowledge Distillation	Feb 21, 2022	Knowledge DistillationModel Compression	—Unverified
A novel channel pruning method for deep neural network compression	May 29, 2018	channel selectionCombinatorial Optimization	—Unverified
A Novel Garment Transfer Method Supervised by Distilled Knowledge of Virtual Try-on Model	Jan 23, 2024	DisentanglementKnowledge Distillation	—Unverified
A Novel Lightweight Transformer with Edge-Aware Fusion for Remote Sensing Image Captioning	Jun 11, 2025	DecoderImage Captioning	—Unverified
A Novel Local-Global Feature Fusion Framework for Body-weight Exercise Recognition with Pressure Mapping Sensors	Sep 14, 2023	Knowledge Distillationobject-detection	—Unverified
A Novel Self-Knowledge Distillation Approach with Siamese Representation Learning for Action Recognition	Sep 3, 2022	Action RecognitionKnowledge Distillation	—Unverified
A Novel Spike Transformer Network for Depth Estimation from Event Cameras via Cross-modality Knowledge Distillation	Apr 26, 2024	Depth EstimationKnowledge Distillation	—Unverified
An Overview of Neural Network Compression	Jun 5, 2020	Knowledge DistillationModel Compression	—Unverified
AntMan: Sparse Low-Rank Compression to Accelerate RNN inference	Oct 2, 2019	Knowledge DistillationLow-rank compression	—Unverified
An Unsupervised Multiple-Task and Multiple-Teacher Model for Cross-lingual Named Entity Recognition	May 1, 2022	Cross-Lingual NERKnowledge Distillation	—Unverified
APALU: A Trainable, Adaptive Activation Function for Deep Learning Networks	Feb 13, 2024	Anomaly DetectionDeep Learning	—Unverified
A Peek Into the Reasoning of Neural Networks: Interpreting with Structural Visual Concepts	May 1, 2021	Explainable artificial intelligenceKnowledge Distillation	—Unverified
A Plasticity-Aware Method for Continual Self-Supervised Learning in Remote Sensing	Mar 31, 2025	Continual Self-Supervised LearningKnowledge Distillation	—Unverified
Application of Knowledge Distillation to Multi-task Speech Representation Learning	Oct 29, 2022	Keyword SpottingKnowledge Distillation	—Unverified
Application of Vision-Language Model to Pedestrians Behavior and Scene Understanding in Autonomous Driving	Jan 12, 2025	Autonomous DrivingDecision Making	—Unverified
Applications of Knowledge Distillation in Remote Sensing: A Survey	Sep 18, 2024	Computational EfficiencyInstance Segmentation	—Unverified
Applied Federated Model Personalisation in the Industrial Domain: A Comparative Study	Sep 10, 2024	Active LearningFederated Learning	—Unverified
Apprenticeship-Inspired Elegance: Synergistic Knowledge Distillation Empowers Spiking Neural Networks for Efficient Single-Eye Emotion Recognition	Jun 20, 2024	Emotion RecognitionKnowledge Distillation	—Unverified
Apprentice: Using Knowledge Distillation Techniques To Improve Low-Precision Network Accuracy	Nov 15, 2017	image-classificationImage Classification	—Unverified
Apprentissage automatique de repr\'esentation de voix \`a l'aide d'une distillation de la connaissance pour le casting vocal (Learning voice representation using knowledge distillation for automatic voice casting )	Jun 1, 2020	Knowledge Distillation	—Unverified
A Practical Survey on Faster and Lighter Transformers	Mar 26, 2021	Knowledge DistillationSurvey	—Unverified
A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene	Apr 17, 2024	image-classificationImage Classification	—Unverified
ARDIR: Improving Robustness using Knowledge Distillation of Internal Representation	Nov 1, 2022	Knowledge Distillation	—Unverified
A Recipe for Efficient SBIR Models: Combining Relative Triplet Loss with Batch Normalization and Knowledge Distillation	May 30, 2023	Data AugmentationImage Retrieval	—Unverified
A Review on Discriminative Self-supervised Learning Methods in Computer Vision	May 8, 2024	ClusteringKnowledge Distillation	—Unverified
Artificial Behavior Intelligence: Technology, Challenges, and Future Directions	May 6, 2025	Autonomous DrivingEmotion Recognition	—Unverified
A scalable convolutional neural network for task-specified scenarios via knowledge distillation	Sep 19, 2016	Knowledge Distillation	—Unverified
A Selective Survey on Versatile Knowledge Distillation Paradigm for Neural Network Models	Nov 30, 2020	Knowledge DistillationModel Compression	—Unverified
A Short Study on Compressing Decoder-Based Language Models	Oct 16, 2021	DecoderKnowledge Distillation	—Unverified
A Simple and Generic Framework for Feature Distillation via Channel-wise Transformation	Mar 23, 2023	image-classificationImage Classification	—Unverified
A Simple but Effective BERT Model for Dialog State Tracking on Resource-Limited Systems	Oct 28, 2019	dialog state trackingDialogue State Tracking	—Unverified
SS-IL: Separated Softmax for Incremental Learning	Mar 31, 2020	class-incremental learningClass Incremental Learning	—Unverified
A Simple Linear Patch Revives Layer-Pruned Large Language Models	May 30, 2025	Knowledge DistillationQuestion Answering	—Unverified
A Simple Recipe for Competitive Low-compute Self supervised Vision Models	Jan 23, 2023	Knowledge Distillation	—Unverified
Asterisk*: Keep it Simple	Nov 8, 2024	ClassificationKnowledge Distillation	—Unverified
A Study of Non-autoregressive Model for Sequence Generation	Apr 22, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
A Study on Knowledge Distillation from Weak Teacher for Scaling Up Pre-trained Language Models	May 26, 2023	Knowledge Distillation	—Unverified
A Study on the Efficiency and Generalization of Light Hybrid Retrievers	Oct 4, 2022	Adversarial AttackContrastive Learning	—Unverified
A Survey of Methods for Low-Power Deep Learning and Computer Vision	Mar 24, 2020	Knowledge DistillationQuantization	—Unverified
A Survey of Model Compression and Acceleration for Deep Neural Networks	Oct 23, 2017	BenchmarkingKnowledge Distillation	—Unverified
A Survey of Techniques for Optimizing Transformer Inference	Jul 16, 2023	Knowledge DistillationNeural Architecture Search	—Unverified
A Survey on Deep Neural Network Compression: Challenges, Overview, and Solutions	Oct 5, 2020	Knowledge DistillationMiscellaneous	—Unverified
A survey on efficient vision transformers: algorithms, techniques, and performance benchmarking	Sep 5, 2023	BenchmarkingKnowledge Distillation	—Unverified
A Survey on Green Deep Learning	Nov 8, 2021	Deep LearningKnowledge Distillation	—Unverified

Show:10 25 50

← PrevPage 55 of 85Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified