Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1851–1875 of 4240 papers

Title	Date	Tasks	Status
Efficient Transformer Knowledge Distillation: A Performance Review	Nov 22, 2023	Knowledge DistillationModel Compression	—Unverified
Efficient Transformer-based Large Scale Language Representations using Hardware-friendly Block Structured Pruning	Sep 17, 2020	Edge-computingKnowledge Distillation	—Unverified
HoverFast: an accurate, high-throughput, clinically deployable nuclear segmentation tool for brightfield digital pathology images	May 22, 2024	GPUKnowledge Distillation	—Unverified
Compacting Deep Neural Networks for Internet of Things: Methods and Applications	Mar 20, 2021	DiversityKnowledge Distillation	—Unverified
Efficient training of lightweight neural networks using Online Self-Acquired Knowledge Distillation	Aug 26, 2021	Density EstimationKnowledge Distillation	—Unverified
How Does Distilled Data Complexity Impact the Quality and Confidence of Non-Autoregressive Machine Translation?	May 27, 2021	DiversityKnowledge Distillation	—Unverified
Deep Neural Network Models Compression	Mar 4, 2021	Knowledge DistillationQuantization	—Unverified
How many Observations are Enough? Knowledge Distillation for Trajectory Forecasting	Mar 9, 2022	Knowledge DistillationTrajectory Forecasting	—Unverified
Compact CNN Structure Learning by Knowledge Distillation	Apr 19, 2021	Knowledge DistillationModel Compression	—Unverified
How to Backdoor the Knowledge Distillation	Apr 30, 2025	Knowledge Distillation	—Unverified
A Survey on Transformer Compression	Feb 5, 2024	Knowledge DistillationMamba	—Unverified
How to Prune Your Language Model: Recovering Accuracy on the "Sparsity May Cry'' Benchmark	Dec 21, 2023	Knowledge DistillationLanguage Modeling	—Unverified
Compact CNN Models for On-device Ocular-based User Recognition in Mobile Devices	Oct 11, 2021	Knowledge DistillationNetwork Pruning	—Unverified
Efficient Technical Term Translation: A Knowledge Distillation Approach for Parenthetical Terminology Translation	Oct 1, 2024	Knowledge DistillationMachine Translation	—Unverified
A Survey on Symbolic Knowledge Distillation of Large Language Models	Jul 12, 2024	Knowledge DistillationSurvey	—Unverified
Amortized Noisy Channel Neural Machine Translation	Dec 16, 2021	Imitation LearningKnowledge Distillation	—Unverified
Deep Serial Number: Computational Watermarking for DNN Intellectual Property Protection	Nov 17, 2020	Knowledge Distillationvalid	—Unverified
HRPose: Real-Time High-Resolution 6D Pose Estimation Network Using Knowledge Distillation	Apr 20, 2022	6D Pose Estimation6D Pose Estimation using RGB	—Unverified
A Flexible Multi-Task Model for BERT Serving	Nov 16, 2021	Knowledge Distillationmodel	—Unverified
Human-Centered Prior-Guided and Task-Dependent Multi-Task Representation Learning for Action Recognition Pre-Training	Apr 27, 2022	Action RecognitionContrastive Learning	—Unverified
Incrementer: Transformer for Class-Incremental Semantic Segmentation With Knowledge Distillation Focusing on Old Class	Jan 1, 2023	Class-Incremental Semantic SegmentationDecoder	—Unverified
In Defense of the Learning Without Forgetting for Task Incremental Learning	Jul 26, 2021	Continual LearningIncremental Learning	—Unverified
Deep versus Wide: An Analysis of Student Architectures for Task-Agnostic Knowledge Distillation of Self-Supervised Speech Models	Jul 14, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Human in the Latent Loop (HILL): Interactively Guiding Model Training Through Human Intuition	May 9, 2025	Knowledge Distillation	—Unverified
InFiConD: Interactive No-code Fine-tuning with Concept-based Knowledge Distillation	Jun 25, 2024	Knowledge Distillation	—Unverified

Show:10 25 50

← PrevPage 75 of 170Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified