Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.
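As a rough illustration of what "transferring knowledge" can mean in practice, the sketch below computes a soft-target loss between teacher and student outputs. The formulation is an assumption, since this page does not specify a training objective: it uses the commonly used recipe of temperature-softened softmax plus KL divergence. The function names, logits, and temperature value are hypothetical.

```python
import math

def softmax(logits, temperature=1.0):
    """Convert logits to probabilities; a higher temperature gives a softer distribution."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by T^2 (assumed convention)."""
    p = softmax(teacher_logits, temperature)  # teacher soft targets
    q = softmax(student_logits, temperature)  # student predictions
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return kl * temperature ** 2

# Hypothetical logits for a 3-class problem.
teacher = [4.0, 1.0, 0.2]
student_far = [0.5, 2.0, 1.0]    # disagrees with the teacher
student_close = [3.5, 1.2, 0.3]  # roughly mimics the teacher

loss_far = distillation_loss(student_far, teacher)
loss_close = distillation_loss(student_close, teacher)
print(loss_close < loss_far)
```

In a real training loop this term would typically be combined with the ordinary supervised loss on ground-truth labels; the weighting between the two is a design choice not covered here.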

Papers

Showing 1701–1725 of 4240 papers

Title	Date	Tasks	Status
Empowering Knowledge Distillation via Open Set Recognition for Robust 3D Point Cloud Classification	Oct 25, 2020	3D Point Cloud Classification, General Classification	Unverified
Galileo at SemEval-2020 Task 12: Multi-lingual Learning for Offensive Language Identification using Pre-trained Language Models	Oct 7, 2020	All, Knowledge Distillation	Unverified
AfroXLMR-Comet: Multilingual Knowledge Distillation with Attention Matching for Low-Resource languages	Feb 25, 2025	Knowledge Distillation, Language Modeling	Unverified
GAN-Knowledge Distillation for one-stage Object Detection	Jun 20, 2019	Knowledge Distillation, Object	Unverified
Gap Preserving Distillation by Building Bidirectional Mappings with A Dynamic Teacher	Oct 5, 2024	Knowledge Distillation	Unverified
GazeGen: Gaze-Driven User Interaction for Visual Content Generation	Nov 7, 2024	Gaze Estimation, Knowledge Distillation	Unverified
How to Prune Your Language Model: Recovering Accuracy on the "Sparsity May Cry" Benchmark	Dec 21, 2023	Knowledge Distillation, Language Modeling	Unverified
How to Select One Among All ? An Empirical Study Towards the Robustness of Knowledge Distillation in Natural Language Understanding	Nov 1, 2021	Adversarial Robustness, All	Unverified
Empowering Dual-Encoder with Query Generator for Cross-Lingual Dense Retrieval	Mar 27, 2023	Knowledge Distillation, Retrieval	Unverified
Empirical Evaluation of Knowledge Distillation from Transformers to Subquadratic Language Models	Apr 19, 2025	Knowledge Distillation, State Space Models	Unverified
Complete-to-Partial 4D Distillation for Self-Supervised Point Cloud Sequence Representation Learning	Dec 10, 2022	Knowledge Distillation, Representation Learning	Unverified
Knowledge distillation for optimization of quantized deep neural networks	Sep 4, 2019	Knowledge Distillation	Unverified
Emo Pillars: Knowledge Distillation to Support Fine-Grained Context-Aware and Context-Less Emotion Classification	Apr 23, 2025	Emotion Classification, GPU	Unverified
Generalized Continual Zero-Shot Learning	Nov 17, 2020	Continual Learning, Knowledge Distillation	Unverified
Data Efficient Acoustic Scene Classification using Teacher-Informed Confusing Class Instruction	Sep 18, 2024	Acoustic Scene Classification, Data Augmentation	Unverified
A Framework for Double-Blind Federated Adaptation of Foundation Models	Feb 3, 2025	Federated Learning, image-classification	Unverified
Generalized Uncertainty of Deep Neural Networks: Taxonomy and Applications	Feb 2, 2023	Knowledge Distillation, Model Compression	Unverified
Data-efficient Event Camera Pre-training via Disentangled Masked Modeling	Mar 1, 2024	Knowledge Distillation, Self-Supervised Learning	Unverified
Embracing the Dark Knowledge: Domain Generalization Using Regularized Knowledge Distillation	Jul 6, 2021	Domain Generalization, image-classification	Unverified
General Purpose Text Embeddings from Pre-trained Language Models for Scalable Inference	Apr 29, 2020	Knowledge Distillation, Quantization	Unverified
Generate, Annotate, and Learn: Generative Models Advance Self-Training and Knowledge Distillation	Sep 29, 2021	Few-Shot Learning, Knowledge Distillation	Unverified
Generating Long Financial Report using Conditional Variational Autoencoders with Knowledge Distillation	Oct 23, 2020	Decoder, Knowledge Distillation	Unverified
Generating Synthetic Fair Syntax-agnostic Data by Learning and Distilling Fair Representation	Aug 20, 2024	Fairness, Knowledge Distillation	Unverified
Generation and Consolidation of Recollections for Efficient Deep Lifelong Learning	Jan 1, 2018	Knowledge Distillation, Lifelong learning	Unverified
EmbedDistill: A Geometric Knowledge Distillation for Information Retrieval	Jan 27, 2023	Information Retrieval, Knowledge Distillation	Unverified


Datasets: ImageNet, CIFAR-100, COCO (Common Objects in Context), COCO 2017 val, PASCAL VOC, KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified