Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.
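As a rough illustration of how such a transfer can be set up, the sketch below computes a soft-target distillation loss in plain Python. It assumes the classic temperature-scaled formulation (a KL term between softened teacher and student outputs plus a cross-entropy term on the true label), which the description above does not itself specify. The function names and the `temperature` and `alpha` defaults are illustrative choices, not values taken from this page.

```python
import math

def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_logits, label,
                      temperature=4.0, alpha=0.5):
    """Weighted sum of a soft-target KL term and a hard-label cross-entropy term.

    temperature and alpha are hypothetical defaults chosen for illustration.
    """
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    # KL(teacher || student) on the softened distributions
    kl = sum(pt * (math.log(pt) - math.log(ps))
             for pt, ps in zip(p_teacher, p_student) if pt > 0)
    # Cross-entropy of the student's unsoftened prediction against the true label
    ce = -math.log(softmax(student_logits)[label])
    # The T^2 factor keeps the soft-term gradients on a comparable scale
    return alpha * temperature ** 2 * kl + (1 - alpha) * ce

# Example logits for a 3-class problem (made-up numbers)
teacher = [6.0, 2.0, -1.0]
student = [3.0, 1.5, 0.5]
loss = distillation_loss(student, teacher, label=0)
```

With `alpha=1.0` the loss reduces to the soft-target term alone, which is zero when the student's logits match the teacher's. In practice the same quantity would be computed on batches inside a deep learning framework and minimized with respect to the student's parameters.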

Papers


Showing 1951–2000 of 4240 papers

Title	Date	Tasks	Status
Efficient Speech Command Recognition Leveraging Spiking Neural Network and Curriculum Learning-based Knowledge Distillation	Dec 17, 2024	Edge-computing, Knowledge Distillation	Unverified
Confidence-aware Self-Semantic Distillation on Knowledge Graph Embedding	Jun 7, 2022	Graph Embedding, Knowledge Distillation	Unverified
Improving Mathematical Reasoning Capabilities of Small Language Models via Feedback-Driven Distillation	Nov 22, 2024	Knowledge Distillation, Mathematical Reasoning	Unverified
Improving Multi-Task Deep Neural Networks via Knowledge Distillation for Natural Language Understanding	Apr 20, 2019	Ensemble Learning, Knowledge Distillation	Unverified
Batch Selection and Communication for Active Learning with Edge Labeling	Nov 14, 2023	Active Learning, Knowledge Distillation	Unverified
Active Large Language Model-based Knowledge Distillation for Session-based Recommendation	Dec 15, 2024	Active Learning, Knowledge Distillation	Unverified
Improving Neural Machine Translation by Denoising Training	Jan 19, 2022	Denoising, Knowledge Distillation	Unverified
Improving Neural ODEs via Knowledge Distillation	Mar 10, 2022	Knowledge Distillation	Unverified
Efficient Point Cloud Classification via Offline Distillation Framework and Negative-Weight Self-Distillation Technique	Sep 3, 2024	Data Augmentation, Knowledge Distillation	Unverified
Diffusion-Augmented Coreset Expansion for Scalable Dataset Distillation	Dec 5, 2024	Bilevel Optimization, Computational Efficiency	Unverified
Efficient Open-world Reinforcement Learning via Knowledge Distillation and Autonomous Rule Discovery	Nov 24, 2023	Deep Reinforcement Learning, Knowledge Distillation	Unverified
Improving Pronunciation and Accent Conversion through Knowledge Distillation And Synthetic Ground-Truth from Native TTS	Oct 19, 2024	Knowledge Distillation	Unverified
DiffusionTalker: Personalization and Acceleration for Speech-Driven 3D Face Diffuser	Nov 28, 2023	3D Face Animation, Contrastive Learning	Unverified
Towards Complementary Knowledge Distillation for Efficient Dense Image Prediction	Jan 24, 2024	Implicit Relations, Instance Segmentation	Unverified
ComKD-CLIP: Comprehensive Knowledge Distillation for Contrastive Language-Image Pre-traning Model	Aug 8, 2024	Contrastive Learning, Knowledge Distillation	Unverified
Improving Route Choice Models by Incorporating Contextual Factors via Knowledge Distillation	Mar 27, 2019	Knowledge Distillation, Management	Unverified
Keep Decoding Parallel with Effective Knowledge Distillation from Language Models to End-to-end Speech Recognisers	Jan 22, 2024	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Efficient Object Detection in Optical Remote Sensing Imagery via Attention-based Feature Distillation	Oct 28, 2023	Knowledge Distillation, Object	Unverified
CoMBO: Conflict Mitigation via Branched Optimization for Class Incremental Segmentation	Jan 1, 2025	Knowledge Distillation, Semantic Segmentation	Unverified
Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task	Jul 12, 2021	Decoder, Knowledge Distillation	Unverified
A Survey on Model Compression for Large Language Models	Aug 15, 2023	Benchmarking, Knowledge Distillation	Unverified
Improving Task-Agnostic BERT Distillation with Layer Mapping Search	Dec 11, 2020	Knowledge Distillation	Unverified
KDSM: An uplift modeling framework based on knowledge distillation and sample matching	Mar 6, 2023	counterfactual, Knowledge Distillation	Unverified
Improving the Interpretability of Deep Neural Networks with Knowledge Distillation	Dec 28, 2018	Ethics, Knowledge Distillation	Unverified
KDSTM: Neural Semi-supervised Topic Modeling with Knowledge Distillation	Jul 4, 2023	Classification, Knowledge Distillation	Unverified
Improving the Transferability of Adversarial Examples by Inverse Knowledge Distillation	Feb 24, 2025	Adversarial Attack, Diversity	Unverified
Improving Video Model Transfer With Dynamic Representation Learning	Jan 1, 2022	Action Classification, Knowledge Distillation	Unverified
Efficient Machine Translation with Model Pruning and Quantization	Nov 1, 2021	CPU, Decoder	Unverified
Combining Curriculum Learning and Knowledge Distillation for Dialogue Generation	Nov 1, 2021	Dialogue Generation, Knowledge Distillation	Unverified
Improving Zero-Shot Multilingual Text Generation via Iterative Distillation	Oct 1, 2022	Knowledge Distillation, Text Generation	Unverified
Combining Compressions for Multiplicative Size Scaling on Natural Language Tasks	Aug 20, 2022	Knowledge Distillation, Neural Network Compression	Unverified
In-Context Learning Distillation for Efficient Few-Shot Fine-Tuning	Dec 17, 2024	In-Context Learning, Knowledge Distillation	Unverified
ABC-KD: Attention-Based-Compression Knowledge Distillation for Deep Learning-Based Noise Suppression	May 26, 2023	Knowledge Distillation	Unverified
Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement through Knowledge Distillation	May 24, 2023	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation	Jan 16, 2022	cross-modal alignment, Knowledge Distillation	Unverified
Incremental Classifier Learning Based on PEDCC-Loss and Cosine Distance	Jun 11, 2019	Incremental Learning, Knowledge Distillation	Unverified
Incremental-DETR: Incremental Few-Shot Object Detection via Self-Supervised Learning	May 9, 2022	Few-Shot Object Detection, Knowledge Distillation	Unverified
Incremental Knowledge Based Question Answering	Jan 18, 2021	Incremental Learning, Knowledge Distillation	Unverified
Incremental Learning for End-to-End Automatic Speech Recognition	May 11, 2020	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Direct Distillation between Different Domains	Jan 12, 2024	Domain Adaptation, Knowledge Distillation	Unverified
Kendall's τ Coefficient for Logits Distillation	Sep 26, 2024	Knowledge Distillation	Unverified
Knowledge Adaptation for Efficient Semantic Segmentation	Mar 12, 2019	Knowledge Distillation, Segmentation	Unverified
Efficient Knowledge Distillation via Curriculum Extraction	Mar 21, 2025	Knowledge Distillation, Language Modeling	Unverified
Efficient Knowledge Distillation of SAM for Medical Image Segmentation	Jan 28, 2025	Computational Efficiency, Decoder	Unverified
Collective Wisdom: Improving Low-resource Neural Machine Translation using Adaptive Knowledge Distillation	Oct 12, 2020	Knowledge Distillation, Low Resource Neural Machine Translation	Unverified
Efficient Knowledge Distillation: Empowering Small Language Models with Teacher Model Insights	Sep 19, 2024	Decision Making, Knowledge Distillation	Unverified
Incrementer: Transformer for Class-Incremental Semantic Segmentation With Knowledge Distillation Focusing on Old Class	Jan 1, 2023	Class-Incremental Semantic Segmentation, Decoder	Unverified
DiReDi: Distillation and Reverse Distillation for AIoT Applications	Sep 12, 2024	Knowledge Distillation, Management	Unverified
Collective Knowledge Graph Completion with Mutual Knowledge Distillation	May 25, 2023	Knowledge Distillation, Knowledge Graph Completion	Unverified
Efficient Intent-Based Filtering for Multi-Party Conversations Using Knowledge Distillation from LLMs	Mar 21, 2025	intent-classification, Intent Classification	Unverified



Datasets: ImageNet, CIFAR-100, COCO (Common Objects in Context), COCO 2017 val, PASCAL VOC, KITTI

Benchmark Results

ImageNet
#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

CIFAR-100
#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

COCO (Common Objects in Context)
#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

COCO 2017 val
#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

PASCAL VOC
#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

KITTI
#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified