
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have greater knowledge capacity than small models, that capacity may not be fully utilized. Distillation trains a small student model to reproduce the outputs of a large teacher model, so that much of the teacher's accuracy can be retained at a fraction of the inference cost.
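A common formulation (following Hinton et al., 2015) trains the student on a weighted sum of the usual hard-label loss and a KL-divergence term matching the student's temperature-softened predictions to the teacher's. Below is a minimal PyTorch sketch; the function name and the hyperparameter values (temperature T, mixing weight alpha) are illustrative assumptions, not taken from any paper listed on this page.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Soft-target distillation loss (after Hinton et al., 2015)."""
    # Soften both output distributions with temperature T.
    log_p_student = F.log_softmax(student_logits / T, dim=-1)
    p_teacher = F.softmax(teacher_logits / T, dim=-1)
    # KL divergence between the softened distributions; the T^2 factor
    # keeps its gradient magnitude comparable to the hard-label term.
    soft_loss = F.kl_div(log_p_student, p_teacher, reduction="batchmean") * T * T
    # Standard cross-entropy against the ground-truth labels.
    hard_loss = F.cross_entropy(student_logits, labels)
    return alpha * soft_loss + (1.0 - alpha) * hard_loss

# Usage inside a training step: the teacher runs with gradients
# disabled, so only the student is updated.
# with torch.no_grad():
#     teacher_logits = teacher(x)
# loss = distillation_loss(student(x), teacher_logits, y)
```

A higher temperature exposes more of the teacher's "dark knowledge" in the relative probabilities it assigns to incorrect classes; values of T in roughly the 1-10 range are typical.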

Papers

Showing 2001–2050 of 4240 papers

Efficient Machine Translation with Model Pruning and Quantization
LightVessel: Exploring Lightweight Coronary Artery Vessel Segmentation via Similarity Knowledge Distillation
INDUS: Effective and Efficient Language Models for Scientific Applications
Industry Scale Semi-Supervised Learning for Natural Language Understanding
InfantCryNet: A Data-driven Framework for Intelligent Analysis of Infant Cries
InFiConD: Interactive No-code Fine-tuning with Concept-based Knowledge Distillation
Combining Curriculum Learning and Knowledge Distillation for Dialogue Generation
Information-Theoretic GAN Compression with Variational Energy-based Model
Combining Compressions for Multiplicative Size Scaling on Natural Language Tasks
ABC-KD: Attention-Based-Compression Knowledge Distillation for Deep Learning-Based Noise Suppression
Knowledge Distillation for 6D Pose Estimation by Aligning Distributions of Local Predictions
Knowledge Distillation for Adaptive MRI Prostate Segmentation Based on Limit-Trained Multi-Teacher Models
InhibiDistilbert: Knowledge Distillation for a ReLU and Addition-based Transformer
Initial Classifier Weights Replay for Memoryless Class Incremental Learning
Knowledge distillation for fast and accurate DNA sequence correction
Efficient Knowledge Distillation via Curriculum Extraction
Efficient Knowledge Distillation of SAM for Medical Image Segmentation
Injecting Spatial Information for Monaural Speech Enhancement via Knowledge Distillation
Inplace knowledge distillation with teacher assistant for improved training of flexible deep neural networks
In-situ animal behavior classification using knowledge distillation and fixed-point quantization
Instance-aware Model Ensemble With Distillation For Unsupervised Domain Adaptation
Collective Wisdom: Improving Low-resource Neural Machine Translation using Adaptive Knowledge Distillation
Efficient Knowledge Distillation: Empowering Small Language Models with Teacher Model Insights
Distill, Adapt, Distill: Training Small, In-Domain Models for Neural Machine Translation
Knowledge Distillation-based Information Sharing for Online Process Monitoring in Decentralized Manufacturing System
In Teacher We Trust: Learning Compressed Models for Pedestrian Detection
Collective Knowledge Graph Completion with Mutual Knowledge Distillation
Integrating Arithmetic Learning Improves Mathematical Reasoning in Smaller Models
Efficient Intent-Based Filtering for Multi-Party Conversations Using Knowledge Distillation from LLMs
Integration of Pre-trained Networks with Continuous Token Interface for End-to-End Spoken Language Understanding
A Survey on Green Deep Learning
Efficient Inference via Universal LSH Kernel
Efficient Image Compression Using Advanced State Space Models
Interactive Multi-fidelity Learning for Cost-effective Adaptation of Language Model with Sparse Human Supervision
Inter-KD: Intermediate Knowledge Distillation for CTC-Based Automatic Speech Recognition
Intermediate Distillation: Data-Efficient Distillation from Black-Box LLMs for Information Retrieval
Interpretable discovery of new semiconductors with machine learning
Distillation-Enabled Knowledge Alignment for Generative Semantic Communications in AIGC Provisioning Tasks
Knowledge Distillation based Ensemble Learning for Neural Machine Translation
Interpretable Traces, Unexpected Outcomes: Investigating the Disconnect in Trace-Based Knowledge Distillation
Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning
Bring the Power of Diffusion Model to Defect Detection
Efficient Gravitational Wave Parameter Estimation via Knowledge Distillation: A ResNet1D-IAF Approach
Interruption-Aware Cooperative Perception for V2X Communication-Aided Autonomous Driving
CoLLD: Contrastive Layer-to-layer Distillation for Compressing Multilingual Pre-trained Speech Encoders
Efficient Federated Learning for AIoT Applications Using Knowledge Distillation
Collaborative Teacher-Student Learning via Multiple Knowledge Transfer
A survey on efficient vision transformers: algorithms, techniques, and performance benchmarking
Efficient Evaluation-Time Uncertainty Estimation by Improved Distillation
Collaborative Multi-Teacher Knowledge Distillation for Learning Low Bit-width Deep Neural Networks
Page 41 of 85

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | – | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | – | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | – | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | – | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | – | Unverified
6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | – | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | – | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | – | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | – | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | – | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | – | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | – | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | – | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | – | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | – | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | – | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | – | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | – | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | – | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | – | Unverified