Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2051–2100 of 4240 papers

Title	Date	Tasks	Status
Knowledge Distillation from Multiple Foundation Models for End-to-End Speech Recognition	Mar 20, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Efficient Compression of Multitask Multilingual Speech Models	May 2, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Collaborative Learning for Deep Neural Networks	May 30, 2018	Knowledge DistillationMulti-Task Learning	—Unverified
Efficient and Robust Knowledge Distillation from A Stronger Teacher Based on Correlation Matching	Oct 9, 2024	Knowledge DistillationNeural Network Compression	—Unverified
Collaborative Inter-agent Knowledge Distillation for Reinforcement Learning	Sep 25, 2019	Decision MakingKnowledge Distillation	—Unverified
A Survey of Techniques for Optimizing Transformer Inference	Jul 16, 2023	Knowledge DistillationNeural Architecture Search	—Unverified
Adverse Weather Optical Flow: Cumulative Homogeneous-Heterogeneous Adaptation	Sep 25, 2024	Domain AdaptationKnowledge Distillation	—Unverified
Efficient AI in Practice: Training and Deployment of Efficient LLMs for Industry Applications	Feb 20, 2025	Knowledge DistillationModel Compression	—Unverified
Collaborative Distillation in the Parameter and Spectrum Domains for Video Action Recognition	Sep 15, 2020	Action RecognitionKnowledge Distillation	—Unverified
Efficiency optimization of large-scale language models based on deep learning in natural language processing tasks	May 20, 2024	Inference OptimizationKnowledge Distillation	—Unverified
A Survey of Model Compression and Acceleration for Deep Neural Networks	Oct 23, 2017	BenchmarkingKnowledge Distillation	—Unverified
A Bayesian Optimization Framework for Neural Network Compression	Oct 1, 2019	Bayesian OptimizationKnowledge Distillation	—Unverified
Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations	Aug 10, 2019	Knowledge DistillationQuantization	—Unverified
Collaborative Distillation for Top-N Recommendation	Nov 13, 2019	Collaborative FilteringKnowledge Distillation	—Unverified
Effectiveness of Function Matching in Driving Scene Recognition	Aug 20, 2022	Autonomous Drivingimage-classification	—Unverified
A Survey of Methods for Low-Power Deep Learning and Computer Vision	Mar 24, 2020	Knowledge DistillationQuantization	—Unverified
A Study on the Efficiency and Generalization of Light Hybrid Retrievers	Oct 4, 2022	Adversarial AttackContrastive Learning	—Unverified
Effectiveness of Arbitrary Transfer Sets for Data-free Knowledge Distillation	Nov 18, 2020	Data-free Knowledge DistillationKnowledge Distillation	—Unverified
Effective Decision Boundary Learning for Class Incremental Learning	Jan 12, 2023	class-incremental learningClass Incremental Learning	—Unverified
EFCM: Efficient Fine-tuning on Compressed Models for deployment of large models in medical image analysis	Sep 18, 2024	Knowledge DistillationMedical Image Analysis	—Unverified
EEGMobile: Enhancing Speed and Accuracy in EEG-Based Gaze Prediction with Advanced Mobile Architectures	Aug 6, 2024	Brain Computer InterfaceEEG	—Unverified
Cold & Warm Net: Addressing Cold-Start Users in Recommender Systems	Sep 27, 2023	Knowledge DistillationMeta-Learning	—Unverified
Active Data Curation Effectively Distills Large-Scale Multimodal Models	Nov 27, 2024	DecoderImage Captioning	—Unverified
Knowledge Distillation from Few Samples	Sep 27, 2018	Knowledge Distillation	—Unverified
Knowledge Distillation from Non-streaming to Streaming ASR Encoder using Auxiliary Non-streaming Layer	Aug 31, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Knowledge Distillation Neural Network for Predicting Car-following Behaviour of Human-driven and Autonomous Vehicles	Nov 8, 2024	Autonomous VehiclesDescriptive	—Unverified
Knowledge Distillation via Weighted Ensemble of Teaching Assistants	Jun 23, 2022	Ensemble LearningKnowledge Distillation	—Unverified
LAMeTA: Intent-Aware Agentic Network Optimization via a Large AI Model-Empowered Two-Stage Approach	May 18, 2025	Deep Reinforcement LearningKnowledge Distillation	—Unverified
EduPal leaves no professor behind: Supporting faculty via a peer-powered recommender system	Apr 20, 2021	ChatbotKnowledge Distillation	—Unverified
A Study on Knowledge Distillation from Weak Teacher for Scaling Up Pre-trained Language Models	May 26, 2023	Knowledge Distillation	—Unverified
Education distillation:getting student models to learn in shcools	Nov 23, 2023	Incremental LearningKnowledge Distillation	—Unverified
EDocNet: Efficient Datasheet Layout Analysis Based on Focus and Global Knowledge Distillation	Feb 23, 2025	Document Layout AnalysisKnowledge Distillation	—Unverified
CoDERT: Distilling Encoder Representations with Co-learning for Transducer-based Speech Recognition	Jun 14, 2021	DecoderKnowledge Distillation	—Unverified
Active Class Incremental Learning for Imbalanced Datasets	Aug 25, 2020	class-incremental learningClass Incremental Learning	—Unverified
EdgeFusion: On-Device Text-to-Image Generation	Apr 18, 2024	Image GenerationKnowledge Distillation	—Unverified
CoCo DistillNet: a Cross-layer Correlation Distillation Network for Pathological Gastric Cancer Segmentation	Aug 27, 2021	Image SegmentationKnowledge Distillation	—Unverified
Edge-free but Structure-aware: Prototype-Guided Knowledge Distillation from GNNs to MLPs	Mar 24, 2023	Knowledge Distillation	—Unverified
EdgeFormer: A Parameter-Efficient Transformer for On-Device Seq2seq Generation	Feb 16, 2022	Grammatical Error CorrectionKnowledge Distillation	—Unverified
A Study of Non-autoregressive Model for Sequence Generation	Apr 22, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Knowledge Distillation for Underwater Feature Extraction and Matching via GAN-synthesized Images	Apr 11, 2025	General KnowledgeKnowledge Distillation	—Unverified
Edge-Efficient Deep Learning Models for Automatic Modulation Classification: A Performance Analysis	Apr 11, 2024	Knowledge DistillationModel Optimization	—Unverified
Edge AI-Enabled Chicken Health Detection Based on Enhanced FCOS-Lite and Knowledge Distillation	Jul 3, 2024	Knowledge DistillationQuantization	—Unverified
EchoLM: Accelerating LLM Serving with Real-time Knowledge Distillation	Jan 22, 2025	Knowledge DistillationResponse Generation	—Unverified
CMU’s IWSLT 2022 Dialect Speech Translation System	May 1, 2022	DecoderKnowledge Distillation	—Unverified
Adversarial Sparse Teacher: Defense Against Distillation-Based Model Stealing Attacks Using Adversarial Examples	Mar 8, 2024	Knowledge Distillation	—Unverified
EchoAtt: Attend, Copy, then Adjust for More Efficient Large Language Models	Sep 22, 2024	Knowledge Distillation	—Unverified
ECG-guided individual identification via PPG	Dec 30, 2024	Knowledge Distillation	—Unverified
ECAT: A Entire space Continual and Adaptive Transfer Learning Framework for Cross-Domain Recommendation	Jul 2, 2024	Domain AdaptationKnowledge Distillation	—Unverified
Asterisk*: Keep it Simple	Nov 8, 2024	ClassificationKnowledge Distillation	—Unverified
A baseline revisited: Pushing the limits of multi-segment models for context-aware translation	Oct 19, 2022	Knowledge DistillationTranslation	—Unverified

Show:10 25 50

← PrevPage 42 of 85Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified