SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity is often not fully utilized, so a compact student trained to mimic the larger teacher can frequently approach the teacher's accuracy at a fraction of the inference cost.
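
As a point of reference for the papers and benchmarks listed below, here is a minimal sketch of the classic soft-target distillation loss (Hinton et al., 2015) in PyTorch-style Python. The function name, the default `temperature` and `alpha` values, and the frozen-teacher / trainable-student setup mentioned in the comments are illustrative assumptions, not taken from any specific entry on this page.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.9):
    """Soft-target knowledge distillation loss (Hinton et al., 2015).

    Blends a KL-divergence term between temperature-softened teacher and
    student distributions with the usual cross-entropy on hard labels.
    """
    # Teacher probabilities and student log-probabilities at temperature T.
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    log_soft_student = F.log_softmax(student_logits / temperature, dim=-1)

    # KL divergence, scaled by T^2 so gradient magnitudes stay comparable
    # regardless of the chosen temperature.
    kd_term = F.kl_div(log_soft_student, soft_teacher,
                       reduction="batchmean") * (temperature ** 2)

    # Standard supervised loss on the ground-truth labels.
    ce_term = F.cross_entropy(student_logits, labels)

    return alpha * kd_term + (1.0 - alpha) * ce_term

# Typical usage in a training step (teacher frozen, student trainable):
#     with torch.no_grad():
#         t_logits = teacher(images)
#     loss = distillation_loss(student(images), t_logits, labels)
#     loss.backward()
```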

Papers

Showing 3251–3300 of 4240 papers

Title | Status | Hype
Dynamic Self-Distillation via Previous Mini-batches for Fine-tuning Small Language Models | — | 0
Dynamic Textual Prompt For Rehearsal-free Lifelong Person Re-identification | — | 0
Dynamic Transformer Architecture for Continual Learning of Multimodal Tasks | — | 0
Dynamic Y-KD: A Hybrid Approach to Continual Instance Segmentation | — | 0
EasyDistill: A Comprehensive Toolkit for Effective Knowledge Distillation of Large Language Models | — | 0
EasyNLP: A Comprehensive and Easy-to-use Toolkit for Natural Language Processing | — | 0
ECAT: A Entire space Continual and Adaptive Transfer Learning Framework for Cross-Domain Recommendation | — | 0
ECG-guided individual identification via PPG | — | 0
EchoAtt: Attend, Copy, then Adjust for More Efficient Large Language Models | — | 0
EchoLM: Accelerating LLM Serving with Real-time Knowledge Distillation | — | 0
Edge AI-Enabled Chicken Health Detection Based on Enhanced FCOS-Lite and Knowledge Distillation | — | 0
Edge-Efficient Deep Learning Models for Automatic Modulation Classification: A Performance Analysis | — | 0
EdgeFormer: A Parameter-Efficient Transformer for On-Device Seq2seq Generation | — | 0
Edge-free but Structure-aware: Prototype-Guided Knowledge Distillation from GNNs to MLPs | — | 0
EdgeFusion: On-Device Text-to-Image Generation | — | 0
EDocNet: Efficient Datasheet Layout Analysis Based on Focus and Global Knowledge Distillation | — | 0
Education distillation: getting student models to learn in schools | — | 0
EduPal leaves no professor behind: Supporting faculty via a peer-powered recommender system | — | 0
EEGMobile: Enhancing Speed and Accuracy in EEG-Based Gaze Prediction with Advanced Mobile Architectures | — | 0
EFCM: Efficient Fine-tuning on Compressed Models for deployment of large models in medical image analysis | — | 0
Effective Decision Boundary Learning for Class Incremental Learning | — | 0
Effectiveness of Arbitrary Transfer Sets for Data-free Knowledge Distillation | — | 0
Effectiveness of Function Matching in Driving Scene Recognition | — | 0
Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations | — | 0
Efficiency optimization of large-scale language models based on deep learning in natural language processing tasks | — | 0
Efficient AI in Practice: Training and Deployment of Efficient LLMs for Industry Applications | — | 0
Efficient and Robust Knowledge Distillation from A Stronger Teacher Based on Correlation Matching | — | 0
Efficient Compression of Multitask Multilingual Speech Models | — | 0
Efficient Controllable Multi-Task Architectures | — | 0
Efficient Convolutional Neural Networks for Depth-Based Multi-Person Pose Estimation | — | 0
Efficient Evaluation-Time Uncertainty Estimation by Improved Distillation | — | 0
Efficient Federated Learning for AIoT Applications Using Knowledge Distillation | — | 0
Efficient Gravitational Wave Parameter Estimation via Knowledge Distillation: A ResNet1D-IAF Approach | — | 0
Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning | — | 0
Efficient Image Compression Using Advanced State Space Models | — | 0
Efficient Inference via Universal LSH Kernel | — | 0
Efficient Intent-Based Filtering for Multi-Party Conversations Using Knowledge Distillation from LLMs | — | 0
Efficient Knowledge Distillation: Empowering Small Language Models with Teacher Model Insights | — | 0
Efficient Knowledge Distillation of SAM for Medical Image Segmentation | — | 0
Efficient Knowledge Distillation via Curriculum Extraction | — | 0
Efficient Machine Translation with Model Pruning and Quantization | — | 0
Efficient Object Detection in Optical Remote Sensing Imagery via Attention-based Feature Distillation | — | 0
Efficient Open-world Reinforcement Learning via Knowledge Distillation and Autonomous Rule Discovery | — | 0
Efficient Point Cloud Classification via Offline Distillation Framework and Negative-Weight Self-Distillation Technique | — | 0
Efficient Speech Command Recognition Leveraging Spiking Neural Network and Curriculum Learning-based Knowledge Distillation | — | 0
Efficient speech detection in environmental audio using acoustic recognition and knowledge distillation | — | 0
Efficient Technical Term Translation: A Knowledge Distillation Approach for Parenthetical Terminology Translation | — | 0
Efficient training of lightweight neural networks using Online Self-Acquired Knowledge Distillation | — | 0
Efficient Transformer-based Large Scale Language Representations using Hardware-friendly Block Structured Pruning | — | 0
Efficient Transformer Knowledge Distillation: A Performance Review | — | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T:BEiT-L S:ViT-B/14) | Top-1 accuracy % | 86.43 | — | Unverified
2 | ScaleKD (T:Swin-L S:ViT-B/16) | Top-1 accuracy % | 85.53 | — | Unverified
3 | ScaleKD (T:Swin-L S:ViT-S/16) | Top-1 accuracy % | 83.93 | — | Unverified
4 | ScaleKD (T:Swin-L S:Swin-T) | Top-1 accuracy % | 83.8 | — | Unverified
5 | KD++ (T: regnety-16GF S:ViT-B) | Top-1 accuracy % | 83.6 | — | Unverified
6 | VkD (T:RegNety 160 S:DeiT-S) | Top-1 accuracy % | 82.9 | — | Unverified
7 | SpectralKD (T:Swin-S S:Swin-T) | Top-1 accuracy % | 82.7 | — | Unverified
8 | ScaleKD (T:Swin-L S:ResNet-50) | Top-1 accuracy % | 82.55 | — | Unverified
9 | DiffKD (T:Swin-L S: Swin-T) | Top-1 accuracy % | 82.5 | — | Unverified
10 | DIST (T: Swin-L S: Swin-T) | Top-1 accuracy % | 82.3 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T:resnet-32x4, S:shufflenet-v2) | Top-1 Accuracy (%) | 79.86 | — | Unverified
2 | shufflenet-v2 (T:resnet-32x4, S:shufflenet-v2) | Top-1 Accuracy (%) | 78.76 | — | Unverified
3 | MV-MR (T: CLIP/ViT-B-16 S: resnet50) | Top-1 Accuracy (%) | 78.6 | — | Unverified
4 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 Accuracy (%) | 78.28 | — | Unverified
5 | resnet8x4 (T: resnet32x4 S: resnet8x4 [modified]) | Top-1 Accuracy (%) | 78.08 | — | Unverified
6 | ReviewKD++ (T:resnet-32x4, S:shufflenet-v2) | Top-1 Accuracy (%) | 77.93 | — | Unverified
7 | ReviewKD++ (T:resnet-32x4, S:shufflenet-v1) | Top-1 Accuracy (%) | 77.68 | — | Unverified
8 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 Accuracy (%) | 77.5 | — | Unverified
9 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 Accuracy (%) | 76.68 | — | Unverified
10 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 Accuracy (%) | 76.31 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101 S: ResNet50) | mAP | 93.17 | — | Unverified
2 | LSHFM (T: ResNet101 S: MobileNetV2) | mAP | 90.14 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins S: MobileNetV2) | RMSE | 2.43 | — | Unverified