
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity may not be fully utilized, which is why a well-trained student model can often recover much of the teacher's accuracy at a fraction of the inference cost.
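
As a concrete illustration, here is a minimal sketch of the classic soft-target distillation loss of Hinton et al. (2015), written in PyTorch. The temperature T and mixing weight alpha are illustrative hyperparameters, not values taken from any paper listed below.

```python
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Blend KL divergence to the teacher's softened outputs with
    ordinary cross-entropy on the ground-truth labels."""
    # Soften both distributions with temperature T; the T**2 factor
    # keeps gradient magnitudes comparable across temperatures.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard
```

During training, the teacher runs in inference mode to produce teacher_logits, and only the student's parameters are updated.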

Papers

Showing 3951–4000 of 4240 papers

Every paper on this page has a Hype score of 0 and no status flag:

Learning to Augment for Data-Scarce Domain BERT Knowledge Distillation
Learning to Extract Attribute Value from Product via Question Answering: A Multi-task Approach
Learning to Project for Cross-Task Knowledge Distillation
Learning to reconstruct signals with inexact sensing operator via knowledge distillation
Learning to Retain while Acquiring: Combating Distribution-Shift in Adversarial Data-Free Knowledge Distillation
Learning to Specialize with Knowledge Distillation for Visual Question Answering
Learning to Teach with Student Feedback
Learning ULMFiT and Self-Distillation with Calibration for Medical Dialogue System
Learning Using Generated Privileged Information by Text-to-Image Diffusion Models
Teaching What You Should Teach: A Data-Based Distillation Method
Learning with Less: Knowledge Distillation from Large Language Models via Unlabeled Data
Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition
Learn to Talk via Proactive Knowledge Transfer
Leave No Knowledge Behind During Knowledge Distillation: Towards Practical and Effective Knowledge Distillation for Code-Switching ASR Using Realistic Data
Legal-Tech Open Diaries: Lesson learned on how to develop and deploy light-weight models in the era of humongous Language Models
LegoDNN: Block-grained Scaling of Deep Neural Networks for Mobile Vision
LENS-XAI: Redefining Lightweight and Explainable Network Security through Knowledge Distillation and Variational Autoencoders for Scalable Intrusion Detection in Cybersecurity
Less is More: Efficient Brain-Inspired Learning for Autonomous Driving Trajectory Prediction
Less or More From Teacher: Exploiting Trilateral Geometry For Knowledge Distillation
Let Video Teaches You More: Video-to-Image Knowledge Distillation using DEtection TRansformer for Medical Video Lesion Detection
Letz Translate: Low-Resource Machine Translation for Luxembourgish
Leukocyte Classification using Multimodal Architecture Enhanced by Knowledge Distillation
Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and language Models for Intent Classification
Leveraging Advantages of Interactive and Non-Interactive Models for Vector-Based Cross-Lingual Information Retrieval
Leveraging Angular Distributions for Improved Knowledge Distillation
Leveraging ASR Pretrained Conformers for Speaker Verification through Transfer Learning and Knowledge Distillation
Leveraging Conditional Mutual Information to Improve Large Language Model Fine-Tuning For Classification
Leveraging Different Learning Styles for Improved Knowledge Distillation in Biomedical Imaging
Leveraging Expert Models for Training Deep Neural Networks in Scarce Data Domains: Application to Offline Handwritten Signature Verification
FTSmartAudit: A Knowledge Distillation-Enhanced Framework for Automated Smart Contract Auditing Using Fine-Tuned LLMs
Leveraging Foundation Models To learn the shape of semi-fluid deformable objects
Leveraging Knowledge Distillation for Lightweight Skin Cancer Classification: Balancing Accuracy and Computational Efficiency
Leveraging Large Language Models for Enhanced NLP Task Performance through Knowledge Distillation and Optimized Training Strategies
Leveraging Recent Advances in Deep Learning for Audio-Visual Emotion Recognition
Li3DeTr: A LiDAR based 3D Detection Transformer
Life-Code: Central Dogma Modeling with Multi-Omics Sequence Unification
Lifelong GAN: Continual Learning for Conditional Image Generation
Lifelong Intent Detection via Multi-Strategy Rebalancing
Life-long Learning for Multilingual Neural Machine Translation with Knowledge Distillation
Lifelong Learning for Neural powered Mixed Integer Programming
Lifelong Learning via Progressive Distillation and Retrospection
Lifelong Object Detection
Lifelong Person Search
Lifelong Twin Generative Adversarial Networks
Lifelong Unsupervised Domain Adaptive Person Re-identification with Coordinated Anti-forgetting and Adaptation
LightBTSeg: A lightweight breast tumor segmentation model using ultrasound images via dual-path joint knowledge distillation
Light distillation for Incremental Graph Convolution Collaborative Filtering
LightPAFF: A Two-Stage Distillation Framework for Pre-training and Fine-tuning
LightVessel: Exploring Lightweight Coronary Artery Vessel Segmentation via Similarity Knowledge Distillation
Page 80 of 85

Benchmark Results

In the model names, "T:" denotes the teacher and "S:" the student; a dash in the Verified column means no independently verified value has been recorded.

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | - | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | - | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | - | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | - | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | - | Unverified
6 | VkD (T: RegNety-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | - | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | - | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | - | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | - | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | - | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | - | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | - | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | - | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | - | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | - | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | - | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | - | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | - | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | - | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | - | Unverified
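
For the accuracy leaderboards above, verification typically means re-running a released checkpoint on the benchmark's held-out split and comparing the measured Top-1 accuracy against the claimed number. A minimal sketch of that measurement, assuming a PyTorch classifier `model` and an evaluation `loader` (both hypothetical names):

```python
import torch

@torch.no_grad()
def top1_accuracy(model, loader, device="cpu"):
    """Percentage of examples whose highest-scoring class matches the label."""
    model.eval()
    correct, total = 0, 0
    for images, labels in loader:
        images, labels = images.to(device), labels.to(device)
        # Top-1 prediction is the argmax over the class dimension.
        preds = model(images).argmax(dim=-1)
        correct += (preds == labels).sum().item()
        total += labels.numel()
    return 100.0 * correct / total
```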