Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.
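As an illustration of the transfer described above, here is a minimal sketch of one common formulation, temperature-softened soft targets combined with the usual hard-label loss. This page does not specify a loss, so the function names (`softmax`, `distillation_loss`) and the `temperature` and `alpha` defaults are assumptions chosen for the example, not taken from any paper listed here.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, true_label,
                      temperature=4.0, alpha=0.5):
    """Weighted sum of a soft-target KL term and a hard-label cross-entropy term.

    temperature and alpha are illustrative defaults (assumptions).
    """
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    # KL(teacher || student) on the softened distributions
    kl = sum(pt * math.log(pt / ps)
             for pt, ps in zip(p_teacher, p_student) if pt > 0)
    # Scaling by T^2 keeps the soft term comparable in size to the hard term
    soft_term = (temperature ** 2) * kl
    # Standard cross-entropy of the student against the ground-truth label
    hard_term = -math.log(softmax(student_logits)[true_label])
    return alpha * soft_term + (1 - alpha) * hard_term
```

With `alpha=1.0` the student is trained only to match the teacher, so the loss is zero when the two sets of logits agree; lowering `alpha` shifts weight toward the ground-truth label.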

Papers

Showing 3101–3150 of 4240 papers

Title	Date	Tasks	Status
Yield Evaluation of Citrus Fruits based on the YoloV5 compressed by Knowledge Distillation	Nov 16, 2022	Knowledge Distillation	Unverified
YOLO in the Dark - Domain Adaptation Method for Merging Multiple Models -	Aug 1, 2020	Domain Adaptation, Knowledge Distillation	Unverified
You Can Have Your Data and Balance It Too: Towards Balanced and Efficient Multilingual Models	Oct 13, 2022	Cross-Lingual Transfer, Knowledge Distillation	Unverified
You Do Not Need Additional Priors or Regularizers in Retinex-Based Low-Light Image Enhancement	Jan 1, 2023	Contrastive Learning, Image Enhancement	Unverified
Zero shot framework for satellite image restoration	Jun 5, 2023	Disentanglement, Image Restoration	Unverified
Zero-shot Slot Filling in the Age of LLMs for Dialogue Systems	Nov 28, 2024	Knowledge Distillation, Natural Language Understanding	Unverified
Diverse Knowledge Distillation (DKD): A Solution for Improving The Robustness of Ensemble Models Against Adversarial Attacks	Jun 26, 2020	Ensemble Learning, image-classification	Unverified
Learning Efficient Image Super-Resolution Networks via Structure-Regularized Pruning	Sep 29, 2021	Image Super-Resolution, Knowledge Distillation	Unverified
Learning Efficient Object Detection Models with Knowledge Distillation	Dec 1, 2017	Knowledge Distillation, Model Compression	Unverified
Learning from a Lightweight Teacher for Efficient Knowledge Distillation	May 19, 2020	Knowledge Distillation	Unverified
Learning From Biased Soft Labels	Feb 16, 2023	Knowledge Distillation	Unverified
Learning from deep model via exploring local targets	Jan 1, 2021	Knowledge Distillation, model	Unverified
Learning from Imperfect Data: Towards Efficient Knowledge Distillation of Autoregressive Language Models for Text-to-SQL	Oct 15, 2024	Knowledge Distillation, Text to SQL	Unverified
Learning from Matured Dumb Teacher for Fine Generalization	Aug 12, 2021	image-classification, Image Classification	Unverified
Learning Human-Human Interactions in Images from Weak Textual Supervision	Apr 27, 2023	Human-Human Interaction Recognition, Image Captioning	Unverified
MixMix: All You Need for Data-Free Compression Are Feature and Data Mixing	Nov 19, 2020	All, Knowledge Distillation	Unverified
Learning Interpretation with Explainable Knowledge Distillation	Nov 12, 2021	Knowledge Distillation, Model Compression	Unverified
Learning Knowledge Representation with Meta Knowledge Distillation for Single Image Super-Resolution	Jul 18, 2022	Image Super-Resolution, Knowledge Distillation	Unverified
Learning Lightweight Object Detectors via Multi-Teacher Progressive Distillation	Aug 17, 2023	Edge-computing, Instance Segmentation	Unverified
Learning Lightweight Pedestrian Detector with Hierarchical Knowledge Distillation	Sep 20, 2019	Knowledge Distillation, Pedestrian Detection	Unverified
Learning Modality-agnostic Representation for Semantic Segmentation from Any Modalities	Jul 16, 2024	Knowledge Distillation, Semantic Segmentation	Unverified
Learning Student-Friendly Teacher Networks for Knowledge Distillation	Feb 12, 2021	Knowledge Distillation, Transfer Learning	Unverified
Learning Student Networks via Feature Embedding	Dec 17, 2018	Knowledge Distillation	Unverified
Learning Task-Agnostic Embedding of Multiple Black-Box Experts for Multi-Task Model Fusion	Jan 1, 2020	Knowledge Distillation	Unverified
Learning the Wrong Lessons: Inserting Trojans During Knowledge Distillation	Mar 9, 2023	Knowledge Distillation	Unverified
Learning Through Guidance: Knowledge Distillation for Endoscopic Image Classification	Aug 17, 2023	Classification, Feature Engineering	Unverified
Learning to Augment for Data-Scarce Domain BERT Knowledge Distillation	Jan 20, 2021	Knowledge Distillation	Unverified
Learning to Extract Attribute Value from Product via Question Answering: A Multi-task Approach	Aug 20, 2020	Attribute, Attribute Value Extraction	Unverified
Learning to Project for Cross-Task Knowledge Distillation	Mar 21, 2024	Depth Estimation, Knowledge Distillation	Unverified
Learning to reconstruct signals with inexact sensing operator via knowledge distillation	Jan 18, 2025	Knowledge Distillation	Unverified
Learning to Retain while Acquiring: Combating Distribution-Shift in Adversarial Data-Free Knowledge Distillation	Feb 28, 2023	Data-free Knowledge Distillation, Knowledge Distillation	Unverified
Learning to Specialize with Knowledge Distillation for Visual Question Answering	Dec 1, 2018	General Classification, General Knowledge	Unverified
Learning to Teach with Student Feedback	Sep 10, 2021	Knowledge Distillation	Unverified
Learning to Teach with Student Feedback	Nov 16, 2021	Knowledge Distillation	Unverified
Learning ULMFiT and Self-Distillation with Calibration for Medical Dialogue System	Jul 20, 2021	Decision Making, Knowledge Distillation	Unverified
Learning Using Generated Privileged Information by Text-to-Image Diffusion Models	Sep 26, 2023	Classification, Knowledge Distillation	Unverified
Teaching What You Should Teach: A Data-Based Distillation Method	Dec 11, 2022	Data Augmentation, Knowledge Distillation	Unverified
Learning with Less: Knowledge Distillation from Large Language Models via Unlabeled Data	Nov 12, 2024	Knowledge Distillation	Unverified
Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition	Jul 13, 2019	Knowledge Distillation, Language Modeling	Unverified
Learn to Talk via Proactive Knowledge Transfer	Aug 23, 2020	de-en, Knowledge Distillation	Unverified
Leave No Knowledge Behind During Knowledge Distillation: Towards Practical and Effective Knowledge Distillation for Code-Switching ASR Using Realistic Data	Jul 15, 2024	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Legal-Tech Open Diaries: Lesson learned on how to develop and deploy light-weight models in the era of humongous Language Models	Oct 24, 2022	Knowledge Distillation, Model Compression	Unverified
LegoDNN: Block-grained Scaling of Deep Neural Networks for Mobile Vision	Dec 18, 2021	Knowledge Distillation, Model Compression	Unverified
LENS-XAI: Redefining Lightweight and Explainable Network Security through Knowledge Distillation and Variational Autoencoders for Scalable Intrusion Detection in Cybersecurity	Jan 1, 2025	Computational Efficiency, Intrusion Detection	Unverified
Less is More: Efficient Brain-Inspired Learning for Autonomous Driving Trajectory Prediction	Jul 9, 2024	Autonomous Driving, Decision Making	Unverified
Less or More From Teacher: Exploiting Trilateral Geometry For Knowledge Distillation	Dec 22, 2023	Bilevel Optimization, Click-Through Rate Prediction	Unverified
Let Video Teaches You More: Video-to-Image Knowledge Distillation using DEtection TRansformer for Medical Video Lesion Detection	Aug 26, 2024	Knowledge Distillation, Lesion Detection	Unverified
Letz Translate: Low-Resource Machine Translation for Luxembourgish	Mar 2, 2023	Knowledge Distillation, Machine Translation	Unverified
Leukocyte Classification using Multimodal Architecture Enhanced by Knowledge Distillation	Aug 17, 2022	Classification, Knowledge Distillation	Unverified
Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and language Models for Intent Classification	Feb 15, 2021	Classification, General Classification	Unverified

Datasets: ImageNet, CIFAR-100, COCO (Common Objects in Context), COCO 2017 val, PASCAL VOC, KITTI

Benchmark Results

ImageNet
#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

CIFAR-100
#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

COCO (Common Objects in Context)
#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

COCO 2017 val
#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

PASCAL VOC
#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

KITTI
#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified