Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3151–3200 of 4240 papers

Title	Date	Tasks	Status
Leveraging Advantages of Interactive and Non-Interactive Models for Vector-Based Cross-Lingual Information Retrieval	Nov 3, 2021	Computational EfficiencyCross-Lingual Information Retrieval	—Unverified
Leveraging Angular Distributions for Improved Knowledge Distillation	Feb 27, 2023	Knowledge Distillation	—Unverified
Leveraging ASR Pretrained Conformers for Speaker Verification through Transfer Learning and Knowledge Distillation	Sep 6, 2023	Knowledge DistillationSpeaker Verification	—Unverified
Leveraging Conditional Mutual Information to Improve Large Language Model Fine-Tuning For Classification	Feb 16, 2025	Classificationimage-classification	—Unverified
Leveraging Different Learning Styles for Improved Knowledge Distillation in Biomedical Imaging	Dec 6, 2022	Knowledge DistillationModel Compression	—Unverified
Leveraging Expert Models for Training Deep Neural Networks in Scarce Data Domains: Application to Offline Handwritten Signature Verification	Aug 2, 2023	Knowledge Distillation	—Unverified
FTSmartAudit: A Knowledge Distillation-Enhanced Framework for Automated Smart Contract Auditing Using Fine-Tuned LLMs	Oct 17, 2024	Dataset GenerationKnowledge Distillation	—Unverified
Leveraging Foundation Models To learn the shape of semi-fluid deformable objects	Nov 25, 2024	Knowledge DistillationObject	—Unverified
Leveraging Knowledge Distillation for Lightweight Skin Cancer Classification: Balancing Accuracy and Computational Efficiency	Jun 24, 2024	Cancer ClassificationComputational Efficiency	—Unverified
Leveraging Large Language Models for Enhanced NLP Task Performance through Knowledge Distillation and Optimized Training Strategies	Feb 14, 2024	Knowledge Distillationnamed-entity-recognition	—Unverified
Leveraging Recent Advances in Deep Learning for Audio-Visual Emotion Recognition	Mar 16, 2021	Deep LearningEmotion Recognition	—Unverified
Li3DeTr: A LiDAR based 3D Detection Transformer	Oct 27, 2022	Autonomous DrivingDecoder	—Unverified
Life-Code: Central Dogma Modeling with Multi-Omics Sequence Unification	Feb 11, 2025	Knowledge Distillation	—Unverified
Lifelong GAN: Continual Learning for Conditional Image Generation	Jul 23, 2019	Conditional Image GenerationContinual Learning	—Unverified
Lifelong Intent Detection via Multi-Strategy Rebalancing	Aug 10, 2021	Intent DetectionKnowledge Distillation	—Unverified
Life-long Learning for Multilingual Neural Machine Translation with Knowledge Distillation	Dec 6, 2022	Knowledge DistillationMachine Translation	—Unverified
Lifelong Learning for Neural powered Mixed Integer Programming	Aug 24, 2022	Graph AttentionKnowledge Distillation	—Unverified
Lifelong Learning via Progressive Distillation and Retrospection	Sep 1, 2018	Knowledge DistillationLifelong learning	—Unverified
Lifelong Object Detection	Sep 2, 2020	Knowledge DistillationLifelong learning	—Unverified
Lifelong Person Search	Jul 31, 2024	Knowledge DistillationPerson Search	—Unverified
Lifelong Twin Generative Adversarial Networks	Jul 9, 2021	Knowledge Distillation	—Unverified
Lifelong Unsupervised Domain Adaptive Person Re-identification with Coordinated Anti-forgetting and Adaptation	Dec 13, 2021	Domain Adaptive Person Re-IdentificationKnowledge Distillation	—Unverified
LightBTSeg: A lightweight breast tumor segmentation model using ultrasound images via dual-path joint knowledge distillation	Nov 18, 2023	Knowledge DistillationLesion Detection	—Unverified
Light distillation for Incremental Graph Convolution Collaborative Filtering	May 26, 2025	Collaborative FilteringKnowledge Distillation	—Unverified
LightPAFF: A Two-Stage Distillation Framework for Pre-training and Fine-tuning	Apr 27, 2020	Knowledge DistillationLanguage Modeling	—Unverified
LightVessel: Exploring Lightweight Coronary Artery Vessel Segmentation via Similarity Knowledge Distillation	Nov 2, 2022	DecoderKnowledge Distillation	—Unverified
Lightweight 3D Human Pose Estimation Network Training Using Teacher-Student Learning	Jan 15, 2020	3D Human Pose Estimation3D Pose Estimation	—Unverified
Lightweight Contrastive Distilled Hashing for Online Cross-modal Retrieval	Feb 27, 2025	Cross-Modal RetrievalKnowledge Distillation	—Unverified
Lightweight Modality Adaptation to Sequential Recommendation via Correlation Supervision	Jan 14, 2024	Knowledge DistillationRepresentation Learning	—Unverified
Lightweight Neural Network with Knowledge Distillation for CSI Feedback	Oct 31, 2022	Knowledge Distillation	—Unverified
Lightweight Sound Event Detection Model with RepVGG Architecture	Nov 1, 2022	Event DetectionKnowledge Distillation	—Unverified
Lightweight Task-Oriented Semantic Communication Empowered by Large-Scale AI Models	Jun 16, 2025	Knowledge DistillationSemantic Communication	—Unverified
Limitations of Knowledge Distillation for Zero-shot Transfer Learning	Nov 1, 2021	CPUCross-Lingual Transfer	—Unverified
Linear Projections of Teacher Embeddings for Few-Class Distillation	Sep 30, 2024	Binary ClassificationKnowledge Distillation	—Unverified
Linkless Link Prediction via Relational Distillation	Oct 11, 2022	Knowledge DistillationLink Prediction	—Unverified
Lip-Listening: Mixing Senses to Understand Lips using Cross Modality Knowledge Distillation for Word-Based Models	Jun 5, 2022	Knowledge DistillationLipreading	—Unverified
Lipschitz Continuity Guided Knowledge Distillation	Aug 29, 2021	Knowledge DistillationModel Compression	—Unverified
ListBERT: Learning to Rank E-commerce products with Listwise BERT	Jun 30, 2022	Knowledge DistillationLearning-To-Rank	—Unverified
LIT: Block-wise Intermediate Representation Training for Model Compression	Oct 2, 2018	Knowledge DistillationModel Compression	—Unverified
LiT: Delving into a Simplified Linear Diffusion Transformer for Image Generation	Jan 22, 2025	Image GenerationKnowledge Distillation	—Unverified
LIX: Implicitly Infusing Spatial Geometric Prior Knowledge into Visual Semantic Segmentation for Autonomous Driving	Mar 13, 2024	Autonomous DrivingKnowledge Distillation	—Unverified
Llama-Nemotron: Efficient Reasoning Models	May 2, 2025	Knowledge DistillationNeural Architecture Search	—Unverified
LLAVADI: What Matters For Multimodal Large Language Models Distillation	Jul 28, 2024	Knowledge Distillation	—Unverified
LLaVA-Ultra: Large Chinese Language and Vision Assistant for Ultrasound	Oct 19, 2024	Instruction FollowingKnowledge Distillation	—Unverified
LLM-based Privacy Data Augmentation Guided by Knowledge Distillation with a Distribution Tutor for Medical Text Classification	Feb 26, 2024	Data AugmentationKnowledge Distillation	—Unverified
LLM Distillation for Efficient Few-Shot Multiple Choice Question Answering	Dec 13, 2024	Few-Shot LearningKnowledge Distillation	—Unverified
LLM-driven Knowledge Distillation for Dynamic Text-Attributed Graphs	Feb 15, 2025	Edge ClassificationKnowledge Distillation	—Unverified
LLM Pretraining with Continuous Concepts	Feb 12, 2025	Knowledge DistillationLanguage Modeling	—Unverified
LLM-RadJudge: Achieving Radiologist-Level Evaluation for X-Ray Report Generation	Apr 1, 2024	Knowledge Distillation	—Unverified
LLMR: Knowledge Distillation with a Large Language Model-Induced Reward	Sep 19, 2024	Dialogue GenerationKnowledge Distillation	—Unverified

Show:10 25 50

← PrevPage 64 of 85Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified