Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2601–2650 of 4240 papers

Title	Date	Tasks	Status
Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation	Nov 1, 2024	EpidemiologyKnowledge Distillation	—Unverified
Adaptive Affinity-Based Generalization For MRI Imaging Segmentation Across Resource-Limited Settings	Apr 3, 2024	Data IntegrationKnowledge Distillation	—Unverified
Adaptive Beam Search to Enhance On-device Abstractive Summarization	Dec 22, 2021	Abstractive Text SummarizationKnowledge Distillation	—Unverified
Adaptive Deep Iris Feature Extractor at Arbitrary Resolutions	Jul 11, 2024	Iris RecognitionKnowledge Distillation	—Unverified
Adaptive Explicit Knowledge Transfer for Knowledge Distillation	Sep 3, 2024	Knowledge DistillationTransfer Learning	—Unverified
Adaptive Group Robust Ensemble Knowledge Distillation	Nov 22, 2024	Knowledge Distillation	—Unverified
Adaptive Instance Distillation for Object Detection in Autonomous Driving	Jan 26, 2022	Autonomous DrivingKnowledge Distillation	—Unverified
Adaptive Knowledge Distillation between Text and Speech Pre-trained Models	Mar 7, 2023	Knowledge DistillationSpoken Language Understanding	—Unverified
Adaptive Knowledge Distillation for Classification of Hand Images using Explainable Vision Transformers	Aug 20, 2024	Knowledge Distillation	—Unverified
Adaptive Label Smoothing with Self-Knowledge	Sep 29, 2021	Knowledge DistillationMachine Translation	—Unverified
Adaptive Label Smoothing with Self-Knowledge in Natural Language Generation	Oct 22, 2022	Knowledge DistillationText Generation	—Unverified
Adaptively Integrated Knowledge Distillation and Prediction Uncertainty for Continual Learning	Jan 18, 2023	Continual LearningKnowledge Distillation	—Unverified
Adaptive Multiplane Image Generation from a Single Internet Picture	Nov 26, 2020	Depth EstimationImage Generation	—Unverified
Adaptive Regularization of Labels	Aug 15, 2019	Data AugmentationKnowledge Distillation	—Unverified
Add a SideNet to your MainNet	Jul 14, 2020	General ClassificationKnowledge Distillation	—Unverified
Addressing Bias Through Ensemble Learning and Regularized Fine-Tuning	Feb 1, 2024	Ensemble LearningKnowledge Distillation	—Unverified
A Deep Hierarchical Feature Sparse Framework for Occluded Person Re-Identification	Jan 15, 2024	Data AugmentationKnowledge Distillation	—Unverified
A deep Natural Language Inference predictor without language-specific training data	Sep 6, 2023	Aspect-Based Sentiment AnalysisKnowledge Distillation	—Unverified
A Deep Reinforcement Learning Framework for Rapid Diagnosis of Whole Slide Pathological Images	May 5, 2022	Deep Reinforcement LearningKnowledge Distillation	—Unverified
A Dimensional Structure based Knowledge Distillation Method for Cross-Modal Learning	Jun 28, 2023	Knowledge Distillation	—Unverified
ADINet: Attribute Driven Incremental Network for Retinal Image Classification	Jun 1, 2020	AttributeClassification	—Unverified
A distillation based approach for the diagnosis of diseases	Aug 7, 2021	Knowledge Distillation	—Unverified
ADMP: An Adversarial Double Masks Based Pruning Framework For Unsupervised Cross-Domain Compression	Jun 7, 2020	Domain AdaptationKnowledge Distillation	—Unverified
ADROIT: A Self-Supervised Framework for Learning Robust Representations for Active Learning	Mar 10, 2025	Active LearningKnowledge Distillation	—Unverified
ADU: Adaptive Detection of Unknown Categories in Black-Box Domain Adaptation	Jan 1, 2025	Domain AdaptationKnowledge Distillation	—Unverified
ADU-Depth: Attention-based Distillation with Uncertainty Modeling for Depth Estimation	Sep 26, 2023	3D geometryDepth Estimation	—Unverified
Advancing Deep Learning through Probability Engineering: A Pragmatic Paradigm for Modern AI	Mar 19, 2025	Deep LearningFederated Learning	—Unverified
Advancing Medical Radiograph Representation Learning: A Hybrid Pre-training Paradigm with Multilevel Semantic Granularity	Oct 1, 2024	DecoderKnowledge Distillation	—Unverified
Adversarial-Based Knowledge Distillation for Multi-Model Ensemble and Noisy Data Refinement	Aug 22, 2019	Knowledge DistillationMissing Labels	—Unverified
Adversarial Curriculum Graph-Free Knowledge Distillation for Graph Neural Networks	Apr 1, 2025	Data-free Knowledge DistillationKnowledge Distillation	—Unverified
Adversarial Feature Alignment: Avoid Catastrophic Forgetting in Incremental Task Lifelong Learning	Oct 24, 2019	Continual Learningimage-classification	—Unverified
Adversarial Finetuning with Latent Representation Constraint to Mitigate Accuracy-Robustness Tradeoff	Aug 31, 2023	Knowledge Distillation	—Unverified
Adversarially Robust and Explainable Model Compression with On-Device Personalization for Text Classification	Jan 10, 2021	Adversarial RobustnessGeneral Classification	—Unverified
Adversarial Prompt Distillation for Vision-Language Models	Nov 22, 2024	Adversarial RobustnessAutonomous Driving	—Unverified
Adversarial Robustness of Distilled and Pruned Deep Learning-based Wireless Classifiers	Apr 11, 2024	Adversarial RobustnessKnowledge Distillation	—Unverified
Adversarial Self-Supervised Data-Free Distillation for Text Classification	Oct 10, 2020	ClassificationGeneral Classification	—Unverified
Adversarial Sparse Teacher: Defense Against Distillation-Based Model Stealing Attacks Using Adversarial Examples	Mar 8, 2024	Knowledge Distillation	—Unverified
Adverse Weather Optical Flow: Cumulative Homogeneous-Heterogeneous Adaptation	Sep 25, 2024	Domain AdaptationKnowledge Distillation	—Unverified
A dynamic interactive learning framework for automated 3D medical image segmentation	Dec 11, 2023	Image RegistrationImage Segmentation	—Unverified
A Flexible Multi-Task Model for BERT Serving	Nov 16, 2021	Knowledge Distillationmodel	—Unverified
Discovery of novel antimicrobial peptides with notable antibacterial potency by a LLM-based foundation model	Jul 17, 2024	Knowledge Distillationscientific discovery	—Unverified
A Framework for Double-Blind Federated Adaptation of Foundation Models	Feb 3, 2025	Federated Learningimage-classification	—Unverified
AfroXLMR-Comet: Multilingual Knowledge Distillation with Attention Matching for Low-Resource languages	Feb 25, 2025	Knowledge DistillationLanguage Modeling	—Unverified
After-Stroke Arm Paresis Detection using Kinematic Data	Nov 3, 2023	Action ClassificationKnowledge Distillation	—Unverified
A Generalized and Robust Method Towards Practical Gaze Estimation on Smart Phone	Oct 16, 2019	Gaze EstimationKnowledge Distillation	—Unverified
Generalized Supervised Contrastive Learning	Jun 1, 2022	Contrastive LearningKnowledge Distillation	—Unverified
A General Multiple Data Augmentation Based Framework for Training Deep Neural Networks	May 29, 2022	Data Augmentationimage-classification	—Unverified
A Generative Framework for Personalized Learning and Estimation: Theory, Algorithms, and Privacy	Jul 5, 2022	Federated LearningKnowledge Distillation	—Unverified
AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes	Jun 17, 2025	Knowledge DistillationTransfer Learning	—Unverified
Agglomerating Large Vision Encoders via Distillation for VFSS Segmentation	Apr 3, 2025	Image SegmentationKnowledge Distillation	—Unverified

Show:10 25 50

← PrevPage 53 of 85Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified