SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.
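
In its most common form, logit-based distillation trains the student to match the teacher's temperature-softened output distribution alongside the usual hard-label objective. The sketch below illustrates that loss, assuming PyTorch; the function name and the temperature/alpha defaults are illustrative and not taken from any specific paper listed on this page.

```python
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.5):
    """Blend a softened teacher-matching term with the usual
    hard-label cross-entropy on the student's own logits."""
    # Soften both output distributions with the temperature T.
    soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)

    # KL divergence from the softened teacher to the softened student,
    # scaled by T^2 so gradient magnitudes stay comparable across T.
    kd_term = F.kl_div(soft_student, soft_teacher,
                       reduction="batchmean") * (temperature ** 2)

    # Standard supervised loss against the ground-truth labels.
    ce_term = F.cross_entropy(student_logits, labels)

    return alpha * kd_term + (1.0 - alpha) * ce_term
```

A higher temperature exposes more of the teacher's relative class similarities (its "dark knowledge"), while alpha trades the distillation term off against the hard-label term; both are typically tuned per task.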

Papers

Showing 4101–4150 of 4240 papers

Title | Status | Hype
Attend, Distill, Detect: Attention-aware Entropy Distillation for Anomaly Detection | Code | 0
FAKD: Feature Augmented Knowledge Distillation for Semantic Segmentation | Code | 0
On the Surprising Efficacy of Distillation as an Alternative to Pre-Training Small Models | Code | 0
On the Transferability of Visual Features in Generalized Zero-Shot Learning | Code | 0
A Teacher-Free Graph Knowledge Distillation Framework with Dual Self-Distillation | Code | 0
On the Use of External Data for Spoken Named Entity Recognition | Code | 0
OpenGrok: Enhancing SNS Data Processing with Distilled Knowledge and Mask-like Mechanisms | Code | 0
Dataset Distillation via Knowledge Distillation: Towards Efficient Self-Supervised Pre-Training of Deep Networks | Code | 0
Born Again Neural Networks | Code | 0
Data-Free Knowledge Distillation for Image Super-Resolution | Code | 0
Faithful Label-free Knowledge Distillation | Code | 0
Data-free Knowledge Distillation for Fine-grained Visual Categorization | Code | 0
Self-Attentive Spatio-Temporal Calibration for Precise Intermediate Layer Matching in ANN-to-SNN Distillation | Code | 0
Fairness without Demographics through Knowledge Distillation | Code | 0
Towards Real-time Video Compressive Sensing on Mobile Devices | Code | 0
Boosting Summarization with Normalizing Flows and Aggressive Training | Code | 0
Self-Distillation for Gaussian Process Regression and Classification | Code | 0
Data-free Knowledge Distillation for Segmentation using Data-Enriching GAN | Code | 0
Optimal Transport Guided Correlation Assignment for Multimodal Entity Linking | Code | 0
Teacher Agent: A Knowledge Distillation-Free Framework for Rehearsal-based Video Incremental Learning | Code | 0
Facilitating Pornographic Text Detection for Open-Domain Dialogue Systems via Knowledge Distillation of Large Language Models | Code | 0
Facilitating NSFW Text Detection in Open-Domain Dialogue Systems via Knowledge Distillation | Code | 0
Optimizing edge AI models on HPC systems with the edge in the loop | Code | 0
Teacher as a Lenient Expert: Teacher-Agnostic Data-Free Knowledge Distillation | Code | 0
Teacher Intervention: Improving Convergence of Quantization Aware Training for Ultra-Low Precision Transformers | Code | 0
Teacher Network Calibration Improves Cross-Quality Knowledge Distillation | Code | 0
Facial Landmark Points Detection Using Knowledge Distillation-Based Neural Networks | Code | 0
Boosting Residual Networks with Group Knowledge | Code | 0
Exploiting the Semantic Knowledge of Pre-trained Text-Encoders for Continual Learning | Code | 0
Model-Based Reinforcement Learning with Multi-Task Offline Pretraining | Code | 0
When Babies Teach Babies: Can Student Knowledge Sharing Outperform Teacher-Guided Distillation on Small Datasets? | Code | 0
Eyelid’s Intrinsic Motion-aware Feature Learning for Real-time Eyeblink Detection in the Wild | Code | 0
An Investigation of the Combination of Rehearsal and Knowledge Distillation in Continual Learning for Spoken Language Understanding | Code | 0
ORC: Network Group-based Knowledge Distillation using Online Role Change | Code | 0
Data-Free Generative Replay for Class-Incremental Learning on Imbalanced Data | Code | 0
Exploring Target Representations for Masked Autoencoders | Code | 0
Adaptive Mixing of Auxiliary Losses in Supervised Learning | Code | 0
Distilling the Unknown to Unveil Certainty | Code | 0
Exploring Social Media for Early Detection of Depression in COVID-19 Patients | Code | 0
Exploring Non-Autoregressive Text Style Transfer | Code | 0
Exploring Hyperspectral Anomaly Detection with Human Vision: A Small Target Aware Detector | Code | 0
Exploring Feature-based Knowledge Distillation for Recommender System: A Frequency Perspective | Code | 0
A Diversity-Enhanced Knowledge Distillation Model for Practical Math Word Problem Solving | Code | 0
Overcoming Uncertain Incompleteness for Robust Multimodal Sequential Diagnosis Prediction via Curriculum Data Erasing Guided Knowledge Distillation | Code | 0
Over-parameterized Student Model via Tensor Decomposition Boosted Knowledge Distillation | Code | 0
Exploiting CLIP for Zero-shot HOI Detection Requires Knowledge Distillation at Multiple Levels | Code | 0
OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras | Code | 0
Evolutionary Generative Adversarial Networks with Crossover Based Knowledge Distillation | Code | 0
PruMUX: Augmenting Data Multiplexing with Model Compression | Code | 0
Weak-to-Strong 3D Object Detection with X-Ray Distillation | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy % | 86.43 | – | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy % | 85.53 | – | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy % | 83.93 | – | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy % | 83.8 | – | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy % | 83.6 | – | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy % | 82.9 | – | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy % | 82.7 | – | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy % | 82.55 | – | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy % | 82.5 | – | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy % | 82.3 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 79.86 | – | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 78.76 | – | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 Accuracy (%) | 78.6 | – | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 78.28 | – | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 Accuracy (%) | 78.08 | – | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 77.93 | – | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 Accuracy (%) | 77.68 | – | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 77.5 | – | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.68 | – | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.31 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | – | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | – | Unverified