SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized; a smaller "student" model trained to mimic a large "teacher" can therefore often approach the teacher's accuracy at a fraction of the inference cost.
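The classic instantiation is the soft-target loss of Hinton et al. (2015), which many of the papers listed below build on: the student is trained against a blend of the teacher's temperature-softened output distribution and the ordinary hard labels. Below is a minimal PyTorch sketch; the function name distillation_loss and the defaults T=4.0 and alpha=0.9 are illustrative choices, not taken from any specific entry on this page.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Blend a soft-target KL term with the usual hard-label cross-entropy."""
    # Soften both distributions with temperature T; the T**2 factor keeps
    # gradient magnitudes comparable across temperatures (Hinton et al., 2015).
    soft_loss = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    hard_loss = F.cross_entropy(student_logits, labels)
    return alpha * soft_loss + (1.0 - alpha) * hard_loss

# Illustrative usage: the teacher is frozen (eval mode, no gradients) and
# only the student's parameters are updated.
# with torch.no_grad():
#     teacher_logits = teacher(x)
# loss = distillation_loss(student(x), teacher_logits, y)
```

Most of the listed methods replace or augment this baseline loss (e.g., with feature, relational, or response-level matching) while keeping the same frozen-teacher / trainable-student setup.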

Papers

Showing 3751–3800 of 4240 papers

Title | Status | Hype
ReffAKD: Resource-efficient Autoencoder-based Knowledge Distillation | Code | 0
Collaborative Learning of Bidirectional Decoders for Unsupervised Text Style Transfer | Code | 0
Refined Response Distillation for Class-Incremental Player Detection | Code | 0
MicroExpNet: An Extremely Small and Fast Model For Expression Recognition From Face Images | Code | 0
Image Recognition with Online Lightweight Vision Transformer: A Survey | Code | 0
Distilling Object Detectors With Global Knowledge | Code | 0
Low-Energy On-Device Personalization for MCUs | Code | 0
MIDAS: Multi-level Intent, Domain, And Slot Knowledge Distillation for Multi-turn NLU | Code | 0
Hybrid Data-Free Knowledge Distillation | Code | 0
Collaborative Deep Reinforcement Learning | Code | 0
Cogni-Net: Cognitive Feature Learning through Deep Visual Perception | Code | 0
MimicGait: A Model Agnostic approach for Occluded Gait Recognition using Correlational Knowledge Distillation | Code | 0
Regression-Oriented Knowledge Distillation for Lightweight Ship Orientation Angle Prediction with Optical Remote Sensing Images | Code | 0
Distilling Object Detectors with Fine-grained Feature Imitation | Code | 0
Hybrid Attention Model Using Feature Decomposition and Knowledge Distillation for Glucose Forecasting | Code | 0
REHRSeg: Unleashing the Power of Self-Supervised Super-Resolution for Resource-Efficient 3D MRI Segmentation | Code | 0
HVDistill: Transferring Knowledge from Images to Point Clouds via Unsupervised Hybrid-View Distillation | Code | 0
Distilling Reasoning Capabilities into Smaller Language Models | Code | 0
Minimizing PLM-Based Few-Shot Intent Detectors | Code | 0
Human Guided Exploitation of Interpretable Attention Patterns in Summarization and Topic Segmentation | Code | 0
TSPipe: Learn from Teacher Faster with Pipelines | Code | 0
Reinforced Knowledge Distillation for Time Series Regression | Code | 0
A Flexible Multi-Task Model for BERT Serving | Code | 0
HTR-JAND: Handwritten Text Recognition with Joint Attention Network and Knowledge Distillation | Code | 0
Rejuvenating Low-Frequency Words: Making the Most of Parallel Data in Non-Autoregressive Translation | Code | 0
Relational Diffusion Distillation for Efficient Image Generation | Code | 0
Relational Knowledge Distillation | Code | 0
HRKD: Hierarchical Relational Knowledge Distillation for Cross-domain Language Model Compression | Code | 0
Distilling Model Knowledge | Code | 0
How to Train the Teacher Model for Effective Knowledge Distillation | Code | 0
MixedTeacher : Knowledge Distillation for fast inference textural anomaly detection | Code | 0
Topology-Guided Knowledge Distillation for Efficient Point Cloud Processing | Code | 0
Distilling Local Texture Features for Colorectal Tissue Classification in Low Data Regimes | Code | 0
Dynamic Data-Free Knowledge Distillation by Easy-to-Hard Learning Strategy | Code | 0
CL-XABSA: Contrastive Learning for Cross-lingual Aspect-based Sentiment Analysis | Code | 0
Mixture of Modular Experts: Distilling Knowledge from a Multilingual Teacher into Specialized Modular Language Models | Code | 0
Relative Difficulty Distillation for Semantic Segmentation | Code | 0
Self-Supervised Z-Slice Augmentation for 3D Bio-Imaging via Knowledge Distillation | Code | 0
How Knowledge Distillation Mitigates the Synthetic Gap in Fair Face Recognition | Code | 0
Releasing Graph Neural Networks with Differential Privacy Guarantees | Code | 0
Holistic White-light Polyp Classification via Alignment-free Dense Distillation of Auxiliary Optical Chromoendoscopy | Code | 0
Distilling Knowledge for Empathy Detection | Code | 0
RELIANT: Fair Knowledge Distillation for Graph Neural Networks | Code | 0
HiTSR: A Hierarchical Transformer for Reference-based Super-Resolution | Code | 0
Highlight Every Step: Knowledge Distillation via Collaborative Teaching | Code | 0
HDKD: Hybrid Data-Efficient Knowledge Distillation Network for Medical Image Classification | Code | 0
Distilling Knowledge for Designing Computational Imaging Systems | Code | 0
MOD: A Deep Mixture Model with Online Knowledge Distillation for Large Scale Video Temporal Concept Localization | Code | 0
Handling Data Heterogeneity in Federated Learning via Knowledge Distillation and Fusion | Code | 0
Cluster-aware Semi-supervised Learning: Relational Knowledge Distillation Provably Learns Clustering | Code | 0
Page 76 of 85

Benchmark Results

In the tables below, "T:" denotes the teacher model and "S:" the student. "Claimed" is the number reported by the paper; the "Verified" column is blank for results that have not yet been independently reproduced (Status: Unverified).

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | — | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | — | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | — | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | — | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | — | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | — | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | — | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | — | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | — | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | — | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | — | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | — | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | — | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | — | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | — | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | — | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | — | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | — | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | — | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: AdaBins, S: MobileNetV2) | RMSE | 2.43 | — | Unverified