Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1251–1300 of 4240 papers

Title	Date	Tasks	Status
Gemma 2: Improving Open Language Models at a Practical Size	Jul 31, 2024	Knowledge Distillation	—Unverified
A Lightweight Low-Light Image Enhancement Network via Channel Prior and Gamma Correction	Feb 28, 2024	Image EnhancementKnowledge Distillation	—Unverified
Cross-Domain Knowledge Distillation for Low-Resolution Human Pose Estimation	May 19, 2024	Knowledge DistillationPose Estimation	—Unverified
Cross domain knowledge compression in realtime optical flow prediction on ultrasound sequences	Feb 4, 2022	Knowledge DistillationOptical Flow Estimation	—Unverified
Adaptive Affinity-Based Generalization For MRI Imaging Segmentation Across Resource-Limited Settings	Apr 3, 2024	Data IntegrationKnowledge Distillation	—Unverified
Cross-Class Feature Augmentation for Class Incremental Learning	Apr 4, 2023	class-incremental learningClass Incremental Learning	—Unverified
Cross-Architecture Knowledge Distillation	Jul 12, 2022	Knowledge Distillation	—Unverified
Cross Architecture Distillation for Face Recognition	Jun 26, 2023	Face RecognitionKnowledge Distillation	—Unverified
BabyHGRN: Exploring RNNs for Sample-Efficient Training of Language Models	Dec 20, 2024	Knowledge DistillationLanguage Modeling	—Unverified
A Lightweight Domain Adversarial Neural Network Based on Knowledge Distillation for EEG-based Cross-subject Emotion Recognition	May 12, 2023	EEGElectroencephalogram (EEG)	—Unverified
Exploring Extreme Quantization in Spiking Language Models	May 4, 2024	Knowledge DistillationLanguage Modeling	—Unverified
CRKD: Enhanced Camera-Radar Object Detection with Cross-modality Knowledge Distillation	Mar 28, 2024	3D Object DetectionAutonomous Driving	—Unverified
CREFT: Sequential Multi-Agent LLM for Character Relation Extraction	May 30, 2025	Knowledge DistillationLanguage Modeling	—Unverified
AWF: Adaptive Weight Fusion for Enhanced Class Incremental Semantic Segmentation	Sep 13, 2024	Class-Incremental Semantic SegmentationKnowledge Distillation	—Unverified
Creating Lightweight Object Detectors with Model Compression for Deployment on Edge Devices	May 6, 2019	Knowledge DistillationModel Compression	—Unverified
A Light-weight Deep Learning Model for Remote Sensing Image Classification	Feb 25, 2023	image-classificationImage Classification	—Unverified
Creating a Good Teacher for Knowledge Distillation in Acoustic Scene Classification	Mar 14, 2025	Acoustic Scene ClassificationKnowledge Distillation	—Unverified
CovidCare: Transferring Knowledge from Existing EMR to Emerging Epidemic for Interpretable Prognosis	Jul 17, 2020	DiagnosticKnowledge Distillation	—Unverified
Aware of the History: Trajectory Forecasting with the Local Behavior Data	Jul 20, 2022	Knowledge DistillationPrediction	—Unverified
CourseGPT-zh: an Educational Large Language Model Based on Knowledge Distillation Incorporating Prompt Optimization	May 8, 2024	DiversityKnowledge Distillation	—Unverified
CoupleFace: Relation Matters for Face Recognition Distillation	Apr 12, 2022	Face RecognitionKnowledge Distillation	—Unverified
A Knowledge Distillation framework for Multi-Organ Segmentation of Medaka Fish in Tomographic Image	Feb 24, 2023	Computed Tomography (CT)Image Segmentation	—Unverified
Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation	Nov 1, 2024	EpidemiologyKnowledge Distillation	—Unverified
Coupled End-to-End Transfer Learning With Generalized Fisher Information	Jun 1, 2018	DecoderDomain Adaptation	—Unverified
Co-training and Co-distillation for Quality Improvement and Compression of Language Models	Nov 6, 2023	Data AugmentationKnowledge Distillation	—Unverified
CoT-Drive: Efficient Motion Forecasting for Autonomous Driving with LLMs and Chain-of-Thought Prompting	Mar 10, 2025	Autonomous DrivingKnowledge Distillation	—Unverified
A vision transformer-based framework for knowledge transfer from multi-modal to mono-modal lymphoma subtyping models	Aug 2, 2023	Knowledge DistillationTransfer Learning	—Unverified
1st Place Solution to the EPIC-Kitchens Action Anticipation Challenge 2022	Jul 10, 2022	Action AnticipationKnowledge Distillation	—Unverified
CoT2Align: Cross-Chain of Thought Distillation via Optimal Transport Alignment for Language Models with Different Tokenizers	Feb 24, 2025	Knowledge Distillation	—Unverified
Cost-effective Deployment of BERT Models in Serverless Environment	Jun 1, 2021	Knowledge DistillationSemantic Textual Similarity	—Unverified
AUTOSUMM: Automatic Model Creation for Text Summarization	Nov 1, 2021	Abstractive Text SummarizationDeep Learning	—Unverified
Cost-effective Deployment of BERT Models in Serverless Environment	Mar 19, 2021	Knowledge DistillationSemantic Textual Similarity	—Unverified
Cosine Similarity Knowledge Distillation for Individual Class Information Transfer	Nov 24, 2023	Knowledge DistillationModel Compression	—Unverified
Adapting OC20-trained EquiformerV2 Models for High-Entropy Materials	Mar 14, 2024	Knowledge Distillation	—Unverified
Exploring Dark Knowledge under Various Teacher Capacities and Addressing Capacity Mismatch	May 21, 2024	Knowledge Distillation	—Unverified
Exploring Dual Model Knowledge Distillation for Anomaly Detection	Jun 27, 2023	Anomaly Detectionfeature selection	—Unverified
CORSD: Class-Oriented Relational Self Distillation	Apr 28, 2023	Knowledge DistillationModel Compression	—Unverified
Correlation-Decoupled Knowledge Distillation for Multimodal Sentiment Analysis with Incomplete Modalities	Apr 25, 2024	DisentanglementKnowledge Distillation	—Unverified
A Knowledge Distillation-Based Backdoor Attack in Federated Learning	Aug 12, 2022	Backdoor AttackFederated Learning	—Unverified
Automatic Mixed-Precision Quantization Search of BERT	Dec 30, 2021	Knowledge DistillationModel Compression	—Unverified
Corrected with the Latest Version: Make Robust Asynchronous Federated Learning Possible	Apr 5, 2025	Federated LearningKnowledge Distillation	—Unverified
Exploiting Unlabelled Photos for Stronger Fine-Grained SBIR	Mar 24, 2023	Image RetrievalKnowledge Distillation	—Unverified
Exploring and Enhancing the Transfer of Distribution in Knowledge Distillation for Autoregressive Language Models	Sep 19, 2024	Knowledge Distillation	—Unverified
CoroNetGAN: Controlled Pruning of GANs via Hypernetworks	Mar 13, 2024	Knowledge Distillation	—Unverified
ChromaDistill: Colorizing Monochrome Radiance Fields with Knowledge Distillation	Sep 14, 2023	3DGSColorization	—Unverified
Automatic Block-wise Pruning with Auxiliary Gating Structures for Deep Convolutional Neural Networks	May 7, 2022	Knowledge DistillationModel Compression	—Unverified
Adapting Models to Signal Degradation using Distillation	Apr 1, 2016	Domain AdaptationKnowledge Distillation	—Unverified
Coordinating Cross-modal Distillation for Molecular Property Prediction	Nov 30, 2022	Graph RegressionGraph Representation Learning	—Unverified
Accelerating Molecular Graph Neural Networks via Knowledge Distillation	Jun 26, 2023	Data AugmentationKnowledge Distillation	—Unverified
Exploiting Knowledge Distillation for Few-Shot Image Generation	Sep 29, 2021	DiversityImage Generation	—Unverified

Show:10 25 50

← PrevPage 26 of 85Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified