SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized, and such models are expensive to deploy. Distillation therefore trains a compact "student" model to reproduce the outputs of a larger "teacher" (typically its softened class probabilities, alongside the ground-truth labels), retaining much of the teacher's accuracy at a fraction of the inference cost.
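
Many of the methods listed on this page are variants of the soft-target objective of Hinton et al. (2015), in which the student matches the teacher's temperature-softened class probabilities in addition to the ground-truth labels. Below is a minimal sketch of that loss, assuming PyTorch; the function name and the `temperature` and `alpha` values are illustrative choices, not taken from any specific paper listed here.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.5):
    """Blend a soft-target KL term (teacher -> student) with ordinary cross-entropy."""
    # Soften both output distributions with the same temperature.
    soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    # KL divergence between the softened distributions; the T^2 factor keeps
    # gradient magnitudes comparable across temperatures (Hinton et al., 2015).
    kd_term = F.kl_div(soft_student, soft_teacher,
                       reduction="batchmean") * temperature ** 2
    # Ordinary supervised loss on the hard labels.
    ce_term = F.cross_entropy(student_logits, labels)
    return alpha * kd_term + (1.0 - alpha) * ce_term
```

In practice the teacher is frozen and run under `torch.no_grad()` to produce `teacher_logits`, so only the student's parameters receive gradient updates.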

Papers

Showing 3251–3300 of 4240 papers

Title | Status | Hype
Enhancing Metaphor Detection through Soft Labels and Target Word Prediction | | 0
Measuring and Reducing Model Update Regression in Structured Prediction for NLP | | 0
Medical Image Segmentation on MRI Images with Missing Modalities: A Review | | 0
MEDIC: Remove Model Backdoors via Importance Driven Cloning | | 0
MedMAP: Promoting Incomplete Multi-modal Brain Tumor Segmentation with Alignment | | 0
MED-TEX: Transferring and Explaining Knowledge with Less Data from Pretrained Medical Imaging Models | | 0
Membership Privacy Protection for Image Translation Models via Adversarial Knowledge Distillation | | 0
MentalMAC: Enhancing Large Language Models for Detecting Mental Manipulation via Multi-Task Anti-Curriculum Distillation | | 0
MergeDistill: Merging Pre-trained Language Models using Distillation | | 0
MergeNet: Knowledge Migration across Heterogeneous Models, Tasks, and Modalities | | 0
MetaDistiller: Network Self-Boosting via Meta-Learned Top-Down Distillation | | 0
Meta-Ensemble Parameter Learning | | 0
Meta-KD: A Meta Knowledge Distillation Framework for Language Model Compression across Domains | | 0
Meta Knowledge Distillation | | 0
Meta-Learning across Meta-Tasks for Few-Shot Learning | | 0
MetaMixer: A Regularization Strategy for Online Knowledge Distillation | | 0
MH-pFLID: Model Heterogeneous personalized Federated Learning via Injection and Distillation for Medical Data Analysis | | 0
MIAShield: Defending Membership Inference Attacks via Preemptive Exclusion of Members | | 0
MICIK: MIning Cross-Layer Inherent Similarity Knowledge for Deep Model Compression | | 0
Microdosing: Knowledge Distillation for GAN based Compression | | 0
Microsoft Research Asia's Systems for WMT19 | | 0
MIKO: Multimodal Intention Knowledge Distillation from Large Language Models for Social-Media Commonsense Discovery | | 0
Mimic and Conquer: Heterogeneous Tree Structure Distillation for Syntactic NLP | | 0
MIND: Modality-Informed Knowledge Distillation Framework for Multimodal Clinical Prediction Tasks | | 0
Mind the Gap Between Synthetic and Real: Utilizing Transfer Learning to Probe the Boundaries of Stable Diffusion Generated Data | | 0
Mind the Gap: Promoting Missing Modality Brain Tumor Segmentation with Alignment | | 0
Minimally Invasive Surgery for Sparse Neural Networks in Contrastive Manner | | 0
Mini-ResEmoteNet: Leveraging Knowledge Distillation for Human-Centered Design | | 0
MiniVLN: Efficient Vision-and-Language Navigation by Progressive Knowledge Distillation | | 0
MinT: Boosting Generalization in Mathematical Reasoning via Multi-View Fine-Tuning | | 0
Mitigating Cross-client GANs-based Attack in Federated Learning | | 0
Mitigating Gender Bias in Distilled Language Models via Counterfactual Role Reversal | | 0
Mitigating Hallucination with ZeroG: An Advanced Knowledge Management Engine | | 0
Mixed Distillation Helps Smaller Language Model Better Reasoning | | 0
Mixed-Type Wafer Classification For Low Memory Devices Using Knowledge Distillation | | 0
MixKD: Towards Efficient Distillation of Large-scale Language Models | | 0
A Guide To Effectively Leveraging LLMs for Low-Resource Text Summarization: Data Augmentation and Semi-supervised Approaches | | 0
MKF-ADS: Multi-Knowledge Fusion Based Self-supervised Anomaly Detection System for Control Area Network | | 0
MK-SGN: A Spiking Graph Convolutional Network with Multimodal Fusion and Knowledge Distillation for Skeleton-based Action Recognition | | 0
MLKD-BERT: Multi-level Knowledge Distillation for Pre-trained Language Models | | 0
Multimodal Matching-aware Co-attention Networks with Mutual Knowledge Distillation for Fake News Detection | | 0
MOBA: Multi-teacher Model Based Reinforcement Learning | | 0
MobileVOS: Real-Time Video Object Segmentation Contrastive Learning meets Knowledge Distillation | | 0
Modality-Inconsistent Continual Learning of Multimodal Large Language Models | | 0
ModalityMirror: Improving Audio Classification in Modality Heterogeneity Federated Learning with Multimodal Distillation | | 0
MSD: Saliency-aware Knowledge Distillation for Multimodal Understanding | | 0
Modality-specific Distillation | | 0
Model-Agnostic Decentralized Collaborative Learning for On-Device POI Recommendation | | 0
Model Compression and Efficient Inference for Large Language Models: A Survey | | 0
Model compression for faster structural separation of macromolecules captured by Cellular Electron Cryo-Tomography | | 0
Page 66 of 85

Benchmark Results

In the tables below, "T:" denotes the teacher model and "S:" the student model.

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | | Unverified
6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 79.86 | | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 78.76 | | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 Accuracy (%) | 78.6 | | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 78.28 | | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 Accuracy (%) | 78.08 | | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 77.93 | | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 Accuracy (%) | 77.68 | | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 77.5 | | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.68 | | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.31 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | | Unverified