
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity is often not fully utilized, so a compact student model can frequently recover much of the large teacher's accuracy at a fraction of the computational cost.
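
In the classic formulation (Hinton et al., 2015), the student is trained to match the teacher's temperature-softened output distribution in addition to the ground-truth labels. Below is a minimal PyTorch sketch of that loss; the `temperature` and `alpha` defaults are illustrative assumptions, not values taken from any paper listed on this page.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.9):
    """Soft-target KD loss (Hinton et al., 2015) plus hard-label CE.

    temperature and alpha are illustrative defaults, not values drawn
    from any of the benchmark entries below.
    """
    # Soften both output distributions and take the KL divergence
    # from the student's log-probabilities to the teacher's probabilities.
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    log_soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    kd = F.kl_div(log_soft_student, soft_teacher, reduction="batchmean")
    # Scale by T^2 so gradient magnitudes stay comparable across temperatures.
    kd = kd * temperature ** 2
    # Ordinary cross-entropy against the ground-truth labels.
    ce = F.cross_entropy(student_logits, labels)
    return alpha * kd + (1.0 - alpha) * ce
```

In a typical training loop the teacher runs in eval mode under torch.no_grad(), and only the student's parameters are updated.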

Papers

Showing 2201–2250 of 4240 papers

Enhancing Adversarial Training with Prior Knowledge Distillation for Robust Image Compression
Enhancing Chinese Multi-Label Text Classification Performance with Response-based Knowledge Distillation
Enhancing Content Representation for AR Image Quality Assessment Using Knowledge Distillation
Enhancing CTC-Based Visual Speech Recognition
Enhancing Data-Free Adversarial Distillation with Activation Regularization and Virtual Interpolation
Enhancing Few-shot Keyword Spotting Performance through Pre-Trained Self-supervised Speech Models
Enhancing Generalization in Chain of Thought Reasoning for Smaller Models
Enhancing Mapless Trajectory Prediction through Knowledge Distillation
Enhancing Modality-Agnostic Representations via Meta-Learning for Brain Tumor Segmentation
Enhancing Once-For-All: A Study on Parallel Blocks, Skip Connections and Early Exits
Enhancing Review Comprehension with Domain-Specific Commonsense
Enhancing Romanian Offensive Language Detection through Knowledge Distillation, Multi-Task Learning, and Data Augmentation
Enhancing Scalability in Recommender Systems through Lottery Ticket Hypothesis and Knowledge Distillation-based Neural Network Pruning
Enhancing Semi-supervised Learning with Zero-shot Pseudolabels
Enhancing Single-Slice Segmentation with 3D-to-2D Unpaired Scan Distillation
Enhancing SLM via ChatGPT and Dataset Augmentation
Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic
Ensemble Knowledge Distillation for CTR Prediction
Ensemble Distillation for Neural Machine Translation
Ensemble Knowledge Distillation for Machine Learning Interatomic Potentials
Ensemble knowledge distillation of self-supervised speech models
Ensembling of Distilled Models from Multi-task Teachers for Constrained Resource Language Pairs
EnSiam: Self-Supervised Learning With Ensemble Representations
Entire-Space Variational Information Exploitation for Post-Click Conversion Rate Prediction
EPIK: Eliminating multi-model Pipelines with Knowledge-distillation
EPSD: Early Pruning with Self-Distillation for Efficient Model Compression
ERNIE-Search: Bridging Cross-Encoder with Dual-Encoder via Self On-the-fly Distillation for Dense Passage Retrieval
Error Exponent in Agnostic PAC Learning
ESLM: Risk-Averse Selective Language Modeling for Efficient Pretraining
ESPnet How2 Speech Translation System for IWSLT 2019: Pre-training, Knowledge Distillation, and Going Deeper
ESPnet-ST IWSLT 2021 Offline Speech Translation System
Essence Knowledge Distillation for Speech Recognition
Estimating and Maximizing Mutual Information for Knowledge Distillation
Estimating Human Poses Across Datasets: A Unified Skeleton and Multi-Teacher Distillation Approach
Evaluation-oriented Knowledge Distillation for Deep Face Recognition
Ever Evolving Evaluator (EV3): Towards Flexible and Reliable Meta-Optimization for Knowledge Distillation
Every Expert Matters: Towards Effective Knowledge Distillation for Mixture-of-Experts Language Models
Evidential Federated Learning for Skin Lesion Image Classification
EVOKE: Emotion Enabled Virtual Avatar Mapping Using Optimized Knowledge Distillation
Evolving Knowledge Distillation with Large Language Models and Active Learning
Evolving Storytelling: Benchmarks and Methods for New Character Customization with Diffusion Models
Examining the Mapping Functions of Denoising Autoencoders in Singing Voice Separation
Exclusivity-Consistency Regularized Knowledge Distillation for Face Recognition
Expanding Deep Learning-based Sensing Systems with Multi-Source Knowledge Transfer
ExpandNets: Linear Over-parameterization to Train Compact Convolutional Networks
Expediting Contrastive Language-Image Pretraining via Self-distilled Encoders
Experimentation in Content Moderation using RWKV
Experimenting with Knowledge Distillation techniques for performing Brain Tumor Segmentation
Explainability-Driven Leaf Disease Classification Using Adversarial Training and Knowledge Distillation
Explainable Knowledge Distillation for On-device Chest X-Ray Classification
Page 45 of 85

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | – | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | – | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | – | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | – | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | – | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | – | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | – | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | – | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | – | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | – | Unverified
2 | shufflenet-v2 (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | – | Unverified
3 | MV-MR (T: CLIP ViT-B/16, S: resnet50) | Top-1 accuracy (%) | 78.6 | – | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | – | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | – | Unverified
6 | ReviewKD++ (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | – | Unverified
7 | ReviewKD++ (T: resnet32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | – | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | – | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | – | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | – | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: AdaBins, S: MobileNetV2) | RMSE | 2.43 | – | Unverified
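
For reference, the Top-1 accuracy reported in the tables above is simply the percentage of examples whose highest-scoring class prediction matches the label. A minimal sketch of that computation, assuming a hypothetical `model` and a `loader` yielding `(images, labels)` batches (neither is taken from any listed paper):

```python
import torch

@torch.no_grad()
def top1_accuracy(model, loader, device="cpu"):
    """Percentage of samples whose argmax prediction equals the label."""
    model.eval()
    correct, total = 0, 0
    for images, labels in loader:
        logits = model(images.to(device))
        preds = logits.argmax(dim=-1)          # highest-scoring class per sample
        correct += (preds == labels.to(device)).sum().item()
        total += labels.size(0)
    return 100.0 * correct / total             # percentage, as reported above
```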