Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 951–1000 of 4240 papers

Title	Date	Tasks	Status	Hype
Multi-Label Image Classification via Knowledge Distillation from Weakly-Supervised Detection	Sep 16, 2018	ClassificationGeneral Classification	CodeCode Available	1
Channel Gating Neural Networks	May 29, 2018	Knowledge DistillationNetwork Pruning	CodeCode Available	1
Grad-CAM++: Improved Visual Explanations for Deep Convolutional Networks	Oct 30, 2017	3D Action RecognitionAction Recognition	CodeCode Available	1
Paying More Attention to Attention: Improving the Performance of Convolutional Neural Networks via Attention Transfer	Dec 12, 2016	Knowledge Distillation	CodeCode Available	1
Sequence-Level Knowledge Distillation	Jun 25, 2016	Knowledge DistillationMachine Translation	CodeCode Available	1
Distilling the Knowledge in a Neural Network	Mar 9, 2015	Knowledge DistillationMixture-of-Experts	CodeCode Available	1
FitNets: Hints for Thin Deep Nets	Dec 19, 2014	Knowledge Distillation	CodeCode Available	1
Visual-Language Model Knowledge Distillation Method for Image Quality Assessment	Jul 21, 2025	Image Quality AssessmentKnowledge Distillation	—Unverified	0
Uncertainty-Aware Cross-Modal Knowledge Distillation with Prototype Learning for Multimodal Brain-Computer Interfaces	Jul 17, 2025	EEGKnowledge Distillation	—Unverified	0
DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition	Jul 16, 2025	BenchmarkingKnowledge Distillation	CodeCode Available	0
HanjaBridge: Resolving Semantic Ambiguity in Korean LLMs via Hanja-Augmented Pre-Training	Jul 15, 2025	Cross-Lingual TransferKnowledge Distillation	—Unverified	0
Feature Distillation is the Better Choice for Model-Heterogeneous Federated Learning	Jul 14, 2025	Federated LearningKnowledge Distillation	—Unverified	0
KAT-V1: Kwai-AutoThink Technical Report	Jul 11, 2025	Knowledge DistillationLarge Language Model	—Unverified	0
SFedKD: Sequential Federated Learning with Discrepancy-Aware Multi-Teacher Knowledge Distillation	Jul 11, 2025	Federated LearningKnowledge Distillation	—Unverified	0
Towards Collaborative Fairness in Federated Learning Under Imbalanced Covariate Shift	Jul 11, 2025	Collaborative FairnessFairness	—Unverified	0
The Trilemma of Truth in Large Language Models	Jun 30, 2025	AttributeConformal Prediction	CodeCode Available	0
Layer Importance for Mathematical Reasoning is Forged in Pre-Training and Invariant after Post-Training	Jun 27, 2025	Knowledge DistillationMathematical Reasoning	—Unverified	0
Continual Self-Supervised Learning with Masked Autoencoders in Remote Sensing	Jun 26, 2025	Continual LearningContinual Self-Supervised Learning	—Unverified	0
Distilling Normalizing Flows	Jun 26, 2025	Density EstimationKnowledge Distillation	—Unverified	0
G^2D: Boosting Multimodal Learning with Gradient-Guided Distillation	Jun 26, 2025	Knowledge DistillationModel Optimization	CodeCode Available	0
Tackling Data Heterogeneity in Federated Learning through Knowledge Distillation with Inequitable Aggregation	Jun 25, 2025	Federated LearningKnowledge Distillation	CodeCode Available	0
FedBKD: Distilled Federated Learning to Embrace Gerneralization and Personalization on Non-IID Data	Jun 25, 2025	Federated LearningKnowledge Distillation	CodeCode Available	0
Towards Scalable and Generalizable Earth Observation Data Mining via Foundation Model Composition	Jun 25, 2025	Earth ObservationKnowledge Distillation	—Unverified	0
Client Clustering Meets Knowledge Sharing: Enhancing Privacy and Robustness in Personalized Peer-to-Peer Learning	Jun 25, 2025	Knowledge DistillationTransfer Learning	—Unverified	0
Building Lightweight Semantic Segmentation Models for Aerial Images Using Dual Relation Distillation	Jun 25, 2025	Knowledge DistillationRelation	—Unverified	0
Distillation-Enabled Knowledge Alignment for Generative Semantic Communications in AIGC Provisioning Tasks	Jun 24, 2025	Knowledge DistillationSemantic Communication	—Unverified	0
Recalling The Forgotten Class Memberships: Unlearned Models Can Be Noisy Labelers to Leak Privacy	Jun 24, 2025	Knowledge DistillationLearning with noisy labels	—Unverified	0
GNN's Uncertainty Quantification using Self-Distillation	Jun 24, 2025	Knowledge DistillationUncertainty Quantification	CodeCode Available	0
PicoSAM2: Low-Latency Segmentation In-Sensor for Edge Vision Applications	Jun 23, 2025	Knowledge DistillationPrivacy Preserving	—Unverified	0
Multimodal Fusion SLAM with Fourier Attention	Jun 22, 2025	Knowledge DistillationOptical Flow Estimation	CodeCode Available	0
Enhancing Few-shot Keyword Spotting Performance through Pre-Trained Self-supervised Speech Models	Jun 21, 2025	Dimensionality ReductionKeyword Spotting	—Unverified	0
Fine-grained Image Retrieval via Dual-Vision Adaptation	Jun 19, 2025	Image RetrievalKnowledge Distillation	—Unverified	0
Knowledge Distillation Framework for Accelerating High-Accuracy Neural Network-Based Molecular Dynamics Simulations	Jun 18, 2025	Knowledge Distillation	—Unverified	0
Factorized RVQ-GAN For Disentangled Speech Tokenization	Jun 18, 2025	DisentanglementKnowledge Distillation	—Unverified	0
KDMOS:Knowledge Distillation for Motion Segmentation	Jun 17, 2025	Autonomous DrivingKnowledge Distillation	CodeCode Available	0
Model compression using knowledge distillation with integrated gradients	Jun 17, 2025	Data AugmentationKnowledge Distillation	—Unverified	0
AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes	Jun 17, 2025	Knowledge DistillationTransfer Learning	—Unverified	0
Lightweight Task-Oriented Semantic Communication Empowered by Large-Scale AI Models	Jun 16, 2025	Knowledge DistillationSemantic Communication	—Unverified	0
A Technical Study into Small Reasoning Language Models	Jun 16, 2025	Code GenerationComputational Efficiency	—Unverified	0
HKD4VLM: A Progressive Hybrid Knowledge Distillation Framework for Robust Multimodal Hallucination and Factuality Detection in VLMs	Jun 16, 2025	HallucinationKnowledge Distillation	—Unverified	0
Ground Reaction Force Estimation via Time-aware Knowledge Distillation	Jun 12, 2025	Knowledge Distillation	—Unverified	0
A Novel Lightweight Transformer with Edge-Aware Fusion for Remote Sensing Image Captioning	Jun 11, 2025	DecoderImage Captioning	—Unverified	0
Multi-Teacher Language-Aware Knowledge Distillation for Multilingual Speech Emotion Recognition	Jun 10, 2025	Emotion RecognitionKnowledge Distillation	CodeCode Available	0
Towards Class-wise Fair Adversarial Training via Anti-Bias Soft Label Distillation	Jun 10, 2025	Adversarial RobustnessFairness	CodeCode Available	0
Being Strong Progressively! Enhancing Knowledge Distillation of Large Language Models through a Curriculum Learning Framework	Jun 6, 2025	Instruction FollowingKnowledge Distillation	CodeCode Available	0
Label-Context-Dependent Internal Language Model Estimation for CTC	Jun 6, 2025	Knowledge DistillationLanguage Modeling	—Unverified	0
hdl2v: A Code Translation Dataset for Enhanced LLM Verilog Generation	Jun 5, 2025	Code GenerationCode Translation	—Unverified	0
Static Word Embeddings for Sentence Semantic Representation	Jun 5, 2025	Contrastive LearningKnowledge Distillation	—Unverified	0
StatsMerging: Statistics-Guided Model Merging via Task-Specific Teacher Distillation	Jun 5, 2025	Knowledge Distillation	CodeCode Available	0
Debate, Reflect, and Distill: Multi-Agent Feedback with Tree-Structured Preference Optimization for Efficient Language Model Enhancement	Jun 4, 2025	Knowledge DistillationLanguage Modeling	—Unverified	0

Show:10 25 50

← PrevPage 20 of 85Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified