
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model (the teacher) to a smaller one (the student). While large models, such as very deep neural networks or ensembles of many models, have greater knowledge capacity than small models, that capacity is often not fully utilized, so a compact student can recover much of the teacher's accuracy at a fraction of the inference cost.
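Most of the methods listed on this page build on the classic soft-target formulation of Hinton et al. (2015): the student minimizes a blend of the usual cross-entropy on ground-truth labels and a KL-divergence term that matches the teacher's temperature-softened output distribution. A minimal PyTorch sketch of that loss follows; the temperature and weighting defaults are illustrative, not taken from any paper above.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Soft-target distillation loss (Hinton et al., 2015).

    T and alpha are illustrative defaults; papers tune them per task.
    """
    # Temperature-softened distributions for teacher and student.
    soft_teacher = F.softmax(teacher_logits / T, dim=-1)
    log_soft_student = F.log_softmax(student_logits / T, dim=-1)
    # The T^2 factor keeps the soft term's gradient magnitude
    # comparable across temperatures.
    kd = F.kl_div(log_soft_student, soft_teacher, reduction="batchmean") * (T * T)
    # Standard hard-label cross-entropy on the ground truth.
    ce = F.cross_entropy(student_logits, labels)
    return alpha * kd + (1.0 - alpha) * ce
```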

Papers

Showing 1–50 of 4240 papers

Title | Status | Hype
Visual-Language Model Knowledge Distillation Method for Image Quality Assessment | - | 0
Uncertainty-Aware Cross-Modal Knowledge Distillation with Prototype Learning for Multimodal Brain-Computer Interfaces | - | 0
DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition | Code | 0
HanjaBridge: Resolving Semantic Ambiguity in Korean LLMs via Hanja-Augmented Pre-Training | - | 0
Feature Distillation is the Better Choice for Model-Heterogeneous Federated Learning | - | 0
SFedKD: Sequential Federated Learning with Discrepancy-Aware Multi-Teacher Knowledge Distillation | - | 0
Towards Collaborative Fairness in Federated Learning Under Imbalanced Covariate Shift | - | 0
KAT-V1: Kwai-AutoThink Technical Report | - | 0
The Trilemma of Truth in Large Language Models | Code | 0
Layer Importance for Mathematical Reasoning is Forged in Pre-Training and Invariant after Post-Training | - | 0
Distilling Normalizing Flows | - | 0
G^2D: Boosting Multimodal Learning with Gradient-Guided Distillation | Code | 0
Continual Self-Supervised Learning with Masked Autoencoders in Remote Sensing | - | 0
Building Lightweight Semantic Segmentation Models for Aerial Images Using Dual Relation Distillation | - | 0
Towards Scalable and Generalizable Earth Observation Data Mining via Foundation Model Composition | - | 0
FedBKD: Distilled Federated Learning to Embrace Generalization and Personalization on Non-IID Data | Code | 0
Tackling Data Heterogeneity in Federated Learning through Knowledge Distillation with Inequitable Aggregation | Code | 0
Client Clustering Meets Knowledge Sharing: Enhancing Privacy and Robustness in Personalized Peer-to-Peer Learning | - | 0
Recalling The Forgotten Class Memberships: Unlearned Models Can Be Noisy Labelers to Leak Privacy | - | 0
Distillation-Enabled Knowledge Alignment for Generative Semantic Communications in AIGC Provisioning Tasks | - | 0
GNN's Uncertainty Quantification using Self-Distillation | Code | 0
PicoSAM2: Low-Latency Segmentation In-Sensor for Edge Vision Applications | - | 0
Efficient and Generalizable Speaker Diarization via Structured Pruning of Self-Supervised Models | Code | 3
Multimodal Fusion SLAM with Fourier Attention | Code | 0
Enhancing Few-shot Keyword Spotting Performance through Pre-Trained Self-supervised Speech Models | - | 0
Fine-grained Image Retrieval via Dual-Vision Adaptation | - | 0
Knowledge Distillation Framework for Accelerating High-Accuracy Neural Network-Based Molecular Dynamics Simulations | - | 0
Factorized RVQ-GAN For Disentangled Speech Tokenization | - | 0
AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes | - | 0
Model compression using knowledge distillation with integrated gradients | - | 0
KDMOS: Knowledge Distillation for Motion Segmentation | Code | 0
Lightweight Task-Oriented Semantic Communication Empowered by Large-Scale AI Models | - | 0
SeqPE: Transformer with Sequential Position Encoding | Code | 1
HKD4VLM: A Progressive Hybrid Knowledge Distillation Framework for Robust Multimodal Hallucination and Factuality Detection in VLMs | - | 0
A Technical Study into Small Reasoning Language Models | - | 0
Ground Reaction Force Estimation via Time-aware Knowledge Distillation | - | 0
A Novel Lightweight Transformer with Edge-Aware Fusion for Remote Sensing Image Captioning | - | 0
Multi-Teacher Language-Aware Knowledge Distillation for Multilingual Speech Emotion Recognition | Code | 0
SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning | Code | 1
Towards Class-wise Fair Adversarial Training via Anti-Bias Soft Label Distillation | Code | 0
Label-Context-Dependent Internal Language Model Estimation for CTC | - | 0
Being Strong Progressively! Enhancing Knowledge Distillation of Large Language Models through a Curriculum Learning Framework | Code | 0
StatsMerging: Statistics-Guided Model Merging via Task-Specific Teacher Distillation | Code | 0
Static Word Embeddings for Sentence Semantic Representation | - | 0
hdl2v: A Code Translation Dataset for Enhanced LLM Verilog Generation | - | 0
Debate, Reflect, and Distill: Multi-Agent Feedback with Tree-Structured Preference Optimization for Efficient Language Model Enhancement | - | 0
QA-HFL: Quality-Aware Hierarchical Federated Learning for Resource-Constrained Mobile Devices with Heterogeneous Image Quality | - | 0
Building a Few-Shot Cross-Domain Multilingual NLU Model for Customer Care | - | 0
TalkingMachines: Real-Time Audio-Driven FaceTime-Style Video via Autoregressive Diffusion Models | - | 0
KDRL: Post-Training Reasoning LLMs via Unified Knowledge Distillation and Reinforcement Learning | - | 0

Benchmark Results

In the leaderboards below, "T:" names the teacher model and "S:" the student; "Claimed" is the metric value reported in the source paper, and the "Verified" column is left blank while a result's status is Unverified.

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | - | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | - | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | - | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | - | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | - | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | - | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | - | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | - | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | - | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | - | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | - | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | - | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | - | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | - | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | - | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | - | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | - | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | - | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet-101, S: ResNet-50) | mAP | 93.17 | - | Unverified
2 | LSHFM (T: ResNet-101, S: MobileNetV2) | mAP | 90.14 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: AdaBins, S: MobileNetV2) | RMSE | 2.43 | - | Unverified
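
Each (T: …, S: …) entry above pairs a frozen teacher with a trainable student. A minimal training-step sketch, assuming torchvision classifiers and the distillation_loss defined earlier; the architecture pair and optimizer settings are placeholders, not a reproduction of any listed result.

```python
import torch
import torchvision.models as models

# Hypothetical teacher/student pair in the spirit of the tables above
# (cf. LSHFM's T: ResNet-101, S: ResNet-50); any classifier pair works.
teacher = models.resnet101(weights=models.ResNet101_Weights.DEFAULT).eval()
student = models.resnet50(num_classes=1000)

for p in teacher.parameters():
    p.requires_grad_(False)  # the teacher is frozen; only the student learns

optimizer = torch.optim.SGD(student.parameters(), lr=0.1, momentum=0.9,
                            weight_decay=1e-4)

def train_step(images: torch.Tensor, labels: torch.Tensor) -> float:
    with torch.no_grad():
        teacher_logits = teacher(images)  # soft targets only, no gradients
    student_logits = student(images)
    loss = distillation_loss(student_logits, teacher_logits, labels)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```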