
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model (the teacher) to a smaller one (the student). While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized, so a student trained to mimic the teacher can often recover most of its accuracy at a fraction of the inference cost.
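
The recipe most work in this area builds on is the soft-target loss of Hinton et al. (2015): the student is trained to match the teacher's temperature-softened output distribution in addition to the ground-truth labels. A minimal PyTorch sketch is below; the temperature and mixing weight are illustrative defaults, not values from any paper listed on this page.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.9):
    # Soften both distributions so the teacher's inter-class
    # similarities ("dark knowledge") survive the softmax.
    soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    # The T^2 factor keeps the soft-target gradient magnitude
    # comparable to the hard-label term as the temperature changes.
    kd = F.kl_div(soft_student, soft_teacher,
                  reduction="batchmean") * temperature ** 2
    ce = F.cross_entropy(student_logits, labels)
    return alpha * kd + (1.0 - alpha) * ce

# Toy usage: random logits for a batch of 8 examples, 10 classes.
student = torch.randn(8, 10, requires_grad=True)
teacher = torch.randn(8, 10)
labels = torch.randint(0, 10, (8,))
loss = distillation_loss(student, teacher, labels)
loss.backward()
```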

Papers

Showing 1201–1250 of 4240 papers

Title | Status | Hype
XCOMPS: A Multilingual Benchmark of Conceptual Minimal Pairs | - | 0
Beyond the Tip of Efficiency: Uncovering the Submerged Threats of Jailbreak Attacks in Small Language Models | - | 0
SEKI: Self-Evolution and Knowledge Inspiration based Neural Architecture Search via Large Language Models | - | 0
Lightweight Contrastive Distilled Hashing for Online Cross-modal Retrieval | - | 0
Granite Embedding Models | - | 0
Winning Big with Small Models: Knowledge Distillation vs. Self-Training for Reducing Hallucination in QA Agents | - | 0
AfroXLMR-Comet: Multilingual Knowledge Distillation with Attention Matching for Low-Resource languages | - | 0
From underwater to aerial: a novel multi-scale knowledge distillation approach for coral reef monitoring | Code | 0
Improving the Transferability of Adversarial Examples by Inverse Knowledge Distillation | - | 0
Implicit Word Reordering with Knowledge Distillation for Cross-Lingual Dependency Parsing | - | 0
CoT2Align: Cross-Chain of Thought Distillation via Optimal Transport Alignment for Language Models with Different Tokenizers | - | 0
A Transformer-in-Transformer Network Utilizing Knowledge Distillation for Image Recognition | - | 0
CLIMB-3D: Continual Learning for Imbalanced 3D Instance Segmentation | Code | 0
PQDAST: Depth-Aware Arbitrary Style Transfer for Games via Perceptual Quality-Guided Distillation | - | 0
Knowledge Distillation with Training Wheels | - | 0
EDocNet: Efficient Datasheet Layout Analysis Based on Focus and Global Knowledge Distillation | - | 0
A Knowledge Distillation-Based Approach to Enhance Transparency of Classifier Models | Code | 0
PPC-GPT: Federated Task-Specific Compression of Large Language Models via Pruning and Chain-of-Thought Distillation | - | 0
Self-supervised Monocular Depth Estimation Robust to Reflective Surface Leveraged by Triplet Mining | - | 0
Designing Parameter and Compute Efficient Diffusion Transformers using Distillation | - | 0
Efficient AI in Practice: Training and Deployment of Efficient LLMs for Industry Applications | - | 0
TimeDistill: Efficient Long-Term Time Series Forecasting with MLP via Cross-Architecture Distillation | - | 0
Vision Foundation Models in Medical Image Analysis: Advances and Challenges | - | 0
Modifying Final Splits of Classification Tree for Fine-tuning Subpopulation Target in Policy Making | - | 0
Dynamic Activation with Knowledge Distillation for Energy-Efficient Spiking NN Ensembles | - | 0
Towards Vector Optimization on Low-Dimensional Vector Symbolic Architecture | - | 0
Capturing Rich Behavior Representations: A Dynamic Action Semantic-Aware Graph Transformer for Video Captioning | - | 0
MambaLiteSR: Image Super-Resolution with Low-Rank Mamba using Knowledge Distillation | - | 0
Integrating Arithmetic Learning Improves Mathematical Reasoning in Smaller Models | - | 0
Enhancing Semi-supervised Learning with Zero-shot Pseudolabels | - | 0
NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions | - | 0
Every Expert Matters: Towards Effective Knowledge Distillation for Mixture-of-Experts Language Models | - | 0
Does Training with Synthetic Data Truly Protect Privacy? | Code | 0
Leave No One Behind: Enhancing Diversity While Maintaining Accuracy in Social Recommendation | Code | 0
Warmup-Distill: Bridge the Distribution Mismatch between Teacher and Student before Knowledge Distillation | Code | 0
Smoothing Out Hallucinations: Mitigating LLM Hallucination with Smoothed Knowledge Distillation | - | 0
Leveraging Conditional Mutual Information to Improve Large Language Model Fine-Tuning For Classification | - | 0
LLM-driven Knowledge Distillation for Dynamic Text-Attributed Graphs | - | 0
CLoCKDistill: Consistent Location-and-Context-aware Knowledge Distillation for DETRs | - | 0
AIDE: Agentically Improve Visual Language Model with Domain Experts | - | 0
LLM Pretraining with Continuous Concepts | - | 0
Life-Code: Central Dogma Modeling with Multi-Omics Sequence Unification | - | 0
OpenGrok: Enhancing SNS Data Processing with Distilled Knowledge and Mask-like Mechanisms | Code | 0
Vision-Language Models for Edge Networks: A Comprehensive Survey | - | 0
Optimizing Knowledge Distillation in Transformers: Enabling Multi-Head Attention without Alignment Barriers | - | 0
Progressive Collaborative and Semantic Knowledge Fusion for Generative Recommendation | - | 0
Rationalization Models for Text-to-SQL | - | 0
Right Time to Learn: Promoting Generalization via Bio-inspired Spacing Effect in Knowledge Distillation | Code | 0
DROP: Poison Dilution via Knowledge Distillation for Federated Learning | Code | 0
Contrastive Representation Distillation via Multi-Scale Feature Decoupling | - | 0
Page 25 of 85

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | - | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | - | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | - | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | - | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | - | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | - | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | - | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | - | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | - | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 79.86 | - | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 78.76 | - | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 Accuracy (%) | 78.6 | - | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 78.28 | - | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 Accuracy (%) | 78.08 | - | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 77.93 | - | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 Accuracy (%) | 77.68 | - | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 77.5 | - | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.68 | - | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.31 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | - | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | - | Unverified