SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized. Distillation trains a small student model to reproduce the behavior of a large teacher model, typically by matching the teacher's output distributions, so much of the teacher's accuracy can be retained at a fraction of the inference cost.
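
As a point of reference for the methods listed below, here is a minimal sketch of the classic soft-target distillation loss (Hinton et al., 2015) in PyTorch. It is a generic illustration rather than the method of any particular paper on this page; the temperature T, weight alpha, and all variable names are placeholder choices.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.7):
    """Classic KD objective: a KL term on temperature-softened logits
    blended with ordinary cross-entropy on the hard labels."""
    soft_targets = F.softmax(teacher_logits / T, dim=-1)      # teacher's soft labels
    log_student = F.log_softmax(student_logits / T, dim=-1)   # student log-probs at the same T
    # Scaling by T^2 keeps the soft-target gradients comparable across temperatures.
    kd_term = F.kl_div(log_student, soft_targets, reduction="batchmean") * (T * T)
    ce_term = F.cross_entropy(student_logits, labels)
    return alpha * kd_term + (1.0 - alpha) * ce_term

# Hypothetical training step: the teacher is frozen, only the student is updated.
# teacher.eval()
# with torch.no_grad():
#     teacher_logits = teacher(images)
# loss = distillation_loss(student(images), teacher_logits, labels)
# loss.backward(); optimizer.step()
```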

Papers

Showing 251-300 of 4,240 papers

Title | Status | Hype
Training Domain Draft Models for Speculative Decoding: Best Practices and Insights | - | 0
Distilling Knowledge into Quantum Vision Transformers for Biomedical Image Classification | - | 0
PTMs-TSCIL Pre-Trained Models Based Class-Incremental Learning | - | 0
Task-Specific Knowledge Distillation from the Vision Foundation Model for Enhanced Medical Image Segmentation | - | 0
ADROIT: A Self-Supervised Framework for Learning Robust Representations for Active Learning | - | 0
CoT-Drive: Efficient Motion Forecasting for Autonomous Driving with LLMs and Chain-of-Thought Prompting | - | 0
Small Vision-Language Models: A Survey on Compact Architectures and Techniques | - | 0
Causality Enhanced Origin-Destination Flow Prediction in Data-Scarce Cities | - | 0
HFedCKD: Toward Robust Heterogeneous Federated Learning via Data-free Knowledge Distillation and Two-way Contrast | - | 0
Asymmetric Decision-Making in Online Knowledge Distillation: Unifying Consensus and Divergence | - | 0
Improving SAM for Camouflaged Object Detection via Dual Stream Adapters | - | 0
ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation | - | 0
Semantic Shift Estimation via Dual-Projection and Classifier Reconstruction for Exemplar-Free Class-Incremental Learning | Code | 1
Spatial Distillation based Distribution Alignment (SDDA) for Cross-Headset EEG Classification | Code | 1
No Forgetting Learning: Memory-free Continual Learning | - | 0
Lightweight Embedded FPGA Deployment of Learned Image Compression with Knowledge Distillation and Hybrid Quantization | - | 0
Self-Supervised Z-Slice Augmentation for 3D Bio-Imaging via Knowledge Distillation | Code | 0
Temporal Separation with Entropy Regularization for Knowledge Distillation in Spiking Neural Networks | - | 0
Rapid Bone Scintigraphy Enhancement via Semantic Prior Distillation from Segment Anything Model | - | 0
Mamba base PKD for efficient knowledge compression | - | 0
DILEMMA: Joint LLM Quantization and Distributed LLM Inference Over Edge Computing Systems | - | 0
VRM: Knowledge Distillation via Virtual Relation Matching | - | 0
Investigating and Enhancing Vision-Audio Capability in Omnimodal Large Language Models | - | 0
SEKI: Self-Evolution and Knowledge Inspiration based Neural Architecture Search via Large Language Models | - | 0
Granite Embedding Models | - | 0
XCOMPS: A Multilingual Benchmark of Conceptual Minimal Pairs | - | 0
Lightweight Contrastive Distilled Hashing for Online Cross-modal Retrieval | - | 0
Beyond the Tip of Efficiency: Uncovering the Submerged Threats of Jailbreak Attacks in Small Language Models | - | 0
Winning Big with Small Models: Knowledge Distillation vs. Self-Training for Reducing Hallucination in QA Agents | - | 0
AfroXLMR-Comet: Multilingual Knowledge Distillation with Attention Matching for Low-Resource languages | - | 0
Advantage-Guided Distillation for Preference Alignment in Small Language Models | Code | 1
From underwater to aerial: a novel multi-scale knowledge distillation approach for coral reef monitoring | Code | 0
Knowledge Distillation with Training Wheels | - | 0
CLIMB-3D: Continual Learning for Imbalanced 3D Instance Segmentation | Code | 0
PQDAST: Depth-Aware Arbitrary Style Transfer for Games via Perceptual Quality-Guided Distillation | - | 0
A Transformer-in-Transformer Network Utilizing Knowledge Distillation for Image Recognition | - | 0
Implicit Word Reordering with Knowledge Distillation for Cross-Lingual Dependency Parsing | - | 0
CoT2Align: Cross-Chain of Thought Distillation via Optimal Transport Alignment for Language Models with Different Tokenizers | - | 0
Improving the Transferability of Adversarial Examples by Inverse Knowledge Distillation | - | 0
EDocNet: Efficient Datasheet Layout Analysis Based on Focus and Global Knowledge Distillation | - | 0
Scaling Sparse and Dense Retrieval in Decoder-Only LLMs | Code | 1
PPC-GPT: Federated Task-Specific Compression of Large Language Models via Pruning and Chain-of-Thought Distillation | - | 0
A Knowledge Distillation-Based Approach to Enhance Transparency of Classifier Models | Code | 0
TimeDistill: Efficient Long-Term Time Series Forecasting with MLP via Cross-Architecture Distillation | - | 0
Self-supervised Monocular Depth Estimation Robust to Reflective Surface Leveraged by Triplet Mining | - | 0
Efficient AI in Practice: Training and Deployment of Efficient LLMs for Industry Applications | - | 0
Modifying Final Splits of Classification Tree for Fine-tuning Subpopulation Target in Policy Making | - | 0
Vision Foundation Models in Medical Image Analysis: Advances and Challenges | - | 0
Designing Parameter and Compute Efficient Diffusion Transformers using Distillation | - | 0
Dynamic Activation with Knowledge Distillation for Energy-Efficient Spiking NN Ensembles | - | 0
Page 6 of 85

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | - | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | - | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | - | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | - | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | - | Unverified
6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | - | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | - | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | - | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | - | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 79.86 | - | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 78.76 | - | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 Accuracy (%) | 78.6 | - | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 78.28 | - | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 Accuracy (%) | 78.08 | - | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 77.93 | - | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 Accuracy (%) | 77.68 | - | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 77.5 | - | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.68 | - | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.31 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | - | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | - | Unverified