
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity may not be fully utilized, yet running the large model at inference time costs just as much either way. Distillation trains a compact student to mimic the large teacher's outputs, often retaining much of the teacher's accuracy at a fraction of the inference cost.
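
The classic recipe underlying many of the papers and benchmarks below is Hinton-style distillation: the student is trained on a blend of temperature-softened teacher outputs and the ground-truth labels. The following is a minimal, illustrative PyTorch sketch, assuming pre-trained teacher and student classifiers are already in hand; the function name and the hyperparameter values are placeholders for illustration, not taken from any paper listed here.

import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.7):
    # Illustrative sketch of Hinton-style distillation; names and
    # hyperparameters are placeholders, not from any specific paper.
    # Soft-target term: KL divergence between temperature-softened
    # teacher and student distributions. The T^2 factor rescales
    # gradients so the two terms stay balanced across temperatures.
    soft = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * temperature ** 2
    # Hard-label term: ordinary cross-entropy on ground-truth labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard

# One training step (teacher frozen, student updated):
# teacher.eval()
# with torch.no_grad():
#     teacher_logits = teacher(images)
# loss = distillation_loss(student(images), teacher_logits, labels)
# loss.backward()

Here alpha trades off mimicking the teacher against fitting the labels, and a higher temperature exposes more of the teacher's "dark knowledge" about inter-class similarity; for an ensemble teacher, the averaged ensemble logits can stand in for teacher_logits.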

Papers

Showing 451–500 of 4240 papers

Title | Status | Hype
Knowledge Distillation in RNN-Attention Models for Early Prediction of Student Performance | Code | 0
SCKD: Semi-Supervised Cross-Modality Knowledge Distillation for 4D Radar Object Detection | Code | 0
Enhancing Knowledge Distillation for LLMs with Response-Priming Prompting | Code | 0
Canine EEG Helps Human: Cross-Species and Cross-Modality Epileptic Seizure Detection via Multi-Space Alignment | — | 0
A Survey on Inference Optimization Techniques for Mixture of Experts Models | Code | 3
Learnable Prompting SAM-induced Knowledge Distillation for Semi-supervised Medical Image Segmentation | Code | 2
Hybrid Data-Free Knowledge Distillation | Code | 0
On Explaining Knowledge Distillation: Measuring and Visualising the Knowledge Transfer Process | — | 0
Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective | Code | 0
On the Compression of Language Models for Code: An Empirical Study on CodeBERT | — | 0
Entire-Space Variational Information Exploitation for Post-Click Conversion Rate Prediction | — | 0
Split Knowledge Distillation for Large Models in IoT: Architecture, Challenges, and Solutions | — | 0
In-Context Learning Distillation for Efficient Few-Shot Fine-Tuning | — | 0
Modality-Inconsistent Continual Learning of Multimodal Large Language Models | — | 0
Efficient Speech Command Recognition Leveraging Spiking Neural Network and Curriculum Learning-based Knowledge Distillation | — | 0
PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts | — | 0
Relation-Guided Adversarial Learning for Data-free Knowledge Transfer | Code | 1
BiM-VFI: Bidirectional Motion Field-Guided Frame Interpolation for Video with Non-uniform Motions | Code | 2
Neural Collapse Inspired Knowledge Distillation | — | 0
Active Large Language Model-based Knowledge Distillation for Session-based Recommendation | — | 0
Knowledge Migration Framework for Smart Contract Vulnerability Detection | — | 0
ProFe: Communication-Efficient Decentralized Federated Learning via Distillation and Prototypes | — | 0
Wearable Accelerometer Foundation Models for Health via Knowledge Distillation | — | 0
Leveraging Large Language Models for Active Merchant Non-player Characters | Code | 0
On Distilling the Displacement Knowledge for Few-Shot Class-Incremental Learning | — | 0
Redefining Normal: A Novel Object-Level Approach for Multi-Object Novelty Detection | Code | 0
Can Students Beyond The Teacher? Distilling Knowledge from Teacher's Bias | — | 0
LLM Distillation for Efficient Few-Shot Multiple Choice Question Answering | — | 0
ScaleOT: Privacy-utility-scalable Offsite-tuning with Dynamic LayerReplace and Selective Rank Compression | — | 0
Dynamic Contrastive Knowledge Distillation for Efficient Image Restoration | Code | 1
Optimising TinyML with Quantization and Distillation of Transformer and Mamba Models for Indoor Localisation on Edge Devices | — | 0
Multimodal Industrial Anomaly Detection by Crossmodal Reverse Distillation | Code | 0
All You Need in Knowledge Distillation Is a Tailored Coordinate System | — | 0
SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training | — | 0
DASK: Distribution Rehearsing via Adaptive Style Kernel Learning for Exemplar-Free Lifelong Person Re-Identification | Code | 0
A Theoretical Analysis of Soft-Label vs Hard-Label Training in Neural Networks | — | 0
Efficient Gravitational Wave Parameter Estimation via Knowledge Distillation: A ResNet1D-IAF Approach | — | 0
Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation | Code | 2
DAKD: Data Augmentation and Knowledge Distillation using Diffusion Models for SAR Oil Spill Segmentation | — | 0
Cloud Object Detector Adaptation by Integrating Different Source Knowledge | Code | 1
TT-MPD: Test Time Model Pruning and Distillation | — | 0
Unlocking the Potential of Reverse Distillation for Anomaly Detection | Code | 1
FM2DS: Few-Shot Multimodal Multihop Data Synthesis with Knowledge Distillation for Question Answering | Code | 0
U-Know-DiffPAN: An Uncertainty-aware Knowledge Distillation Diffusion Framework with Details Enhancement for PAN-Sharpening | — | 0
Domain-Specific Translation with Open-Source Large Language Models: Resource-Oriented Analysis | — | 0
Enhancing Content Representation for AR Image Quality Assessment Using Knowledge Distillation | — | 0
Neighborhood Commonality-aware Evolution Network for Continuous Generalized Category Discovery | Code | 0
CCS: Continuous Learning for Customized Incremental Wireless Sensing Services | — | 0
BEExformer: A Fast Inferencing Transformer Architecture via Binarization with Multiple Early Exits | — | 0
One-shot Federated Learning via Synthetic Distiller-Distillate Communication | Code | 1
Page 10 of 85

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | — | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | — | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | — | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | — | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | — | Unverified
6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | — | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | — | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | — | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | — | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | — | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | — | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | — | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | — | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | — | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | — | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | — | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | — | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | — | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | — | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | — | Unverified