Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model (the teacher) to a smaller one (the student). Large models, such as very deep neural networks or ensembles of many models, have a higher knowledge capacity than small models, but this capacity might not be fully utilized.
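As an illustration, here is a minimal sketch of one common way to carry out this transfer: the temperature-softened soft-target loss introduced by Hinton et al. The page itself does not prescribe a method, so the function names and the `temperature` and `alpha` defaults below are assumptions chosen for the example, not values from any listed paper.

```python
import math


def log_softmax(logits, temperature=1.0):
    """Numerically stable log-softmax of logits divided by a temperature."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    log_norm = m + math.log(sum(math.exp(s - m) for s in scaled))
    return [s - log_norm for s in scaled]


def distillation_loss(student_logits, teacher_logits, label,
                      temperature=4.0, alpha=0.5):
    """Weighted sum of a soft-target term and a hard-label term.

    temperature and alpha are illustrative defaults (assumptions).
    """
    # Soft term: KL(teacher || student) on temperature-softened distributions.
    log_p_teacher = log_softmax(teacher_logits, temperature)
    log_p_student = log_softmax(student_logits, temperature)
    kl = sum(math.exp(lt) * (lt - ls)
             for lt, ls in zip(log_p_teacher, log_p_student))
    # Hard term: ordinary cross-entropy against the ground-truth class index.
    hard = -log_softmax(student_logits)[label]
    # The T^2 factor keeps the soft-term gradients on a comparable scale.
    return alpha * temperature ** 2 * kl + (1.0 - alpha) * hard


teacher = [2.0, 1.0, 0.1]
student = [0.1, 1.0, 2.0]
print(distillation_loss(student, teacher, label=0))
```

When the student's logits match the teacher's, the soft term vanishes and only the hard-label cross-entropy remains, which is a quick sanity check for an implementation like this.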

Papers


Showing 76–100 of 4240 papers

Title	Date	Tasks	Status	Hype
Holistic White-light Polyp Classification via Alignment-free Dense Distillation of Auxiliary Optical Chromoendoscopy	May 25, 2025	Diagnostic, Knowledge Distillation	Code Available	0
Tokenizing Electron Cloud in Protein-Ligand Interaction Learning	May 25, 2025	Knowledge Distillation, Prediction	Unverified	0
Knowledge Grafting of Large Language Models	May 24, 2025	Continual Learning, Knowledge Distillation	Code Available	0
C3R: Channel Conditioned Cell Representations for unified evaluation in microscopy imaging	May 24, 2025	Knowledge Distillation	Unverified	0
Single Snapshot Distillation for Phase Coded Mask Design in Phase Retrieval	May 23, 2025	global-optimization, Knowledge Distillation	Unverified	0
ToDi: Token-wise Distillation via Fine-Grained Divergence Control	May 22, 2025	Instruction Following, Knowledge Distillation	Unverified	0
On Multilingual Encoder Language Model Compression for Low-Resource Languages	May 22, 2025	Knowledge Distillation, Language Modeling	Unverified	0
SEDD-PCC: A Single Encoder-Dual Decoder Framework For End-To-End Learned Point Cloud Compression	May 22, 2025	Attribute, Decoder	Unverified	0
MentalMAC: Enhancing Large Language Models for Detecting Mental Manipulation via Multi-Task Anti-Curriculum Distillation	May 21, 2025	Knowledge Distillation	Unverified	0
Deliberation on Priors: Trustworthy Reasoning of Large Language Models on Knowledge Graphs	May 21, 2025	Knowledge Distillation, Knowledge Graphs	Code Available	1
On the Generalization vs Fidelity Paradox in Knowledge Distillation	May 21, 2025	Knowledge Distillation, Transfer Learning	Code Available	0
An Efficient Private GPT Never Autoregressively Decodes	May 21, 2025	Knowledge Distillation	Unverified	0
DeepKD: A Deeply Decoupled and Denoised Knowledge Distillation Trainer	May 21, 2025	Denoising, Knowledge Distillation	Code Available	1
UWSAM: Segment Anything Model Guided Underwater Instance Segmentation and A Large-scale Benchmark Dataset	May 21, 2025	Instance Segmentation, Knowledge Distillation	Code Available	1
Bridge the Gap between Past and Future: Siamese Model Optimization for Context-Aware Document Ranking	May 20, 2025	Document Ranking, Information Retrieval	Unverified	0
Intra-class Patch Swap for Self-Distillation	May 20, 2025	image-classification, Image Classification	Code Available	0
Interpretable Traces, Unexpected Outcomes: Investigating the Disconnect in Trace-Based Knowledge Distillation	May 20, 2025	Information Retrieval, Knowledge Distillation	Unverified	0
Ground-V: Teaching VLMs to Ground Complex Instructions in Pixels	May 20, 2025	Instruction Following, Knowledge Distillation	Unverified	0
Improved Methods for Model Pruning and Knowledge Distillation	May 20, 2025	Knowledge Distillation	Unverified	0
Bridging the Modality Gap: Enhancing Channel Prediction with Semantically Aligned LLMs and Knowledge Distillation	May 19, 2025	Knowledge Distillation, Prediction	Unverified	0
SMOTExT: SMOTE meets Large Language Models	May 19, 2025	Cross-Modal Retrieval, Data Augmentation	Code Available	0
A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone	May 19, 2025	Knowledge Distillation, Transfer Learning	Code Available	1
Towards Low-Latency Event Stream-based Visual Object Tracking: A Slow-Fast Approach	May 19, 2025	Knowledge Distillation, Object Tracking	Code Available	0
Robust Multimodal Segmentation with Representation Regularization and Hybrid Prototype Distillation	May 19, 2025	Knowledge Distillation, Semantic Segmentation	Code Available	0
ORQA: A Benchmark and Foundation Model for Holistic Operating Room Modeling	May 19, 2025	Graph Generation, Knowledge Distillation	Unverified	0


Datasets: ImageNet, CIFAR-100, COCO (Common Objects in Context), COCO 2017 val, PASCAL VOC, KITTI

Benchmark Results

ImageNet

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

CIFAR-100

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

COCO (Common Objects in Context)

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

COCO 2017 val

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

PASCAL VOC

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

KITTI

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified