Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model (the teacher) to a smaller one (the student). Large models, such as very deep neural networks or ensembles of many models, have higher knowledge capacity than small models, but this capacity might not be fully utilized.
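As a rough illustration of how such a transfer can be set up, the sketch below uses one common formulation: the small model (student) is trained to match the temperature-softened output distribution of the large model (teacher), alongside the usual cross-entropy on the true label. This page does not prescribe a specific loss, so the soft-target formulation, the function names, and the `temperature` and `alpha` values are all assumptions chosen for illustration.

```python
import math

def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax; higher temperature gives a softer distribution."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def distillation_loss(student_logits, teacher_logits, label,
                      temperature=4.0, alpha=0.5):
    """Weighted sum of a soft-target term and a hard-label term.

    temperature and alpha are illustrative hyperparameters (assumed values).
    """
    soft_teacher = softmax(teacher_logits, temperature)
    soft_student = softmax(student_logits, temperature)
    # The T^2 factor keeps the soft term's gradient scale comparable across temperatures.
    soft_term = (temperature ** 2) * kl_divergence(soft_teacher, soft_student)
    # Standard cross-entropy of the student against the ground-truth label.
    hard_term = -math.log(softmax(student_logits)[label])
    return alpha * soft_term + (1 - alpha) * hard_term

# Toy logits for a 3-class problem (made-up numbers).
teacher = [8.0, 2.0, -1.0]
close_student = [7.5, 2.2, -0.8]
far_student = [0.5, 3.0, 1.0]

loss_close = distillation_loss(close_student, teacher, label=0)
loss_far = distillation_loss(far_student, teacher, label=0)
print(f"close student loss: {loss_close:.4f}")
print(f"far student loss:   {loss_far:.4f}")
```

A student whose logits track the teacher's receives a lower loss than one that disagrees with it. In practice the same computation would be done on batched tensors in a deep learning framework rather than on Python lists.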

Papers

Showing 601–625 of 4240 papers

Title	Date	Tasks	Status	Hype
AlignCap: Aligning Speech Emotion Captioning to Human Preferences	Oct 24, 2024	Knowledge Distillation, Language Modeling	Unverified	0
SIKeD: Self-guided Iterative Knowledge Distillation for mathematical reasoning	Oct 24, 2024	Knowledge Distillation, Mathematical Reasoning	Code Available	0
High-dimensional Analysis of Knowledge Distillation: Weak-to-Strong Generalization and Scaling Laws	Oct 24, 2024	Knowledge Distillation, regression	Unverified	0
Knowledge Distillation Using Frontier Open-source LLMs: Generalizability and the Role of Synthetic Data	Oct 24, 2024	Knowledge Distillation, Natural Language Understanding	Unverified	0
Towards Active Participant-Centric Vertical Federated Learning: Some Representations May Be All You Need	Oct 23, 2024	All, Federated Learning	Unverified	0
ELAICHI: Enhancing Low-resource TTS by Addressing Infrequent and Low-frequency Character Bigrams	Oct 23, 2024	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
Towards Effective Data-Free Knowledge Distillation via Diverse Diffusion Augmentation	Oct 23, 2024	Data-free Knowledge Distillation, Diversity	Code Available	0
SafetyAnalyst: Interpretable, Transparent, and Steerable Safety Moderation for AI Behavior	Oct 22, 2024	Knowledge Distillation	Unverified	0
AttriPrompter: Auto-Prompting with Attribute Semantics for Zero-shot Nuclei Detection via Visual-Language Pre-trained Models	Oct 22, 2024	Attribute, Knowledge Distillation	Code Available	0
CK4Gen: A Knowledge Distillation Framework for Generating High-Utility Synthetic Survival Datasets in Healthcare	Oct 22, 2024	Data Augmentation, Knowledge Distillation	Unverified	0
MiniPLM: Knowledge Distillation for Pre-Training Language Models	Oct 22, 2024	Diversity, Knowledge Distillation	Code Available	2
Model Mimic Attack: Knowledge Distillation for Provably Transferable Adversarial Examples	Oct 21, 2024	Knowledge Distillation	Unverified	0
Pre-training Distillation for Large Language Models: A Design Space Exploration	Oct 21, 2024	Knowledge Distillation	Unverified	0
GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning	Oct 20, 2024	Image Retrieval, Image-text Retrieval	Code Available	0
Improving Pronunciation and Accent Conversion through Knowledge Distillation And Synthetic Ground-Truth from Native TTS	Oct 19, 2024	Knowledge Distillation	Unverified	0
LLaVA-Ultra: Large Chinese Language and Vision Assistant for Ultrasound	Oct 19, 2024	Instruction Following, Knowledge Distillation	Unverified	0
Interpreting Microbiome Relative Abundance Data Using Symbolic Regression	Oct 18, 2024	Diagnostic, Knowledge Distillation	Code Available	0
DiSCo: LLM Knowledge Distillation for Efficient Sparse Retrieval in Conversational Search	Oct 18, 2024	Conversational Information Access, Conversational Search	Code Available	0
Preview-based Category Contrastive Learning for Knowledge Distillation	Oct 18, 2024	Contrastive Learning, Knowledge Distillation	Unverified	0
Unlearning Backdoor Attacks for LLMs with Weak-to-Strong Knowledge Distillation	Oct 18, 2024	Backdoor Attack, Knowledge Distillation	Code Available	0
CAKD: A Correlation-Aware Knowledge Distillation Framework Based on Decoupling Kullback-Leibler Divergence	Oct 17, 2024	Binary Classification, Knowledge Distillation	Unverified	0
FTSmartAudit: A Knowledge Distillation-Enhanced Framework for Automated Smart Contract Auditing Using Fine-Tuned LLMs	Oct 17, 2024	Dataset Generation, Knowledge Distillation	Unverified	0
Towards Satellite Non-IID Imagery: A Spectral Clustering-Assisted Federated Learning Approach	Oct 17, 2024	Earth Observation, Federated Learning	Unverified	0
An Active Learning Framework for Inclusive Generation by Large Language Models	Oct 17, 2024	Active Learning, Clustering	Unverified	0
Proactive Detection and Calibration of Seasonal Advertisements with Multimodal Large Language Models	Oct 16, 2024	Knowledge Distillation	Unverified	0

Datasets: ImageNet, CIFAR-100, COCO (Common Objects in Context), COCO 2017 val, PASCAL VOC, KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified