Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1226–1250 of 4240 papers

Title	Date	Tasks	Status	Hype
Enhanced Sparsification via Stimulative Training	Mar 11, 2024	Knowledge DistillationModel Compression	—Unverified	0
Answering Diverse Questions via Text Attached with Key Audio-Visual Clues	Mar 11, 2024	Audio-visual Question AnsweringAudio-Visual Question Answering (AVQA)	CodeCode Available	0
Attention is all you need for boosting graph convolutional neural network	Mar 10, 2024	AllKnowledge Distillation	—Unverified	0
Bit-mask Robust Contrastive Knowledge Distillation for Unsupervised Semantic Hashing	Mar 10, 2024	Image RetrievalKnowledge Distillation	CodeCode Available	1
Knowledge Distillation of Convolutional Neural Networks through Feature Map Transformation using Decision Trees	Mar 10, 2024	Knowledge Distillation	—Unverified	0
V_kD: Improving Knowledge Distillation using Orthogonal Projections	Mar 10, 2024	Image GenerationKnowledge Distillation	CodeCode Available	2
Cooperative Classification and Rationalization for Graph Generalization	Mar 10, 2024	ClassificationGraph Classification	CodeCode Available	0
Weakly Supervised Change Detection via Knowledge Distillation and Multiscale Sigmoid Inference	Mar 9, 2024	Change DetectionKnowledge Distillation	CodeCode Available	0
Frequency Attention for Knowledge Distillation	Mar 9, 2024	image-classificationImage Classification	CodeCode Available	1
Scene Graph Aided Radiology Report Generation	Mar 8, 2024	DecoderKnowledge Distillation	—Unverified	0
Fine-tuning a Multiple Instance Learning Feature Extractor with Masked Context Modelling and Knowledge Distillation	Mar 8, 2024	Image GenerationKnowledge Distillation	—Unverified	0
Attention-guided Feature Distillation for Semantic Segmentation	Mar 8, 2024	Knowledge DistillationSegmentation	—Unverified	0
Adversarial Sparse Teacher: Defense Against Distillation-Based Model Stealing Attacks Using Adversarial Examples	Mar 8, 2024	Knowledge Distillation	—Unverified	0
RadarDistill: Boosting Radar-based Object Detection Performance via Knowledge Distillation from LiDAR Features	Mar 8, 2024	3D Object DetectionKnowledge Distillation	CodeCode Available	1
Self-Adapting Large Visual-Language Models to Edge Devices across Visual Modalities	Mar 7, 2024	Contrastive LearningKnowledge Distillation	CodeCode Available	1
Privacy-preserving Fine-tuning of Large Language Models through Flatness	Mar 7, 2024	Knowledge DistillationPrivacy Preserving	—Unverified	0
A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition	Mar 7, 2024	Audio-Visual Speech RecognitionKnowledge Distillation	CodeCode Available	0
MKF-ADS: Multi-Knowledge Fusion Based Self-supervised Anomaly Detection System for Control Area Network	Mar 7, 2024	Anomaly DetectionIntrusion Detection	—Unverified	0
Can Small Language Models be Good Reasoners for Sequential Recommendation?	Mar 7, 2024	Knowledge DistillationRecommendation Systems	—Unverified	0
A Teacher-Free Graph Knowledge Distillation Framework with Dual Self-Distillation	Mar 6, 2024	Knowledge Distillation	CodeCode Available	0
Learning to Maximize Mutual Information for Chain-of-Thought Distillation	Mar 5, 2024	Knowledge DistillationLanguage Modeling	CodeCode Available	0
PromptKD: Unsupervised Prompt Distillation for Vision-Language Models	Mar 5, 2024	Knowledge DistillationPrompt Engineering	CodeCode Available	3
JEP-KD: Joint-Embedding Predictive Architecture Based Knowledge Distillation for Visual Speech Recognition	Mar 4, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
Distilled ChatGPT Topic & Sentiment Modeling with Applications in Finance	Mar 4, 2024	Knowledge DistillationSentiment Analysis	—Unverified	0
UB-FineNet: Urban Building Fine-grained Classification Network for Open-access Satellite Images	Mar 4, 2024	ClassificationDenoising	—Unverified	0

Show:10 25 50

← PrevPage 50 of 170Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified