Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.
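One common way to carry out this transfer is to train the small (student) model to match the large (teacher) model's temperature-softened output distribution, alongside the usual ground-truth labels. The sketch below illustrates that soft-target formulation in plain Python. It is an assumption for illustration, not a method this page prescribes: the function names, the `temperature` and `alpha` parameters, and their default values are all hypothetical.

```python
import math


def log_softmax(logits, temperature=1.0):
    """Log of the temperature-scaled softmax, computed stably."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max before exponentiating for stability
    log_sum = m + math.log(sum(math.exp(s - m) for s in scaled))
    return [s - log_sum for s in scaled]


def distillation_loss(student_logits, teacher_logits, label,
                      temperature=4.0, alpha=0.5):
    """Illustrative soft-target loss (hypothetical helper, assumed defaults).

    Weighted sum of:
      - KL(teacher || student) on temperature-softened distributions,
        scaled by temperature**2 so its gradient magnitude stays comparable
        to the hard-label term as the temperature changes
      - cross-entropy of the student against the ground-truth label
    """
    log_p_teacher = log_softmax(teacher_logits, temperature)
    log_p_student = log_softmax(student_logits, temperature)
    kl = sum(math.exp(lt) * (lt - ls)
             for lt, ls in zip(log_p_teacher, log_p_student))
    cross_entropy = -log_softmax(student_logits)[label]
    return alpha * temperature ** 2 * kl + (1 - alpha) * cross_entropy


# A student that agrees with the teacher pays only the hard-label term.
matched = distillation_loss([2.0, 0.0], [2.0, 0.0], label=0)
# A student that disagrees also pays the KL term, so its loss is larger.
mismatched = distillation_loss([0.0, 2.0], [2.0, 0.0], label=0)
print(matched < mismatched)
```

A higher `temperature` spreads the teacher's probability mass over more classes, which exposes more of the relative similarities the teacher has learned. `alpha` trades off imitation of the teacher against fitting the labels.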

Papers


Showing 1726–1750 of 4240 papers

Title	Date	Tasks	Status
Learning Modality-agnostic Representation for Semantic Segmentation from Any Modalities	Jul 16, 2024	Knowledge Distillation, Semantic Segmentation	Unverified
MapDistill: Boosting Efficient Camera-based HD Map Construction via Camera-LiDAR Fusion Model Distillation	Jul 16, 2024	Autonomous Driving, Knowledge Distillation	Unverified
Leave No Knowledge Behind During Knowledge Distillation: Towards Practical and Effective Knowledge Distillation for Code-Switching ASR Using Realistic Data	Jul 15, 2024	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Don't Throw Away Data: Better Sequence Knowledge Distillation	Jul 15, 2024	Diversity, Knowledge Distillation	Unverified
Multi-Granularity Semantic Revision for Large Language Model Distillation	Jul 14, 2024	Knowledge Distillation, Language Modeling	Unverified
Enhancing Weakly-Supervised Histopathology Image Segmentation with Knowledge Distillation on MIL-Based Pseudo-Labels	Jul 14, 2024	Image Segmentation, Knowledge Distillation	Code Available
Background Adaptation with Residual Modeling for Exemplar-Free Class-Incremental Semantic Segmentation	Jul 13, 2024	Class-Incremental Semantic Segmentation, Exemplar-Free	Unverified
Minimizing PLM-Based Few-Shot Intent Detectors	Jul 13, 2024	Data Augmentation, Knowledge Distillation	Code Available
Uplifting Range-View-based 3D Semantic Segmentation in Real-Time with Multi-Sensor Fusion	Jul 12, 2024	3D Semantic Segmentation, Autonomous Driving	Unverified
From Easy to Hard: Learning Curricular Shape-aware Features for Robust Panoptic Scene Graph Generation	Jul 12, 2024	Graph Generation, Knowledge Distillation	Unverified
A Survey on Symbolic Knowledge Distillation of Large Language Models	Jul 12, 2024	Knowledge Distillation, Survey	Unverified
3M-Health: Multimodal Multi-Teacher Knowledge Distillation for Mental Health Detection	Jul 12, 2024	Knowledge Distillation, Social Media Mental Health Detection	Code Available
SlideGCD: Slide-based Graph Collaborative Training with Knowledge Distillation for Whole Slide Image Classification	Jul 12, 2024	graph construction, Graph Learning	Code Available
Knowledge distillation to effectively attain both region-of-interest and global semantics from an image where multiple objects appear	Jul 11, 2024	Knowledge Distillation, object-detection	Code Available
Adaptive Deep Iris Feature Extractor at Arbitrary Resolutions	Jul 11, 2024	Iris Recognition, Knowledge Distillation	Unverified
A Guide To Effectively Leveraging LLMs for Low-Resource Text Summarization: Data Augmentation and Semi-supervised Approaches	Jul 10, 2024	Abstractive Text Summarization, Data Augmentation	Unverified
LokiLM: Technical Report	Jul 10, 2024	Knowledge Distillation, Language Modeling	Unverified
HDKD: Hybrid Data-Efficient Knowledge Distillation Network for Medical Image Classification	Jul 10, 2024	Computational Efficiency, image-classification	Code Available
Less is More: Efficient Brain-Inspired Learning for Autonomous Driving Trajectory Prediction	Jul 9, 2024	Autonomous Driving, Decision Making	Unverified
Enhancing Low-Resource NMT with a Multilingual Encoder and Knowledge Distillation: A Case Study	Jul 9, 2024	Knowledge Distillation, Language Modeling	Code Available
Reprogramming Distillation for Medical Foundation Models	Jul 9, 2024	Knowledge Distillation, Lightweight Deployment	Code Available
DεpS: Delayed ε-Shrinking for Faster Once-For-All Training	Jul 8, 2024	All, GPU	Unverified
Federated Knowledge Transfer Fine-tuning Large Server Model with Resource-Constrained IoT Clients	Jul 7, 2024	Federated Learning, Knowledge Distillation	Unverified
Topological Persistence Guided Knowledge Distillation for Wearable Sensor Data	Jul 7, 2024	Activity Recognition, Deep Learning	Unverified
Leveraging Topological Guidance for Improved Knowledge Distillation	Jul 7, 2024	image-classification, Image Classification	Code Available


Datasets: ImageNet, CIFAR-100, COCO (Common Objects in Context), COCO 2017 val, PASCAL VOC, KITTI

Benchmark Results

ImageNet

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

CIFAR-100

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

COCO (Common Objects in Context)

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

COCO 2017 val

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

PASCAL VOC

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

KITTI

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified