Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.
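
As a rough illustration of the transfer described above, the sketch below computes a soft-target distillation loss in plain Python. This is the widely used temperature-scaled formulation, not a method taken from any paper listed on this page. The function names and the `temperature` and `alpha` defaults are assumptions chosen for the example.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, label,
                      temperature=4.0, alpha=0.5):
    """Weighted sum of a soft-target term and a hard-label term.

    temperature and alpha are illustrative defaults (assumptions),
    not values prescribed by the source page.
    """
    # Soft-target term: KL(teacher || student) at temperature T,
    # scaled by T^2 so its gradient magnitude stays comparable.
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    kl = sum(p * math.log(p / q)
             for p, q in zip(p_teacher, p_student) if p > 0)

    # Hard-label term: ordinary cross-entropy at temperature 1.
    ce = -math.log(softmax(student_logits)[label])

    return alpha * temperature ** 2 * kl + (1 - alpha) * ce
```

When the student's logits match the teacher's, the soft-target term is zero and only the cross-entropy term remains. A higher `temperature` flattens both distributions, so the student also learns from the relative probabilities the teacher assigns to incorrect classes.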

Papers

Title	Date	Tasks	Status	Hype
NC-NCD: Novel Class Discovery for Node Classification	Jul 25, 2024	Classification, Knowledge Distillation	Code Available	0
Separating Novel Features for Logical Anomaly Detection: A Straightforward yet Effective Approach	Jul 25, 2024	Anomaly Detection, Knowledge Distillation	Unverified	0
How to Train the Teacher Model for Effective Knowledge Distillation	Jul 25, 2024	Knowledge Distillation	Code Available	0
CoMoTo: Unpaired Cross-Modal Lesion Distillation Improves Breast Lesion Detection in Tomosynthesis	Jul 24, 2024	Knowledge Distillation, Lesion Detection	Code Available	0
DDK: Distilling Domain Knowledge for Efficient Large Language Models	Jul 23, 2024	Knowledge Distillation	Unverified	0
OriGen: Enhancing RTL Code Generation with Code-to-Code Augmentation and Self-Reflection	Jul 23, 2024	Code Generation, Knowledge Distillation	Code Available	1
Generalizing Teacher Networks for Effective Knowledge Distillation Across Student Architectures	Jul 22, 2024	Knowledge Distillation, Model Compression	Code Available	0
Comprehensive Study on Performance Evaluation and Optimization of Model Compression: Bridging Traditional Deep Learning and Large Language Models	Jul 22, 2024	Deep Learning, image-classification	Unverified	0
Disentangling spatio-temporal knowledge for weakly supervised object detection and segmentation in surgical video	Jul 22, 2024	Disentanglement, Knowledge Distillation	Code Available	0
Synthetic Image Learning: Preserving Performance and Preventing Membership Inference Attacks	Jul 22, 2024	Knowledge Distillation	Unverified	0
Distilling Vision-Language Foundation Models: A Data-Free Approach via Prompt Diversification	Jul 21, 2024	Data-free Knowledge Distillation, Image Generation	Unverified	0
SeqMIA: Sequential-Metric Based Membership Inference Attack	Jul 21, 2024	Inference Attack, Knowledge Distillation	Code Available	0
Teach Harder, Learn Poorer: Rethinking Hard Sample Distillation for GNN-to-MLP Knowledge Distillation	Jul 20, 2024	Knowledge Distillation	Code Available	0
Compact Language Models via Pruning and Knowledge Distillation	Jul 19, 2024	Knowledge Distillation, Language Modeling	Code Available	3
Continual Panoptic Perception: Towards Multi-modal Incremental Interpretation of Remote Sensing Images	Jul 19, 2024	Caption Generation, Continual Learning	Code Available	0
ESP-MedSAM: Efficient Self-Prompting SAM for Universal Domain-Generalized Medical Image Segmentation	Jul 19, 2024	Decoder, Image Segmentation	Code Available	2
Knowledge Distillation Approaches for Accurate and Efficient Recommender System	Jul 19, 2024	Knowledge Distillation, Recommendation Systems	Code Available	1
Continual Distillation Learning: Knowledge Distillation in Prompt-based Continual Learning	Jul 18, 2024	Continual Learning, Knowledge Distillation	Unverified	0
DFMSD: Dual Feature Masking Stage-wise Knowledge Distillation for Object Detection	Jul 18, 2024	Knowledge Distillation, Object	Unverified	0
QuIIL at T3 challenge: Towards Automation in Life-Saving Intervention Procedures from First-Person View	Jul 18, 2024	Action Anticipation, Action Recognition	Code Available	0
Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation	Jul 18, 2024	Knowledge Distillation, Representation Learning	Unverified	0
Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation	Jul 18, 2024	Knowledge Distillation, Semantic Segmentation	Code Available	0
Discovery of novel antimicrobial peptides with notable antibacterial potency by a LLM-based foundation model	Jul 17, 2024	Knowledge Distillation, scientific discovery	Unverified	0
Exploring Deeper! Segment Anything Model with Depth Perception for Camouflaged Object Detection	Jul 17, 2024	Knowledge Distillation, object-detection	Code Available	1
Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection	Jul 16, 2024	Knowledge Distillation, object-detection	Code Available	1
Learning Modality-agnostic Representation for Semantic Segmentation from Any Modalities	Jul 16, 2024	Knowledge Distillation, Semantic Segmentation	Unverified	0
MapDistill: Boosting Efficient Camera-based HD Map Construction via Camera-LiDAR Fusion Model Distillation	Jul 16, 2024	Autonomous Driving, Knowledge Distillation	Unverified	0
Mitigating Background Shift in Class-Incremental Semantic Segmentation	Jul 16, 2024	Class Incremental Learning, Class-Incremental Semantic Segmentation	Code Available	1
Progressive Pretext Task Learning for Human Trajectory Prediction	Jul 16, 2024	Knowledge Distillation, Prediction	Code Available	2
Relational Representation Distillation	Jul 16, 2024	Computational Efficiency, Contrastive Learning	Code Available	1
Discriminative and Consistent Representation Distillation	Jul 16, 2024	Causal Inference, Contrastive Learning	Code Available	1
Don't Throw Away Data: Better Sequence Knowledge Distillation	Jul 15, 2024	Diversity, Knowledge Distillation	Unverified	0
Leave No Knowledge Behind During Knowledge Distillation: Towards Practical and Effective Knowledge Distillation for Code-Switching ASR Using Realistic Data	Jul 15, 2024	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
Accessing Vision Foundation Models at ImageNet-level Costs	Jul 15, 2024	Knowledge Distillation, Transfer Learning	Code Available	2
Multi-Granularity Semantic Revision for Large Language Model Distillation	Jul 14, 2024	Knowledge Distillation, Language Modeling	Unverified	0
Enhancing Weakly-Supervised Histopathology Image Segmentation with Knowledge Distillation on MIL-Based Pseudo-Labels	Jul 14, 2024	Image Segmentation, Knowledge Distillation	Code Available	0
LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection	Jul 14, 2024	3D Object Detection, Depth Estimation	Code Available	1
Minimizing PLM-Based Few-Shot Intent Detectors	Jul 13, 2024	Data Augmentation, Knowledge Distillation	Code Available	0
Background Adaptation with Residual Modeling for Exemplar-Free Class-Incremental Semantic Segmentation	Jul 13, 2024	Class-Incremental Semantic Segmentation, Exemplar-Free	Unverified	0
A Survey on Symbolic Knowledge Distillation of Large Language Models	Jul 12, 2024	Knowledge Distillation, Survey	Unverified	0
Uplifting Range-View-based 3D Semantic Segmentation in Real-Time with Multi-Sensor Fusion	Jul 12, 2024	3D Semantic Segmentation, Autonomous Driving	Unverified	0
BKDSNN: Enhancing the Performance of Learning-based Spiking Neural Networks Training with Blurred Knowledge Distillation	Jul 12, 2024	Knowledge Distillation	Code Available	1
SlideGCD: Slide-based Graph Collaborative Training with Knowledge Distillation for Whole Slide Image Classification	Jul 12, 2024	graph construction, Graph Learning	Code Available	0
3M-Health: Multimodal Multi-Teacher Knowledge Distillation for Mental Health Detection	Jul 12, 2024	Knowledge Distillation, Social Media Mental Health Detection	Code Available	0
From Easy to Hard: Learning Curricular Shape-aware Features for Robust Panoptic Scene Graph Generation	Jul 12, 2024	Graph Generation, Knowledge Distillation	Unverified	0
Adaptive Deep Iris Feature Extractor at Arbitrary Resolutions	Jul 11, 2024	Iris Recognition, Knowledge Distillation	Unverified	0
Knowledge distillation to effectively attain both region-of-interest and global semantics from an image where multiple objects appear	Jul 11, 2024	Knowledge Distillation, object-detection	Code Available	0
LokiLM: Technical Report	Jul 10, 2024	Knowledge Distillation, Language Modeling	Unverified	0
HDKD: Hybrid Data-Efficient Knowledge Distillation Network for Medical Image Classification	Jul 10, 2024	Computational Efficiency, image-classification	Code Available	0
A Guide To Effectively Leveraging LLMs for Low-Resource Text Summarization: Data Augmentation and Semi-supervised Approaches	Jul 10, 2024	Abstractive Text Summarization, Data Augmentation	Unverified	0

Datasets: ImageNet, CIFAR-100, COCO (Common Objects in Context), COCO 2017 val, PASCAL VOC, KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified