SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized. In practice, the smaller student model is trained to reproduce the larger teacher's output distribution (its temperature-softened logits, or "soft targets"), typically in combination with the ordinary supervised loss on ground-truth labels.
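
As a concrete illustration, below is a minimal PyTorch sketch of the standard logit-distillation objective (temperature-scaled KL divergence to the teacher plus cross-entropy to the labels). The temperature T, weight alpha, batch size, and class count are illustrative placeholders, not values taken from any paper listed on this page.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Classic logit distillation: a weighted sum of
    (1) KL divergence between temperature-softened teacher and student distributions, and
    (2) ordinary cross-entropy against the ground-truth labels.
    T and alpha are illustrative hyperparameters."""
    # Soft targets: both distributions are smoothed by the temperature T.
    soft_student = F.log_softmax(student_logits / T, dim=-1)
    soft_teacher = F.softmax(teacher_logits / T, dim=-1)
    # The T**2 factor keeps the soft-target gradients at a comparable scale to the hard-label term.
    kd = F.kl_div(soft_student, soft_teacher, reduction="batchmean") * (T ** 2)
    ce = F.cross_entropy(student_logits, labels)
    return alpha * kd + (1 - alpha) * ce

# Toy usage: one random batch of 8 examples over 10 classes.
teacher_logits = torch.randn(8, 10)                       # would come from the frozen teacher
student_logits = torch.randn(8, 10, requires_grad=True)   # would come from the student being trained
labels = torch.randint(0, 10, (8,))
loss = distillation_loss(student_logits, teacher_logits, labels)
loss.backward()
```

Raising T softens both distributions and exposes more of the teacher's relative class similarities; alpha trades the soft-target term off against the hard-label term.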

Papers

Showing 1701–1750 of 4240 papers

Title | Status | Hype
Overcoming Uncertain Incompleteness for Robust Multimodal Sequential Diagnosis Prediction via Curriculum Data Erasing Guided Knowledge Distillation | Code | 0
Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network | - | 0
Boosting Cross-Domain Point Classification via Distilling Relational Priors from 2D Transformers | Code | 0
FedUD: Exploiting Unaligned Data for Cross-Platform Federated Click-Through Rate Prediction | - | 0
Leveraging Foundation Models via Knowledge Distillation in Multi-Object Tracking: Distilling DINOv2 Features to FairMOT | Code | 0
Peak-Controlled Logits Poisoning Attack in Federated Distillation | - | 0
Separating Novel Features for Logical Anomaly Detection: A Straightforward yet Effective Approach | - | 0
How to Train the Teacher Model for Effective Knowledge Distillation | Code | 0
NC-NCD: Novel Class Discovery for Node Classification | Code | 0
CoMoTo: Unpaired Cross-Modal Lesion Distillation Improves Breast Lesion Detection in Tomosynthesis | Code | 0
DDK: Distilling Domain Knowledge for Efficient Large Language Models | - | 0
Disentangling spatio-temporal knowledge for weakly supervised object detection and segmentation in surgical video | Code | 0
Generalizing Teacher Networks for Effective Knowledge Distillation Across Student Architectures | Code | 0
Synthetic Image Learning: Preserving Performance and Preventing Membership Inference Attacks | - | 0
Comprehensive Study on Performance Evaluation and Optimization of Model Compression: Bridging Traditional Deep Learning and Large Language Models | - | 0
SeqMIA: Sequential-Metric Based Membership Inference Attack | Code | 0
Distilling Vision-Language Foundation Models: A Data-Free Approach via Prompt Diversification | - | 0
Teach Harder, Learn Poorer: Rethinking Hard Sample Distillation for GNN-to-MLP Knowledge Distillation | Code | 0
Continual Panoptic Perception: Towards Multi-modal Incremental Interpretation of Remote Sensing Images | Code | 0
DFMSD: Dual Feature Masking Stage-wise Knowledge Distillation for Object Detection | - | 0
Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation | - | 0
QuIIL at T3 challenge: Towards Automation in Life-Saving Intervention Procedures from First-Person View | Code | 0
Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation | Code | 0
Continual Distillation Learning: Knowledge Distillation in Prompt-based Continual Learning | - | 0
Discovery of novel antimicrobial peptides with notable antibacterial potency by a LLM-based foundation model | - | 0
Learning Modality-agnostic Representation for Semantic Segmentation from Any Modalities | - | 0
MapDistill: Boosting Efficient Camera-based HD Map Construction via Camera-LiDAR Fusion Model Distillation | - | 0
Leave No Knowledge Behind During Knowledge Distillation: Towards Practical and Effective Knowledge Distillation for Code-Switching ASR Using Realistic Data | - | 0
Don't Throw Away Data: Better Sequence Knowledge Distillation | - | 0
Multi-Granularity Semantic Revision for Large Language Model Distillation | - | 0
Enhancing Weakly-Supervised Histopathology Image Segmentation with Knowledge Distillation on MIL-Based Pseudo-Labels | Code | 0
Background Adaptation with Residual Modeling for Exemplar-Free Class-Incremental Semantic Segmentation | - | 0
Minimizing PLM-Based Few-Shot Intent Detectors | Code | 0
Uplifting Range-View-based 3D Semantic Segmentation in Real-Time with Multi-Sensor Fusion | - | 0
From Easy to Hard: Learning Curricular Shape-aware Features for Robust Panoptic Scene Graph Generation | - | 0
A Survey on Symbolic Knowledge Distillation of Large Language Models | - | 0
3M-Health: Multimodal Multi-Teacher Knowledge Distillation for Mental Health Detection | Code | 0
SlideGCD: Slide-based Graph Collaborative Training with Knowledge Distillation for Whole Slide Image Classification | Code | 0
Knowledge distillation to effectively attain both region-of-interest and global semantics from an image where multiple objects appear | Code | 0
Adaptive Deep Iris Feature Extractor at Arbitrary Resolutions | - | 0
A Guide To Effectively Leveraging LLMs for Low-Resource Text Summarization: Data Augmentation and Semi-supervised Approaches | - | 0
LokiLM: Technical Report | - | 0
HDKD: Hybrid Data-Efficient Knowledge Distillation Network for Medical Image Classification | Code | 0
Less is More: Efficient Brain-Inspired Learning for Autonomous Driving Trajectory Prediction | - | 0
Enhancing Low-Resource NMT with a Multilingual Encoder and Knowledge Distillation: A Case Study | Code | 0
Reprogramming Distillation for Medical Foundation Models | Code | 0
DεpS: Delayed ε-Shrinking for Faster Once-For-All Training | - | 0
Federated Knowledge Transfer Fine-tuning Large Server Model with Resource-Constrained IoT Clients | - | 0
Topological Persistence Guided Knowledge Distillation for Wearable Sensor Data | - | 0
Leveraging Topological Guidance for Improved Knowledge Distillation | Code | 0
Page 35 of 85

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | - | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | - | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | - | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | - | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | - | Unverified
6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | - | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | - | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | - | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | - | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | - | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | - | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | - | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | - | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | - | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | - | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | - | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | - | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | - | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | - | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | - | Unverified