SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.
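A common starting point is the soft-label objective of Hinton et al. (2015), in which the student is trained to match the teacher's temperature-softened output distribution alongside the ground-truth labels. The sketch below is a minimal PyTorch illustration of that loss; the temperature and weighting defaults are illustrative assumptions, not values taken from any paper listed on this page.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.9):
    """Soft-label knowledge distillation (Hinton et al., 2015).

    Combines a KL term between the temperature-softened teacher and
    student distributions with the usual cross-entropy on hard labels.
    `temperature` and `alpha` are illustrative defaults, not values
    from any specific paper above.
    """
    # Soften both distributions; the T^2 factor rescales gradients so
    # the KL term stays on a comparable scale to the cross-entropy.
    soft_targets = F.softmax(teacher_logits / temperature, dim=-1)
    log_student = F.log_softmax(student_logits / temperature, dim=-1)
    kd = F.kl_div(log_student, soft_targets,
                  reduction="batchmean") * temperature ** 2

    # Standard supervised loss on the ground-truth labels.
    ce = F.cross_entropy(student_logits, labels)
    return alpha * kd + (1.0 - alpha) * ce
```

Higher temperatures expose more of the teacher's "dark knowledge" about inter-class similarity, which is what gives the student a richer training signal than hard labels alone.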

Papers

Showing 1951–2000 of 4240 papers

Title | Status | Hype
On the Surprising Efficacy of Distillation as an Alternative to Pre-Training Small Models | Code | 0
Improve Knowledge Distillation via Label Revision and Data Selection | | 0
Foundation Models for Structural Health Monitoring | Code | 0
Adaptive Affinity-Based Generalization For MRI Imaging Segmentation Across Resource-Limited Settings | | 0
Knowledge Distillation with Multi-granularity Mixture of Priors for Image Super-Resolution | | 0
Federated Distillation: A Survey | | 0
Towards Scalable & Efficient Interaction-Aware Planning in Autonomous Vehicles using Knowledge Distillation | | 0
Task Integration Distillation for Object Detectors | | 0
Class-Incremental Few-Shot Event Detection | | 0
LLM-RadJudge: Achieving Radiologist-Level Evaluation for X-Ray Report Generation | | 0
SUGAR: Pre-training 3D Visual Representations for Robotics | | 0
A Comprehensive Review of Knowledge Distillation in Computer Vision | | 0
Weak-to-Strong 3D Object Detection with X-Ray Distillation | Code | 0
DMSSN: Distilled Mixed Spectral-Spatial Network for Hyperspectral Salient Object Detection | Code | 0
De-confounded Data-free Knowledge Distillation for Handling Distribution Shifts | | 0
GOLD: Generalized Knowledge Distillation via Out-of-Distribution-Guided Language Data Generation | | 0
CRKD: Enhanced Camera-Radar Object Detection with Cross-modality Knowledge Distillation | | 0
I2CKD: Intra- and Inter-Class Knowledge Distillation for Semantic Segmentation | | 0
Enhancing Metaphor Detection through Soft Labels and Target Word Prediction | | 0
Is Modularity Transferable? A Case Study through the Lens of Knowledge Distillation | Code | 0
Oh! We Freeze: Improving Quantized Knowledge Distillation via Signal Propagation Analysis for Large Language Models | | 0
Order of Compression: A Systematic and Optimal Sequence to Combinationally Compress CNN | | 0
From Two-Stream to One-Stream: Efficient RGB-T Tracking via Mutual Prompt Learning and Knowledge Distillation | | 0
Configurable Holography: Towards Display and Scene Adaptation | | 0
Learning to Project for Cross-Task Knowledge Distillation | | 0
Fed-RAC: Resource-Aware Clustering for Tackling Heterogeneity of Participants in Federated Learning | Code | 0
Facilitating Pornographic Text Detection for Open-Domain Dialogue Systems via Knowledge Distillation of Large Language Models | Code | 0
TransformMix: Learning Transformation and Mixing Strategies from Data | | 0
HVDistill: Transferring Knowledge from Images to Point Clouds via Unsupervised Hybrid-View Distillation | Code | 0
Scheduled Knowledge Acquisition on Lightweight Vector Symbolic Architectures for Brain-Computer Interfaces | | 0
TTT-KD: Test-Time Training for 3D Semantic Segmentation through Knowledge Distillation from Foundation Models | | 0
KnFu: Effective Knowledge Fusion | | 0
Multiple Teachers-Meticulous Student: A Domain Adaptive Meta-Knowledge Distillation Model for Medical Image Classification | Code | 0
FlyKD: Graph Knowledge Distillation on the Fly with Curriculum Learning | | 0
LookALike: Human Mimicry based collaborative decision making | | 0
Group-Mix SAM: Lightweight Solution for Industrial Assembly Line Applications | | 0
Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models | | 0
Adapting OC20-trained EquiformerV2 Models for High-Entropy Materials | | 0
Open-Vocabulary Object Detection with Meta Prompt Representation and Instance Contrastive Optimization | | 0
MT-PATCHER: Selective and Extendable Knowledge Distillation from Large Language Models for Machine Translation | Code | 0
An Efficient End-to-End Approach to Noise Invariant Speech Features via Multi-Task Learning | Code | 0
CoroNetGAN: Controlled Pruning of GANs via Hypernetworks | | 0
LIX: Implicitly Infusing Spatial Geometric Prior Knowledge into Visual Semantic Segmentation for Autonomous Driving | | 0
Training Self-localization Models for Unseen Unfamiliar Places via Teacher-to-Student Data-Free Knowledge Transfer | | 0
Distilling Named Entity Recognition Models for Endangered Species from Large Language Models | | 0
Low-Energy On-Device Personalization for MCUs | Code | 0
Distilling the Knowledge in Data Pruning | | 0
MEND: Meta dEmonstratioN Distillation for Efficient and Effective In-Context Learning | Code | 0
One Category One Prompt: Dataset Distillation using Diffusion Models | | 0
AuG-KD: Anchor-Based Mixup Generation for Out-of-Domain Knowledge Distillation | Code | 0
Page 40 of 85

Benchmark Results

In each row, T: denotes the teacher model and S: the student model; the Verified column is blank because none of these results has been verified yet.

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | | Unverified
6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | | Unverified
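For context on the Claimed vs. Verified columns: verification amounts to re-running a model's evaluation and comparing the measured metric against the claimed one. The following is a minimal sketch of what a Top-1 accuracy check might look like in PyTorch; the model, data loader, and tolerance are hypothetical placeholders, not this site's actual harness.

```python
import torch

@torch.no_grad()
def top1_accuracy(model, loader, device="cuda"):
    """Fraction of samples whose argmax prediction matches the label,
    as a percentage. `model` and `loader` are assumed to be a trained
    classifier and an evaluation DataLoader (hypothetical inputs)."""
    model.eval()
    correct, total = 0, 0
    for images, labels in loader:
        images, labels = images.to(device), labels.to(device)
        preds = model(images).argmax(dim=-1)
        correct += (preds == labels).sum().item()
        total += labels.numel()
    return 100.0 * correct / total

# Hypothetical check against a claimed score, e.g. 86.43 from the
# first table above; the 0.05-point tolerance is an assumption.
# verified = abs(top1_accuracy(student, val_loader) - 86.43) < 0.05
```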