Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2551–2600 of 4240 papers

Title	Date	Tasks	Status
Do Not Blindly Imitate the Teacher: Using Perturbed Loss for Knowledge Distillation	May 8, 2023	Knowledge Distillation	—Unverified
Web Content Filtering through knowledge distillation of Large Language Models	May 8, 2023	Knowledge Distillation	—Unverified
NeuroComparatives: Neuro-Symbolic Distillation of Comparative Knowledge	May 8, 2023	Knowledge Distillationvalid	—Unverified
Structural and Statistical Texture Knowledge Distillation for Semantic Segmentation	May 6, 2023	Knowledge DistillationQuantization	—Unverified
Distilled Mid-Fusion Transformer Networks for Multi-Modal Human Activity Recognition	May 5, 2023	Activity RecognitionFeature Engineering	—Unverified
Smaller3d: Smaller Models for 3D Semantic Segmentation Using Minkowski Engine and Knowledge Distillation Methods	May 4, 2023	3D Semantic SegmentationKnowledge Distillation	CodeCode Available
A Systematic Study of Knowledge Distillation for Natural Language Generation with Pseudo-Target Training	May 3, 2023	Knowledge DistillationText Generation	CodeCode Available
Structure Aware Incremental Learning with Personalized Imitation Weights for Recommender Systems	May 2, 2023	Incremental LearningKnowledge Distillation	—Unverified
Distill or Annotate? Cost-Efficient Fine-Tuning of Compact Models	May 2, 2023	Knowledge Distillation	—Unverified
Detect, Distill and Update: Detect, Distill and Update: Learned DB Systems Facing Out of Distribution Data	May 1, 2023	Knowledge DistillationSynthetic Data Generation	CodeCode Available
Scaffolding a Student to Instill Knowledge	May 1, 2023	Knowledge Distillation	CodeCode Available
Refined Response Distillation for Class-Incremental Player Detection	May 1, 2023	Knowledge Distillationobject-detection	CodeCode Available
Ensemble Modeling with Contrastive Knowledge Distillation for Sequential Recommendation	Apr 28, 2023	AttributeContrastive Learning	CodeCode Available
Multi-to-Single Knowledge Distillation for Point Cloud Semantic Segmentation	Apr 28, 2023	Knowledge DistillationSemantic Segmentation	CodeCode Available
CORSD: Class-Oriented Relational Self Distillation	Apr 28, 2023	Knowledge DistillationModel Compression	—Unverified
Learning Human-Human Interactions in Images from Weak Textual Supervision	Apr 27, 2023	Human-Human Interaction RecognitionImage Captioning	—Unverified
Shape-Net: Room Layout Estimation from Panoramic Images Robust to Occlusion using Knowledge Distillation with 3D Shapes as Additional Inputs	Apr 25, 2023	3D geometry3D Reconstruction	—Unverified
A Forward and Backward Compatible Framework for Few-shot Class-incremental Pill Recognition	Apr 24, 2023	class-incremental learningClass Incremental Learning	CodeCode Available
Interruption-Aware Cooperative Perception for V2X Communication-Aided Autonomous Driving	Apr 24, 2023	Autonomous DrivingAutonomous Vehicles	—Unverified
Improving Knowledge Distillation via Transferring Learning Ability	Apr 24, 2023	Knowledge Distillation	CodeCode Available
Decouple Non-parametric Knowledge Distillation For End-to-end Speech Translation	Apr 20, 2023	Knowledge DistillationMachine Translation	—Unverified
Word Sense Induction with Knowledge Distillation from BERT	Apr 20, 2023	Knowledge DistillationLanguage Modeling	—Unverified
Biologically inspired structure learning with reverse knowledge distillation for spiking neural networks	Apr 19, 2023	Knowledge Distillation	—Unverified
Knowledge Distillation Under Ideal Joint Classifier Assumption	Apr 19, 2023	Domain AdaptationKnowledge Distillation	—Unverified
An Empirical Study of Leveraging Knowledge Distillation for Compressing Multilingual Neural Machine Translation Models	Apr 19, 2023	Knowledge DistillationMachine Translation	—Unverified
Deep Collective Knowledge Distillation	Apr 18, 2023	Knowledge DistillationModel Compression	—Unverified
Learning to "Segment Anything" in Thermal Infrared Images through Knowledge Distillation with a Large Scale Dataset SATIR	Apr 17, 2023	Image SegmentationKnowledge Distillation	CodeCode Available
LaSNN: Layer-wise ANN-to-SNN Distillation for Effective and Efficient Training in Deep Spiking Neural Networks	Apr 17, 2023	Knowledge Distillation	—Unverified
Always Strengthen Your Strengths: A Drift-Aware Incremental Learning Framework for CTR Prediction	Apr 17, 2023	Click-Through Rate PredictionDiversity	—Unverified
Teacher Network Calibration Improves Cross-Quality Knowledge Distillation	Apr 15, 2023	image-classificationImage Classification	CodeCode Available
Learn What Is Possible, Then Choose What Is Best: Disentangling One-To-Many Relations in Language Through Text-based Games	Apr 14, 2023	Knowledge Distillationtext-based games	CodeCode Available
Class-Incremental Learning of Plant and Disease Detection: Growing Branches with Knowledge Distillation	Apr 13, 2023	class-incremental learningClass Incremental Learning	—Unverified
Constructing Deep Spiking Neural Networks from Artificial Neural Networks with Knowledge Distillation	Apr 12, 2023	Knowledge Distillation	—Unverified
SFT-KD-Recon: Learning a Student-friendly Teacher for Knowledge Distillation in Magnetic Resonance Image Reconstruction	Apr 11, 2023	Image ReconstructionKnowledge Distillation	CodeCode Available
Grouped Knowledge Distillation for Deep Face Recognition	Apr 10, 2023	Face RecognitionKnowledge Distillation	—Unverified
A Survey on Recent Teacher-student Learning Studies	Apr 10, 2023	Knowledge DistillationSurvey	—Unverified
HyperINR: A Fast and Predictive Hypernetwork for Implicit Neural Representations via Knowledge Distillation	Apr 9, 2023	Knowledge DistillationNovel View Synthesis	—Unverified
Homogenizing Non-IID datasets via In-Distribution Knowledge Distillation for Decentralized Learning	Apr 9, 2023	image-classificationImage Classification	—Unverified
A Comprehensive Survey on Knowledge Distillation of Diffusion Models	Apr 9, 2023	Knowledge DistillationSurvey	—Unverified
Model-Agnostic Decentralized Collaborative Learning for On-Device POI Recommendation	Apr 8, 2023	Knowledge DistillationPrivacy Preserving	—Unverified
Masked Student Dataset of Expressions	Apr 7, 2023	Contrastive LearningFacial Expression Recognition	CodeCode Available
Continual Detection Transformer for Incremental Object Detection	Apr 6, 2023	Class-Incremental Object DetectionKnowledge Distillation	—Unverified
Self-Distillation for Gaussian Process Regression and Classification	Apr 5, 2023	ClassificationGPR	CodeCode Available
Towards Efficient Task-Driven Model Reprogramming with Foundation Models	Apr 5, 2023	Knowledge DistillationTransfer Learning	—Unverified
MadEye: Boosting Live Video Analytics Accuracy with Adaptive Camera Configurations	Apr 4, 2023	Knowledge Distillation	—Unverified
Cross-Class Feature Augmentation for Class Incremental Learning	Apr 4, 2023	class-incremental learningClass Incremental Learning	—Unverified
Domain Generalization for Crop Segmentation with Standardized Ensemble Knowledge Distillation	Apr 3, 2023	Domain GeneralizationKnowledge Distillation	CodeCode Available
Knowledge-Distilled Graph Neural Networks for Personalized Epileptic Seizure Detection	Apr 3, 2023	channel selectionEEG	—Unverified
A Unified Compression Framework for Efficient Speech-Driven Talking-Face Generation	Apr 2, 2023	Face GenerationKnowledge Distillation	—Unverified
Quick Dense Retrievers Consume KALE: Post Training Kullback Leibler Alignment of Embeddings for Asymmetrical dual encoders	Mar 31, 2023	Knowledge DistillationLanguage Modeling	—Unverified

Show:10 25 50

← PrevPage 52 of 85Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified