
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity is often not fully utilized. Distillation exploits this gap: a compact student trained to mimic the teacher's outputs can recover much of the teacher's accuracy at a fraction of the inference cost.
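In the classic formulation (Hinton et al., 2015), the student is trained to match the teacher's temperature-softened output distribution in addition to the ground-truth labels. Below is a minimal PyTorch sketch of that loss; the function name and the `T` and `alpha` values are illustrative choices, not taken from any paper listed here.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Soft-label distillation loss in the style of Hinton et al. (2015).

    Blends a KL term (student matches the teacher's T-softened
    distribution) with standard cross-entropy on the hard labels.
    T and alpha are illustrative hyperparameters, not prescribed values.
    """
    # KL divergence between temperature-softened distributions.
    # The T**2 factor keeps soft-target gradients on the same scale
    # as the hard-label gradients when T > 1.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.log_softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
        log_target=True,
    ) * (T * T)
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard

# Smoke test with random logits for a 10-class problem.
student_logits = torch.randn(8, 10, requires_grad=True)
teacher_logits = torch.randn(8, 10)  # in practice: teacher(x) under torch.no_grad()
labels = torch.randint(0, 10, (8,))
loss = distillation_loss(student_logits, teacher_logits, labels)
loss.backward()
```

Many of the methods listed below augment or replace this logit-matching term with losses on intermediate features or inter-sample relations, but the teacher-supervises-student structure is the same.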

Papers

Showing 2151–2200 of 4240 papers

Title | Status | Hype
KS-DETR: Knowledge Sharing in Attention Learning for Detection Transformer | Code | 0
Debiased Distillation by Transplanting the Last Layer | - | 0
FrankenSplit: Efficient Neural Feature Compression with Shallow Variational Bottleneck Injection for Mobile Edge Computing | Code | 1
Two-in-one Knowledge Distillation for Efficient Facial Forgery Detection | - | 0
The Role of Masking for Efficient Supervised Knowledge Distillation of Vision Transformers | - | 0
CADIS: Handling Cluster-skewed Non-IID Data in Federated Learning with Clustered Aggregation and Knowledge DIStilled Regularization | Code | 0
Social4Rec: Distilling User Preference from Social Graph for Video Recommendation in Tencent | Code | 2
HomoDistil: Homotopic Task-Agnostic Distillation of Pre-trained Transformers | - | 0
RobustDistiller: Compressing Universal Speech Representations for Enhanced Environment Robustness | - | 0
Fairly Predicting Graft Failure in Liver Transplant for Organ Assigning | - | 0
Explicit and Implicit Knowledge Distillation via Unlabeled Data | - | 0
Few-shot 3D LiDAR Semantic Segmentation for Autonomous Driving | - | 0
ST-MFNet Mini: Knowledge Distillation-Driven Frame Interpolation | Code | 0
Fuzzy Knowledge Distillation from High-Order TSK to Low-Order TSK | - | 0
Cross Modal Distillation for Flood Extent Mapping | - | 0
LEALLA: Learning Lightweight Language-agnostic Sentence Embeddings with Knowledge Distillation | - | 0
Learning From Biased Soft Labels | - | 0
New Insights on Relieving Task-Recency Bias for Online Class Incremental Learning | Code | 0
A lightweight network for photovoltaic cell defect detection in electroluminescence images based on neural architecture search and knowledge distillation | - | 0
Offline-to-Online Knowledge Distillation for Video Instance Segmentation | - | 0
Multi-teacher knowledge distillation as an effective method for compressing ensembles of neural networks | Code | 1
Take a Prior from Other Tasks for Severe Blur Removal | - | 0
PerAda: Parameter-Efficient Federated Learning Personalization with Generalization Guarantees | Code | 1
Learning from Noisy Crowd Labels with Logics | Code | 0
Exploring Navigation Maps for Learning-Based Motion Prediction | Code | 1
NYCU-TWO at Memotion 3: Good Foundation, Good Teacher, then you have Good Meme Analysis | - | 0
SCLIFD: Supervised Contrastive Knowledge Distillation for Incremental Fault Diagnosis under Limited Fault Data | - | 0
Jaccard Metric Losses: Optimizing the Jaccard Index with Soft Labels | Code | 1
Dual Relation Knowledge Distillation for Object Detection | Code | 1
Feature Affinity Assisted Knowledge Distillation and Quantization of Deep Neural Networks on Label-Free Data | - | 0
CEN-HDR: Computationally Efficient neural Network for real-time High Dynamic Range imaging | Code | 1
SOCRATES: Text-based Human Search and Approach using a Robot Dog | - | 0
Toward Extremely Lightweight Distracted Driver Recognition With Distillation-Based Neural Architecture Search and Knowledge Transfer | Code | 0
Lightweight Transformers for Clinical Natural Language Processing | Code | 1
Knowledge Distillation-based Information Sharing for Online Process Monitoring in Decentralized Manufacturing System | - | 0
Enhancing Modality-Agnostic Representations via Meta-Learning for Brain Tumor Segmentation | - | 0
SLaM: Student-Label Mixing for Distillation with Unlabeled Examples | - | 0
An Empirical Study of Uniform-Architecture Knowledge Distillation in Document Ranking | - | 0
Audio Representation Learning by Distilling Video as Privileged Information | - | 0
Heterogeneous Federated Knowledge Graph Embedding Learning and Unlearning | - | 0
Knowledge Distillation in Vision Transformers: A Critical Review | - | 0
Revisiting Intermediate Layer Distillation for Compressing Language Models: An Overfitting Perspective | Code | 0
Enhancing Once-For-All: A Study on Parallel Blocks, Skip Connections and Early Exits | - | 0
Generalized Uncertainty of Deep Neural Networks: Taxonomy and Applications | - | 0
Distill-DBDGAN: Knowledge Distillation and Adversarial Learning Framework for Defocus Blur Detection | Code | 0
Adaptive Search-and-Training for Robust and Efficient Network Pruning | Code | 0
Knowledge Distillation on Graphs: A Survey | - | 0
Continual Segment: Towards a Single, Unified and Accessible Continual Segmentation Model of 143 Whole-body Organs in CT Scans | - | 0
Improved Knowledge Distillation for Pre-trained Language Models via Knowledge Selection | - | 0
AMD: Adaptive Masked Distillation for Object Detection | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | - | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | - | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | - | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | - | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | - | Unverified
6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | - | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | - | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | - | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | - | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 79.86 | - | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 78.76 | - | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 Accuracy (%) | 78.6 | - | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 78.28 | - | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 Accuracy (%) | 78.08 | - | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 77.93 | - | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 Accuracy (%) | 77.68 | - | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 77.5 | - | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.68 | - | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.31 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | - | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | - | Unverified
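
For reference, the Top-1 accuracy metric used in the tables above simply scores whether the model's highest-scoring class matches the ground-truth label. A minimal sketch (function name and shapes are illustrative):

```python
import torch

def top1_accuracy(logits: torch.Tensor, labels: torch.Tensor) -> float:
    """Fraction of samples whose highest-scoring class equals the label."""
    preds = logits.argmax(dim=-1)
    return (preds == labels).float().mean().item()

# Example: 8 samples, 10 classes.
logits = torch.randn(8, 10)
labels = torch.randint(0, 10, (8,))
print(f"top-1 accuracy: {100 * top1_accuracy(logits, labels):.2f}%")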