Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have a higher knowledge capacity than small models, this capacity may not be fully utilized. Distillation trains a small "student" model to reproduce the behavior of a large "teacher" model, often recovering much of the teacher's accuracy at a fraction of the inference cost.
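As a concrete illustration, below is a minimal sketch of the classic soft-target distillation loss of Hinton et al. (2015), assuming PyTorch; the temperature and mixing weight shown are illustrative defaults, not values taken from any paper or benchmark on this page.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.5):
    """Blend cross-entropy on hard labels with a KL term that pulls the
    student's softened predictions toward the teacher's."""
    # Soften both output distributions with the same temperature T.
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    log_soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    # KL divergence between the softened distributions; the T^2 factor
    # keeps gradient magnitudes comparable as T changes.
    kd_term = F.kl_div(log_soft_student, soft_teacher,
                       reduction="batchmean") * temperature ** 2
    # Standard supervised loss on the ground-truth labels.
    ce_term = F.cross_entropy(student_logits, labels)
    return alpha * kd_term + (1 - alpha) * ce_term

# Usage sketch (teacher and student are any classifiers with matching
# output dimensions; the teacher runs frozen, in eval mode):
#   teacher.eval()
#   with torch.no_grad():
#       teacher_logits = teacher(images)
#   loss = distillation_loss(student(images), teacher_logits, labels)
```

Many of the methods listed below refine this recipe (intermediate-layer features, cross-architecture transfer, data-free variants), but the teacher-supervises-student structure is the common core.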

Papers

Showing 2701–2750 of 4240 papers

Title | Status | Hype
Take a Prior from Other Tasks for Severe Blur Removal | – | 0
Learning from Noisy Crowd Labels with Logics | Code | 0
NYCU-TWO at Memotion 3: Good Foundation, Good Teacher, then you have Good Meme Analysis | – | 0
SCLIFD: Supervised Contrastive Knowledge Distillation for Incremental Fault Diagnosis under Limited Fault Data | – | 0
Feature Affinity Assisted Knowledge Distillation and Quantization of Deep Neural Networks on Label-Free Data | – | 0
SOCRATES: Text-based Human Search and Approach using a Robot Dog | – | 0
Toward Extremely Lightweight Distracted Driver Recognition With Distillation-Based Neural Architecture Search and Knowledge Transfer | Code | 0
Knowledge Distillation-based Information Sharing for Online Process Monitoring in Decentralized Manufacturing System | – | 0
SLaM: Student-Label Mixing for Distillation with Unlabeled Examples | – | 0
Enhancing Modality-Agnostic Representations via Meta-Learning for Brain Tumor Segmentation | – | 0
An Empirical Study of Uniform-Architecture Knowledge Distillation in Document Ranking | – | 0
Audio Representation Learning by Distilling Video as Privileged Information | – | 0
Knowledge Distillation in Vision Transformers: A Critical Review | – | 0
Heterogeneous Federated Knowledge Graph Embedding Learning and Unlearning | – | 0
Revisiting Intermediate Layer Distillation for Compressing Language Models: An Overfitting Perspective | Code | 0
Enhancing Once-For-All: A Study on Parallel Blocks, Skip Connections and Early Exits | – | 0
Generalized Uncertainty of Deep Neural Networks: Taxonomy and Applications | – | 0
Adaptive Search-and-Training for Robust and Efficient Network Pruning | Code | 0
Improved Knowledge Distillation for Pre-trained Language Models via Knowledge Selection | – | 0
Distill-DBDGAN: Knowledge Distillation and Adversarial Learning Framework for Defocus Blur Detection | Code | 0
Continual Segment: Towards a Single, Unified and Accessible Continual Segmentation Model of 143 Whole-body Organs in CT Scans | – | 0
Knowledge Distillation on Graphs: A Survey | – | 0
AMD: Adaptive Masked Distillation for Object Detection | – | 0
Knowledge Distillation ≈ Label Smoothing: Fact or Fallacy? | – | 0
On student-teacher deviations in distillation: does it pay to disobey? | – | 0
FractalAD: A simple industrial anomaly detection method using fractal anomaly generation and backbone knowledge distillation | Code | 0
Few-shot Face Image Translation via GAN Prior Distillation | – | 0
MVKT-ECG: Efficient Single-lead ECG Classification on Multi-Label Arrhythmia by Multi-View Knowledge Transferring | – | 0
Supervision Complexity and its Role in Knowledge Distillation | – | 0
Can We Use Probing to Better Understand Fine-tuning and Knowledge Distillation of the BERT NLU? | – | 0
Improved knowledge distillation by utilizing backward pass knowledge in neural networks | – | 0
EmbedDistill: A Geometric Knowledge Distillation for Information Retrieval | – | 0
Improving Text-based Early Prediction by Distillation from Privileged Time-Series Text | – | 0
A Simple Recipe for Competitive Low-compute Self-supervised Vision Models | – | 0
Unifying Synergies between Self-supervised Learning and Dynamic Computation | Code | 0
The Best of Both Worlds: Accurate Global and Personalized Models through Federated Learning with Data-Free Hyper-Knowledge Distillation | – | 0
ProKD: An Unsupervised Prototypical Knowledge Distillation Network for Zero-Resource Cross-Lingual Named Entity Recognition | – | 0
RNAS-CL: Robust Neural Architecture Search by Cross-Layer Knowledge Distillation | – | 0
Adaptively Integrated Knowledge Distillation and Prediction Uncertainty for Continual Learning | – | 0
Knowledge Distillation in Federated Edge Learning: A Survey | – | 0
A Cohesive Distillation Architecture for Neural Language Models | – | 0
Effective Decision Boundary Learning for Class Incremental Learning | – | 0
Synthetic data generation method for data-free knowledge distillation in regression neural networks | Code | 0
ERNIE 3.0 Tiny: Frustratingly Simple Method to Improve Task-Agnostic Distillation Generalization | Code | 0
Designing an Improved Deep Learning-based Model for COVID-19 Recognition in Chest X-ray Images: A Knowledge Distillation Approach | – | 0
RELIANT: Fair Knowledge Distillation for Graph Neural Networks | Code | 0
Knowledge-guided Causal Intervention for Weakly-supervised Object Localization | Code | 0
Open-Set Fine-Grained Retrieval via Prompting Vision-Language Evaluator | – | 0
CaPriDe Learning: Confidential and Private Decentralized Learning Based on Encryption-Friendly Distillation Loss | Code | 0
UniKD: Universal Knowledge Distillation for Mimicking Homogeneous or Heterogeneous Object Detectors | – | 0
Page 55 of 85

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | – | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | – | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | – | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | – | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | – | Unverified
6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | – | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | – | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | – | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | – | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | – | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 79.86 | – | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 78.76 | – | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 Accuracy (%) | 78.6 | – | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 78.28 | – | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 Accuracy (%) | 78.08 | – | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 77.93 | – | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 Accuracy (%) | 77.68 | – | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 77.5 | – | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.68 | – | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.31 | – | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | – | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | – | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | – | Unverified