Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3851–3900 of 4240 papers

Title	Date	Tasks	Status
Collaborative Distillation in the Parameter and Spectrum Domains for Video Action Recognition	Sep 15, 2020	Action RecognitionKnowledge Distillation	—Unverified
Autoregressive Knowledge Distillation through Imitation Learning	Sep 15, 2020	Imitation LearningKnowledge Distillation	CodeCode Available
DualDE: Dually Distilling Knowledge Graph Embedding for Faster and Cheaper Reasoning	Sep 13, 2020	Graph EmbeddingKnowledge Distillation	—Unverified
SSKD: Self-Supervised Knowledge Distillation for Cross Domain Adaptive Person Re-Identification	Sep 13, 2020	ClusteringDomain Adaptive Person Re-Identification	—Unverified
BoostingBERT:Integrating Multi-Class Boosting into BERT for NLP Tasks	Sep 13, 2020	Ensemble LearningKnowledge Distillation	—Unverified
Extending Label Smoothing Regularization with Self-Knowledge Distillation	Sep 11, 2020	Knowledge DistillationSelf-Knowledge Distillation	—Unverified
On the Orthogonality of Knowledge Distillation with Other Techniques: From an Ensemble Perspective	Sep 9, 2020	Data AugmentationEfficient Neural Network	—Unverified
Lifelong Object Detection	Sep 2, 2020	Knowledge DistillationLifelong learning	—Unverified
SAIL: Self-Augmented Graph Contrastive Learning	Sep 2, 2020	Contrastive LearningKnowledge Distillation	—Unverified
Automatic Assignment of Radiology Examination Protocols Using Pre-trained Language Models with Knowledge Distillation	Sep 1, 2020	Data AugmentationKnowledge Distillation	CodeCode Available
Classification of Diabetic Retinopathy Using Unlabeled Data and Knowledge Distillation	Sep 1, 2020	ClassificationGeneral Classification	—Unverified
Initial Classifier Weights Replay for Memoryless Class Incremental Learning	Aug 31, 2020	Allclass-incremental learning	—Unverified
MetaDistiller: Network Self-Boosting via Meta-Learned Top-Down Distillation	Aug 27, 2020	Knowledge DistillationMeta-Learning	—Unverified
Point Adversarial Self Mining: A Simple Method for Facial Expression Recognition	Aug 26, 2020	Adversarial AttackData Augmentation	—Unverified
Active Class Incremental Learning for Imbalanced Datasets	Aug 25, 2020	class-incremental learningClass Incremental Learning	—Unverified
Learn to Talk via Proactive Knowledge Transfer	Aug 23, 2020	de-enKnowledge Distillation	—Unverified
Multi-Person Full Body Pose Estimation	Aug 23, 2020	Knowledge DistillationMulti-Person Pose Estimation	—Unverified
Rectified Decision Trees: Exploring the Landscape of Interpretable and Effective Machine Learning	Aug 21, 2020	BIG-bench Machine LearningKnowledge Distillation	—Unverified
Learning to Extract Attribute Value from Product via Question Answering: A Multi-task Approach	Aug 20, 2020	AttributeAttribute Value Extraction	—Unverified
Cascaded channel pruning using hierarchical self-distillation	Aug 16, 2020	Knowledge DistillationModel Compression	—Unverified
An Ensemble of Knowledge Sharing Models for Dynamic Hand Gesture Recognition	Aug 13, 2020	Gesture RecognitionHand Gesture Recognition	—Unverified
Compression of Deep Learning Models for Text: A Survey	Aug 12, 2020	Deep LearningInformation Retrieval	—Unverified
Towards Unsupervised Crowd Counting via Regression-Detection Bi-knowledge Transfer	Aug 12, 2020	Crowd CountingKnowledge Distillation	—Unverified
Compact Speaker Embedding: lrx-vector	Aug 11, 2020	Knowledge DistillationSpeaker Recognition	—Unverified
S2OSC: A Holistic Semi-Supervised Approach for Open Set Classification	Aug 11, 2020	General ClassificationKnowledge Distillation	—Unverified
Knowledge Distillation and Data Selection for Semi-Supervised Learning in CTC Acoustic Models	Aug 10, 2020	Knowledge Distillationspeech-recognition	—Unverified
Knowledge Distillation-aided End-to-End Learning for Linear Precoding in Multiuser MIMO Downlink Systems with Finite-Rate Feedback	Aug 10, 2020	BinarizationKnowledge Distillation	—Unverified
LRSpeech: Extremely Low-Resource Speech Synthesis and Recognition	Aug 9, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
MED-TEX: Transferring and Explaining Knowledge with Less Data from Pretrained Medical Imaging Models	Aug 6, 2020	image-classificationImage Classification	—Unverified
Prime-Aware Adaptive Distillation	Aug 4, 2020	Knowledge DistillationMetric Learning	—Unverified
TutorNet: Towards Flexible Knowledge Distillation for End-to-End Speech Recognition	Aug 3, 2020	Knowledge DistillationModel Compression	—Unverified
Teacher-Student Training and Triplet Loss for Facial Expression Recognition under Occlusion	Aug 3, 2020	Facial Expression RecognitionFacial Expression Recognition (FER)	—Unverified
Differentiable Feature Aggregation Search for Knowledge Distillation	Aug 2, 2020	Knowledge DistillationModel Compression	—Unverified
Feature Normalized Knowledge Distillation for Image Classification	Aug 1, 2020	ClassificationGeneral Classification	CodeCode Available
YOLO in the Dark - Domain Adaptation Method for Merging Multiple Models -	Aug 1, 2020	Domain AdaptationKnowledge Distillation	—Unverified
Exclusivity-Consistency Regularized Knowledge Distillation for Face Recognition	Aug 1, 2020	DiversityFace Recognition	—Unverified
Local Correlation Consistency for Knowledge Distillation	Aug 1, 2020	Knowledge Distillation	—Unverified
AMLN: Adversarial-based Mutual Learning Network for Online Knowledge Distillation	Aug 1, 2020	Knowledge DistillationTransfer Learning	—Unverified
Weight Decay Scheduling and Knowledge Distillation for Active Learning	Aug 1, 2020	Active LearningKnowledge Distillation	—Unverified
Dynamic Knowledge Distillation for Black-box Hypothesis Transfer Learning	Jul 24, 2020	Knowledge DistillationTransfer Learning	—Unverified
Multi-label Contrastive Predictive Coding	Jul 20, 2020	Knowledge DistillationMulti-class Classification	—Unverified
Interpretable Foreground Object Search As Knowledge Distillation	Jul 20, 2020	Knowledge DistillationObject	—Unverified
CovidCare: Transferring Knowledge from Existing EMR to Emerging Epidemic for Interpretable Prognosis	Jul 17, 2020	DiagnosticKnowledge Distillation	—Unverified
Knowledge Distillation in Deep Learning and its Applications	Jul 17, 2020	Deep LearningKnowledge Distillation	—Unverified
UniTrans: Unifying Model Transfer and Data Transfer for Cross-Lingual Named Entity Recognition with Unlabeled Data	Jul 15, 2020	Cross-Lingual NERCross-Lingual Transfer	CodeCode Available
P-KDGAN: Progressive Knowledge Distillation with GANs for One-class Novelty Detection	Jul 14, 2020	Anomaly DetectionDecoder	—Unverified
Add a SideNet to your MainNet	Jul 14, 2020	General ClassificationKnowledge Distillation	—Unverified
Dual-Teacher: Integrating Intra-domain and Inter-domain Teachers for Annotation-efficient Cardiac Segmentation	Jul 13, 2020	Cardiac SegmentationDomain Adaptation	—Unverified
Representation Transfer by Optimal Transport	Jul 13, 2020	Knowledge DistillationModel Compression	—Unverified
Optical Flow Distillation: Towards Efficient and Stable Video Style Transfer	Jul 10, 2020	Knowledge DistillationOptical Flow Estimation	—Unverified

Show:10 25 50

← PrevPage 78 of 85Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified