Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.
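As an illustration of what "transferring knowledge" can mean in practice, here is a minimal sketch of one common formulation: the student is trained to match the teacher's temperature-softened output distribution. This page does not specify a method, so the loss below, the function names `softmax` and `distillation_loss`, the temperature of 2.0, and the example logits are all assumptions chosen for illustration.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax; a higher temperature gives a flatter distribution."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) on softened outputs, scaled by temperature squared.

    Assumed formulation for illustration only; real training usually mixes
    this term with an ordinary cross-entropy loss on the true labels.
    """
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    kl = sum(
        p * (math.log(p) - math.log(q))
        for p, q in zip(p_teacher, p_student)
        if p > 0
    )
    return temperature ** 2 * kl


# Hypothetical logits for a single 3-class example.
teacher_logits = [5.0, 2.0, 0.5]
good_student = [4.0, 1.5, 0.2]   # roughly agrees with the teacher
poor_student = [0.5, 2.0, 5.0]   # ranks the classes in the opposite order

loss_good = distillation_loss(good_student, teacher_logits)
loss_poor = distillation_loss(poor_student, teacher_logits)
print(loss_good < loss_poor)
```

The loss is zero when the student reproduces the teacher's distribution exactly and grows as the two diverge, which is what lets a small model learn from the relative probabilities a large model assigns to the wrong classes as well as the right one.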

Papers

Showing 3451–3500 of 4240 papers

Title	Date	Tasks	Status
Neural Architecture Search via Ensemble-based Knowledge Distillation	Sep 29, 2021	Diversity, Knowledge Distillation	Unverified
Feature Kernel Distillation	Sep 29, 2021	Image Classification	Unverified
Pseudo Knowledge Distillation: Towards Learning Optimal Instance-specific Label Smoothing Regularization	Sep 29, 2021	Image Classification	Unverified
Prototypical Contrastive Predictive Coding	Sep 29, 2021	Contrastive Learning, Knowledge Distillation	Unverified
Improving Question Answering Performance Using Knowledge Distillation and Active Learning	Sep 26, 2021	Active Learning, Knowledge Distillation	Code Available
Partial to Whole Knowledge Distillation: Progressive Distilling Decomposed Knowledge Boosts Student Better	Sep 26, 2021	Knowledge Distillation	Unverified
Recent Advances of Continual Learning in Computer Vision: An Overview	Sep 23, 2021	Continual Learning, Knowledge Distillation	Unverified
KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation	Sep 22, 2021	cross-modal alignment, Knowledge Distillation	Code Available
The NiuTrans Machine Translation Systems for WMT21	Sep 22, 2021	Knowledge Distillation, Machine Translation	Unverified
K-AID: Enhancing Pre-trained Language Models with Domain Knowledge for Question Answering	Sep 22, 2021	CPU, Knowledge Distillation	Unverified
Low-Latency Incremental Text-to-Speech Synthesis with Distilled Context Prediction Network	Sep 22, 2021	Knowledge Distillation, Language Modeling	Unverified
Knowledge Distillation with Noisy Labels for Natural Language Understanding	Sep 21, 2021	Knowledge Distillation, Natural Language Understanding	Unverified
RAIL-KD: RAndom Intermediate Layer Mapping for Knowledge Distillation	Sep 21, 2021	Knowledge Distillation	Unverified
Releasing Graph Neural Networks with Differential Privacy Guarantees	Sep 18, 2021	Knowledge Distillation, Privacy Preserving	Code Available
Towards Full Utilization on Mask Task for Distilling PLMs into NMT	Sep 17, 2021	Knowledge Distillation, Machine Translation	Unverified
Label Assignment Distillation for Object Detection	Sep 16, 2021	Knowledge Distillation, Object	Unverified
New Perspective on Progressive GANs Distillation for One-class Novelty Detection	Sep 15, 2021	Decoder, Generative Adversarial Network	Unverified
AligNART: Non-autoregressive Neural Machine Translation by Jointly Learning to Estimate Alignment and Translate	Sep 14, 2021	Decoder, Knowledge Distillation	Unverified
Multihop: Leveraging Complex Models to Learn Accurate Simple Models	Sep 14, 2021	Explainable artificial intelligence, Knowledge Distillation	Unverified
A Note on Knowledge Distillation Loss Function for Object Classification	Sep 14, 2021	Knowledge Distillation, Model Compression	Unverified
Secure Your Ride: Real-time Matching Success Rate Prediction for Passenger-Driver Pairs	Sep 14, 2021	Decision Making, Knowledge Distillation	Unverified
UniMS: A Unified Framework for Multimodal Summarization with Knowledge Distillation	Sep 13, 2021	Abstractive Text Summarization, Decoder	Unverified
KroneckerBERT: Learning Kronecker Decomposition for Pre-trained Language Models via Knowledge Distillation	Sep 13, 2021	Knowledge Distillation, Language Modeling	Unverified
On the Efficiency of Subclass Knowledge Distillation in Classification Tasks	Sep 12, 2021	Binary Classification, Classification	Unverified
Federated Ensemble Model-based Reinforcement Learning in Edge Computing	Sep 12, 2021	Autonomous Driving, continuous-control	Unverified
Towards Developing a Multilingual and Code-Mixed Visual Question Answering System by Knowledge Distillation	Sep 10, 2021	Knowledge Distillation, Question Answering	Unverified
Learning to Teach with Student Feedback	Sep 10, 2021	Knowledge Distillation	Unverified
Dual Correction Strategy for Ranking Distillation in Top-N Recommender System	Sep 8, 2021	Knowledge Distillation, Recommendation Systems	Code Available
CAM-loss: Towards Learning Spatially Discriminative Feature Representations	Sep 3, 2021	Few-Shot Learning, image-classification	Unverified
Complementary Calibration: Boosting General Continual Learning with Collaborative Distillation and Self-Supervision	Sep 3, 2021	Continual Learning, Contrastive Learning	Code Available
Decoupled Transformer for Scalable Inference in Open-domain Question Answering	Sep 1, 2021	Knowledge Distillation, Machine Reading Comprehension	Unverified
Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge Distillation	Sep 1, 2021	Deep Reinforcement Learning, General Reinforcement Learning	Code Available
Knowledge Distillation with BERT for Image Tag-Based Privacy Prediction	Sep 1, 2021	Knowledge Distillation, TAG	Unverified
FedKD: Communication Efficient Federated Learning via Knowledge Distillation	Aug 30, 2021	Federated Learning, Knowledge Distillation	Unverified
Lipschitz Continuity Guided Knowledge Distillation	Aug 29, 2021	Knowledge Distillation, Model Compression	Unverified
Distilling the Knowledge of Large-scale Generative Models into Retrieval Models for Efficient Open-domain Conversation	Aug 28, 2021	Knowledge Distillation, Retrieval	Code Available
CoCo DistillNet: a Cross-layer Correlation Distillation Network for Pathological Gastric Cancer Segmentation	Aug 27, 2021	Image Segmentation, Knowledge Distillation	Unverified
SIGN: Spatial-information Incorporated Generative Network for Generalized Zero-shot Semantic Segmentation	Aug 27, 2021	Knowledge Distillation, Segmentation	Unverified
Efficient training of lightweight neural networks using Online Self-Acquired Knowledge Distillation	Aug 26, 2021	Density Estimation, Knowledge Distillation	Unverified
Deploying a BERT-based Query-Title Relevance Classifier in a Production System: a View from the Trenches	Aug 23, 2021	CPU, Data Augmentation	Unverified
Personalised Federated Learning: A Combinational Approach	Aug 22, 2021	Federated Learning, Knowledge Distillation	Unverified
Boosting of Head Pose Estimation by Knowledge Distillation	Aug 20, 2021	Head Pose Estimation, Knowledge Distillation	Unverified
G-DetKD: Towards General Distillation Framework for Object Detectors via Contrastive and Semantic-guided Feature Imitation	Aug 17, 2021	Knowledge Distillation, object-detection	Unverified
BERT Learns to Teach: Knowledge Distillation with Meta Learning	Aug 17, 2021	Knowledge Distillation, Meta-Learning	Unverified
Online Continual Learning For Visual Food Classification	Aug 15, 2021	Classification, Continual Learning	Unverified
Multi-granularity for knowledge distillation	Aug 15, 2021	Knowledge Distillation, Person Re-Identification	Code Available
PAIR: Leveraging Passage-Centric Similarity Relation for Improving Dense Passage Retrieval	Aug 13, 2021	Knowledge Distillation, Natural Questions	Unverified
Learning from Matured Dumb Teacher for Fine Generalization	Aug 12, 2021	Image Classification	Unverified
Preventing Catastrophic Forgetting and Distribution Mismatch in Knowledge Distillation via Synthetic Data	Aug 11, 2021	Knowledge Distillation, Model Compression	Unverified
Lifelong Intent Detection via Multi-Strategy Rebalancing	Aug 10, 2021	Intent Detection, Knowledge Distillation	Unverified

Datasets (one results table each, in this order): ImageNet, CIFAR-100, COCO (Common Objects in Context), COCO 2017 val, PASCAL VOC, KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified