SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have a higher knowledge capacity than small models, this capacity might not be fully utilized, so a compact student model trained to mimic a large teacher can often recover most of the teacher's accuracy at a fraction of the inference cost.
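For orientation, the sketch below shows the classic logit-matching formulation of distillation (a KL term on temperature-softened outputs blended with the usual hard-label loss) in PyTorch. It is a minimal illustration, assuming hypothetical `teacher` and `student` classifiers that return logits; the `temperature` and `alpha` values are conventional defaults, not taken from any paper listed on this page.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits: torch.Tensor,
                      teacher_logits: torch.Tensor,
                      labels: torch.Tensor,
                      temperature: float = 4.0,
                      alpha: float = 0.5) -> torch.Tensor:
    """Blend hard-label cross-entropy with a KL term that pulls the
    student toward the teacher's temperature-softened outputs."""
    # Soften both output distributions with the same temperature.
    soft_targets = F.softmax(teacher_logits / temperature, dim=-1)
    log_student = F.log_softmax(student_logits / temperature, dim=-1)
    # KL divergence on the softened distributions; the T^2 factor keeps
    # gradient magnitudes comparable across temperatures.
    kd_term = F.kl_div(log_student, soft_targets,
                       reduction="batchmean") * temperature ** 2
    ce_term = F.cross_entropy(student_logits, labels)
    return alpha * kd_term + (1.0 - alpha) * ce_term

# Typical use in a training step (teacher frozen, student being trained):
#   with torch.no_grad():
#       t_logits = teacher(x)
#   loss = distillation_loss(student(x), t_logits, y)
```

Many of the papers below replace or augment this logit-matching term with feature, relation, or contrastive objectives, but the teacher-student setup is the common core.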

Papers

Showing 2801–2850 of 4240 papers

| Title | Status | Hype |
|---|---|---|
| Class-Incremental Learning by Knowledge Distillation with Adaptive Feature Consolidation | Code | 1 |
| A Dual-Contrastive Framework for Low-Resource Cross-Lingual Named Entity Recognition | Code | 0 |
| Fast Real-time Personalized Speech Enhancement: End-to-End Enhancement Network (E3Net) and Knowledge Distillation | | 0 |
| Feature Structure Distillation with Centered Kernel Alignment in BERT Transferring | Code | 1 |
| End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge Distillation | Code | 1 |
| Knowledge distillation with error-correcting transfer learning for wind power prediction | | 0 |
| Unified and Effective Ensemble Knowledge Distillation | | 0 |
| Rethinking Position Bias Modeling with Knowledge Distillation for CTR Prediction | | 0 |
| Preventing Distillation-based Attacks on Neural Network IP | | 0 |
| Distill-VQ: Learning Retrieval Oriented Vector Quantization By Distilling Knowledge from Dense Embeddings | Code | 1 |
| Conditional Autoregressors are Interpretable Classifiers | | 0 |
| A Closer Look at Rehearsal-Free Continual Learning | | 0 |
| It's All In the Teacher: Zero-Shot Quantization Brought Closer to the Teacher | Code | 1 |
| Adversarial Speaker Distillation for Countermeasure Model on Automatic Speaker Verification | | 0 |
| Rainbow Keywords: Efficient Incremental Learning for Online Spoken Keyword Spotting | Code | 1 |
| Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models | | 0 |
| Monitored Distillation for Positive Congruent Depth Completion | Code | 1 |
| Self-Distillation from the Last Mini-Batch for Consistency Regularization | Code | 1 |
| Nix-TTS: Lightweight and End-to-End Text-to-Speech via Module-wise Distillation | Code | 2 |
| Instance Relation Graph Guided Source-Free Domain Adaptive Object Detection | Code | 1 |
| Knowledge Distillation: Bad Models Can Be Good Role Models | | 0 |
| RAVIR: A Dataset and Methodology for the Semantic Segmentation and Quantitative Analysis of Retinal Arteries and Veins in Infrared Reflectance Imaging | | 0 |
| Doodle It Yourself: Class Incremental Learning by Drawing a Few Sketches | | 0 |
| Uncertainty-aware Contrastive Distillation for Incremental Semantic Segmentation | Code | 1 |
| Knowledge Distillation with the Reused Teacher Classifier | Code | 1 |
| Model LEGO: Creating Models Like Disassembling and Assembling Building Blocks | Code | 1 |
| PCA-Based Knowledge Distillation Towards Lightweight and Content-Style Balanced Photorealistic Style Transfer Models | Code | 1 |
| A Cross-Domain Approach for Continuous Impression Recognition from Dyadic Audio-Visual-Physio Signals | | 0 |
| Class-Incremental Learning for Action Recognition in Videos | | 0 |
| Rich Feature Construction for the Optimization-Generalization Dilemma | Code | 1 |
| Ensembling and Knowledge Distilling of Large Sequence Taggers for Grammatical Error Correction | Code | 1 |
| R-DFCIL: Relation-Guided Representation Learning for Data-Free Class Incremental Learning | Code | 1 |
| Multitask Emotion Recognition Model with Knowledge Distillation and Task Discriminator | | 0 |
| Mitigating Gender Bias in Distilled Language Models via Counterfactual Role Reversal | | 0 |
| Towards Expressive Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis | | 0 |
| Scale-Equivalent Distillation for Semi-Supervised Object Detection | | 0 |
| On Neural Network Equivalence Checking using SMT Solvers | | 0 |
| Channel Self-Supervision for Online Knowledge Distillation | | 0 |
| SSD-KD: A Self-supervised Diverse Knowledge Distillation Method for Lightweight Skin Lesion Classification Using Dermoscopic Images | Code | 1 |
| DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and Quantization | Code | 1 |
| Document-Level Relation Extraction with Adaptive Focal Loss and Knowledge Distillation | Code | 1 |
| Open-Vocabulary One-Stage Detection with Hierarchical Visual-Language Knowledge Distillation | Code | 1 |
| Emulating Quantum Dynamics with Neural Networks via Knowledge Distillation | Code | 0 |
| A Closer Look at Knowledge Distillation with Features, Logits, and Gradients | | 0 |
| Delta Distillation for Efficient Video Processing | Code | 0 |
| When Chosen Wisely, More Data Is What You Need: A Universal Sample-Efficient Strategy For Data Augmentation | Code | 1 |
| Fine-tuning Global Model via Data-Free Knowledge Distillation for Non-IID Federated Learning | Code | 1 |
| Sample, Translate, Recombine: Leveraging Audio Alignments for Data Augmentation in End-to-end Speech Translation | | 0 |
| Domain Adaptive Hand Keypoint and Pixel Localization in the Wild | | 0 |
| Decoupled Knowledge Distillation | Code | 2 |
Page 57 of 85

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | – | Unverified |
| 2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | – | Unverified |
| 3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | – | Unverified |
| 4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | – | Unverified |
| 5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | – | Unverified |
| 6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | – | Unverified |
| 7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | – | Unverified |
| 8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | – | Unverified |
| 9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | – | Unverified |
| 10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | – | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | – | Unverified |
| 2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | – | Unverified |
| 3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | – | Unverified |
| 4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | – | Unverified |
| 5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | – | Unverified |
| 6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | – | Unverified |
| 7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | – | Unverified |
| 8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | – | Unverified |
| 9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | – | Unverified |
| 10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | – | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | – | Unverified |
| 2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | – | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | – | Unverified |