
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have a higher knowledge capacity than small models, that capacity is often not fully utilized, so a compact student can frequently recover most of a large teacher's accuracy at a far lower inference cost. In the classic formulation, the student is trained to match the teacher's temperature-softened output distribution in addition to the ground-truth labels.
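
As a concrete illustration, the following is a minimal PyTorch sketch of that classic response-based distillation loss (Hinton et al., 2015). The function name distillation_loss and the defaults T = 4.0 and alpha = 0.9 are illustrative assumptions, not values taken from any paper or benchmark listed on this page.

    import torch
    import torch.nn.functional as F

    def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
        # Soft-target term: KL divergence between the teacher's and the
        # student's temperature-softened output distributions. The T*T
        # factor keeps gradient magnitudes comparable across temperatures.
        soft_targets = F.softmax(teacher_logits / T, dim=-1)
        log_student = F.log_softmax(student_logits / T, dim=-1)
        kd_term = F.kl_div(log_student, soft_targets, reduction="batchmean") * T * T
        # Hard-label term: ordinary cross-entropy against the ground truth.
        ce_term = F.cross_entropy(student_logits, labels)
        # alpha blends the two objectives (illustrative weighting).
        return alpha * kd_term + (1.0 - alpha) * ce_term

    # Example usage: distill 10-way logits for a batch of 8 samples.
    student_logits = torch.randn(8, 10, requires_grad=True)
    teacher_logits = torch.randn(8, 10)
    labels = torch.randint(0, 10, (8,))
    loss = distillation_loss(student_logits, teacher_logits, labels)
    loss.backward()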

Papers

Showing 2351–2400 of 4240 papers

Title | Status | Hype
Understanding and Improving Knowledge Distillation for Quantization-Aware Training of Large Transformer Encoders | Code | 0
EEG aided boosting of single-lead ECG based sleep staging with Deep Knowledge Distillation | Code | 1
DASECount: Domain-Agnostic Sample-Efficient Wireless Indoor Crowd Counting via Few-shot Learning | - | 0
Is Smaller Always Faster? Tradeoffs in Compressing Self-Supervised Speech Transformers | Code | 0
Knowledge distillation for fast and accurate DNA sequence correction | - | 0
DETRDistill: A Universal Knowledge Distillation Framework for DETR-families | - | 0
ConNER: Consistency Training for Cross-lingual Named Entity Recognition | Code | 1
Sub-Graph Learning for Spatiotemporal Forecasting via Knowledge Distillation | - | 0
BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection | Code | 1
D^3ETR: Decoder Distillation for Detection Transformer | - | 0
Yield Evaluation of Citrus Fruits based on the YoloV5 compressed by Knowledge Distillation | - | 0
Knowledge Distillation for Detection Transformer with Consistent Distillation Points Sampling | Code | 0
An Efficient Active Learning Pipeline for Legal Text Classification | - | 0
An Investigation of the Combination of Rehearsal and Knowledge Distillation in Continual Learning for Spoken Language Understanding | Code | 0
Instance-aware Model Ensemble With Distillation For Unsupervised Domain Adaptation | - | 0
FedCL: Federated Multi-Phase Curriculum Learning to Synchronously Correlate User Heterogeneity | Code | 1
Feature Correlation-guided Knowledge Transfer for Federated Self-supervised Learning | - | 0
An Interpretable Neuron Embedding for Static Knowledge Distillation | - | 0
Structured Knowledge Distillation Towards Efficient and Compact Multi-View 3D Detection | - | 0
Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection | Code | 1
Fcaformer: Forward Cross Attention in Hybrid Vision Transformer | Code | 1
Long-Range Zero-Shot Generative Deep Network Quantization | - | 0
MDFlow: Unsupervised Optical Flow Learning by Reliable Mutual Knowledge Distillation | Code | 1
Knowledge Distillation from Cross Teaching Teachers for Efficient Semi-Supervised Abdominal Organ Segmentation in CT | Code | 0
FAN-Trans: Online Knowledge Distillation for Facial Action Unit Detection | - | 0
PILE: Pairwise Iterative Logits Ensemble for Multi-Teacher Labeled Distillation | - | 0
Efficient Large-scale Audio Tagging via Transformer-to-CNN Knowledge Distillation | Code | 2
Knowledge Distillation for Federated Learning: a Practical Guide | - | 0
Understanding the Role of Mixup in Knowledge Distillation: An Empirical Study | Code | 0
Bridging Fairness and Environmental Sustainability in Natural Language Processing | - | 0
CoNMix for Source-free Single and Multi-target Domain Adaptation | Code | 1
AlphaPose: Whole-Body Regional Multi-Person Pose Estimation and Tracking in Real-Time | Code | 5
Closing the Gap between Client and Global Model Performance in Heterogeneous Federated Learning | - | 0
Peak-First CTC: Reducing the Peak Latency of CTC Models by Applying Peak-First Regularization | - | 0
Breaking the trade-off in personalized speech enhancement with cross-task knowledge distillation | - | 0
SSDA-YOLO: Semi-supervised Domain Adaptive YOLO for Cross-Domain Object Detection | Code | 2
LightVessel: Exploring Lightweight Coronary Artery Vessel Segmentation via Similarity Knowledge Distillation | - | 0
MPCFormer: fast, performant and private Transformer inference with MPC | Code | 1
Gradient Knowledge Distillation for Pre-trained Language Models | Code | 0
Multi-level Distillation of Semantic Knowledge for Pre-training Multilingual Language Model | - | 0
Fairness without Demographics through Knowledge Distillation | Code | 0
Lightweight Sound Event Detection Model with RepVGG Architecture | - | 0
Enhancing Chinese Multi-Label Text Classification Performance with Response-based Knowledge Distillation | - | 0
Maximum Likelihood Distillation for Robust Modulation Classification | - | 0
ARDIR: Improving Robustness using Knowledge Distillation of Internal Representation | - | 0
Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation | - | 0
Lightweight Neural Network with Knowledge Distillation for CSI Feedback | - | 0
QuaLA-MiniLM: a Quantized Length Adaptive MiniLM | - | 0
Generative Negative Text Replay for Continual Vision-Language Pretraining | - | 0
Application of Knowledge Distillation to Multi-task Speech Representation Learning | - | 0
Page 48 of 85

Benchmark Results

Each entry lists the teacher (T:) and student (S:) models. Claimed is the metric value reported by the paper's authors; the Verified column stays blank and the status reads Unverified until the result has been independently reproduced.

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | - | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | - | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | - | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | - | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | - | Unverified
6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | - | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | - | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | - | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | - | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | - | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | - | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | - | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | - | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | - | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | - | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | - | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | - | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | - | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | - | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | - | Unverified