SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized; a small model trained to mimic the large one can therefore often recover much of its accuracy at a fraction of the inference cost.

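In the canonical recipe of Hinton et al. (2015), the student is trained to match the teacher's temperature-softened output distribution alongside the usual hard-label loss. The sketch below is a minimal illustration, assuming PyTorch; the toy MLPs, temperature T, and mixing weight alpha are illustrative placeholders, not settings drawn from any paper listed here.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Soft-target distillation loss (Hinton et al., 2015)."""
    # KL divergence between the temperature-softened teacher and student
    # distributions; the T^2 factor keeps gradient magnitudes comparable
    # across temperatures.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    # Ordinary cross-entropy against the ground-truth labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard

# Toy demonstration: distill a larger "teacher" MLP into a smaller "student".
teacher = nn.Sequential(nn.Linear(32, 256), nn.ReLU(), nn.Linear(256, 10))
student = nn.Sequential(nn.Linear(32, 64), nn.ReLU(), nn.Linear(64, 10))
teacher.eval()  # the teacher is frozen; only the student is trained

x = torch.randn(8, 32)               # batch of 8 examples, 32 features
labels = torch.randint(0, 10, (8,))  # ground-truth class labels
with torch.no_grad():                # no gradients through the teacher
    teacher_logits = teacher(x)
loss = distillation_loss(student(x), teacher_logits, labels)
loss.backward()                      # gradients flow only into the student
```
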
Papers

Showing 451–500 of 4240 papers (page 10 of 85)

Title | Status | Hype
FedACK: Federated Adversarial Contrastive Knowledge Distillation for Cross-Lingual and Cross-Model Social Bot Detection | Code | 1
Enhancing Low-resolution Face Recognition with Feature Similarity Knowledge Distillation | Code | 1
DistilPose: Tokenized Pose Regression with Heatmap Distillation | Code | 1
Distillation from Heterogeneous Models for Top-K Recommendation | Code | 1
Towards Activated Muscle Group Estimation in the Wild | Code | 1
Generic-to-Specific Distillation of Masked Autoencoders | Code | 1
Graph-based Knowledge Distillation: A survey and experimental evaluation | Code | 1
A framework for benchmarking class-out-of-distribution detection and its application to ImageNet | Code | 1
A Neural Span-Based Continual Named Entity Recognition Model | Code | 1
CEKD: Cross-Modal Edge-Privileged Knowledge Distillation for Semantic Scene Understanding Using Only Thermal Images | Code | 1
FrankenSplit: Efficient Neural Feature Compression with Shallow Variational Bottleneck Injection for Mobile Edge Computing | Code | 1
Multi-teacher knowledge distillation as an effective method for compressing ensembles of neural networks | Code | 1
Exploring Navigation Maps for Learning-Based Motion Prediction | Code | 1
PerAda: Parameter-Efficient Federated Learning Personalization with Generalization Guarantees | Code | 1
Dual Relation Knowledge Distillation for Object Detection | Code | 1
Jaccard Metric Losses: Optimizing the Jaccard Index with Soft Labels | Code | 1
CEN-HDR: Computationally Efficient neural Network for real-time High Dynamic Range imaging | Code | 1
Lightweight Transformers for Clinical Natural Language Processing | Code | 1
Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation | Code | 1
OvarNet: Towards Open-vocabulary Object Attribute Recognition | Code | 1
Online Hyperparameter Optimization for Class-Incremental Learning | Code | 1
TinyHD: Efficient Video Saliency Prediction with Heterogeneous Decoders using Hierarchical Maps Distillation | Code | 1
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation | Code | 1
Remembering Normality: Memory-guided Knowledge Distillation for Unsupervised Anomaly Detection | Code | 1
Dual Learning with Dynamic Knowledge Distillation for Partially Relevant Video Retrieval | Code | 1
Few-Shot Class-Incremental Learning via Class-Aware Bilateral Distillation | Code | 1
Data-Free Knowledge Distillation via Feature Exchange and Activation Region Constraint | Code | 1
Revisiting Prototypical Network for Cross Domain Few-Shot Learning | Code | 1
Multi-Level Logit Distillation | Code | 1
Label-Guided Knowledge Distillation for Continual Semantic Segmentation on 2D Images and 3D Point Clouds | Code | 1
Data-Free Class-Incremental Hand Gesture Recognition | Code | 1
Distilling DETR with Visual-Linguistic Knowledge for Open-Vocabulary Object Detection | Code | 1
Discriminator-Cooperated Feature Map Distillation for GAN Compression | Code | 1
Resolving Task Confusion in Dynamic Expansion Architectures for Class Incremental Learning | Code | 1
NeRN -- Learning Neural Representations for Neural Networks | Code | 1
Learning Object-level Point Augmentor for Semi-supervised 3D Object Detection | Code | 1
Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning? | Code | 1
Gradient-based Intra-attention Pruning on Pre-trained Language Models | Code | 1
Towards Practical Plug-and-Play Diffusion Models | Code | 1
Enhancing Low-Density EEG-Based Brain-Computer Interfaces with Similarity-Keeping Knowledge Distillation | Code | 1
FedUKD: Federated UNet Model with Knowledge Distillation for Land Use Classification from Satellite and Street Views | Code | 1
Improving Simultaneous Machine Translation with Monolingual Data | Code | 1
BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for BEV 3D Object Detection | Code | 1
Knowledge Distillation based Degradation Estimation for Blind Super-Resolution | Code | 1
Curriculum Temperature for Knowledge Distillation | Code | 1
Dense Interspecies Face Embedding | Code | 1
Unbiased Knowledge Distillation for Recommendation | Code | 1
Look Around and Refer: 2D Synthetic Semantics Knowledge Distillation for 3D Visual Grounding | Code | 1
XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning | Code | 1
MPCViT: Searching for Accurate and Efficient MPC-Friendly Vision Transformer with Heterogeneous Attention | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | — | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | — | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | — | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | — | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | — | Unverified
6 | VkD (T: RegNety-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | — | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | — | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | — | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | — | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 79.86 | — | Unverified
2 | shufflenet-v2 (T: resnet32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 78.76 | — | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 Accuracy (%) | 78.6 | — | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 78.28 | — | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 Accuracy (%) | 78.08 | — | Unverified
6 | ReviewKD++ (T: resnet32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 77.93 | — | Unverified
7 | ReviewKD++ (T: resnet32x4, S: shufflenet-v1) | Top-1 Accuracy (%) | 77.68 | — | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 77.5 | — | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.68 | — | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.31 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | — | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | — | Unverified