SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized. Distillation exploits this gap by training the small "student" model to reproduce the behaviour of the large "teacher" model, typically by matching its output distribution, so that much of the teacher's accuracy can be retained at a fraction of the inference cost.
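
As a concrete illustration (a minimal sketch, not the method of any particular paper listed below), the snippet shows the classic response-based distillation loss of Hinton et al. (2015) in PyTorch: the student is trained to match the teacher's temperature-softened output distribution in addition to the ground-truth labels. The function name and the default temperature/weighting values are illustrative assumptions.

```python
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Response-based KD loss: KL divergence between temperature-softened
    teacher and student distributions, mixed with ordinary cross-entropy
    on the ground-truth labels. T and alpha are illustrative defaults."""
    # kl_div expects log-probabilities as input and probabilities as target;
    # the T**2 factor keeps gradient magnitudes comparable across temperatures.
    soft_student = F.log_softmax(student_logits / T, dim=-1)
    soft_teacher = F.softmax(teacher_logits / T, dim=-1)
    kd = F.kl_div(soft_student, soft_teacher, reduction="batchmean") * (T ** 2)
    ce = F.cross_entropy(student_logits, labels)
    return alpha * kd + (1.0 - alpha) * ce
```

In a typical training step the teacher runs in inference mode, e.g. `loss = distillation_loss(student(x), teacher(x).detach(), y)`, so gradients flow only through the student.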

Papers

Showing 751–800 of 4240 papers

Title | Status | Hype
EvDistill: Asynchronous Events to End-task Learning via Bidirectional Reconstruction-guided Cross-modal Knowledge Distillation | Code | 1
Improving Knowledge Distillation via Category Structure | Code | 1
Initialization and Regularization of Factorized Neural Layers | Code | 1
CTC-based Non-autoregressive Textless Speech-to-Speech Translation | Code | 1
AIM 2024 Challenge on UHD Blind Photo Quality Assessment | Code | 1
Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention | Code | 1
Evolving Search Space for Neural Architecture Search | Code | 1
Contrastive Deep Supervision | Code | 1
Contrastive Distillation on Intermediate Representations for Language Model Compression | Code | 1
Exploring Complementary Strengths of Invariant and Equivariant Representations for Few-Shot Learning | Code | 1
Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning? | Code | 1
Contrastive Model Inversion for Data-Free Knowledge Distillation | Code | 1
Contrastive Representation Distillation | Code | 1
AutoGAN-Distiller: Searching to Compress Generative Adversarial Networks | Code | 1
Exploring Inter-Channel Correlation for Diversity-preserved Knowledge Distillation | Code | 1
Exploring Inter-Channel Correlation for Diversity-Preserved Knowledge Distillation | Code | 1
AdaptGuard: Defending Against Universal Attacks for Model Adaptation | Code | 1
The Augmented Image Prior: Distilling 1000 Classes by Extrapolating from a Single Image | Code | 1
FitNets: Hints for Thin Deep Nets | Code | 1
Generic-to-Specific Distillation of Masked Autoencoders | Code | 1
FairDistillation: Mitigating Stereotyping in Language Models | Code | 1
CaMEL: Mean Teacher Learning for Image Captioning | Code | 1
Improve Object Detection with Feature-based Knowledge Distillation: Towards Accurate and Efficient Detectors | Code | 1
CaKDP: Category-aware Knowledge Distillation and Pruning Framework for Lightweight 3D Object Detection | Code | 1
FAMIE: A Fast Active Learning Framework for Multilingual Information Extraction | Code | 1
Fast Neural Architecture Search of Compact Semantic Segmentation Models via Auxiliary Cells | Code | 1
One Step Diffusion-based Super-Resolution with Time-Aware Distillation | Code | 1
One-Step Knowledge Distillation and Fine-Tuning in Using Large Pre-Trained Self-Supervised Learning Models for Speaker Verification | Code | 1
Cumulative Spatial Knowledge Distillation for Vision Transformers | Code | 1
FastFormers: Highly Efficient Transformer Models for Natural Language Understanding | Code | 1
Faster ILOD: Incremental Learning for Object Detectors based on Faster RCNN | Code | 1
Online Knowledge Distillation for Efficient Pose Estimation | Code | 1
Improved Techniques for Training Adaptive Deep Networks | Code | 1
Fast-Vid2Vid: Spatial-Temporal Compression for Video-to-Video Synthesis | Code | 1
FDCNet: Feature Drift Compensation Network for Class-Incremental Weakly Supervised Object Localization | Code | 1
FCS: Feature Calibration and Separation for Non-Exemplar Class Incremental Learning | Code | 1
Avatar Knowledge Distillation: Self-ensemble Teacher Paradigm with Uncertainty | Code | 1
On Representation Knowledge Distillation for Graph Neural Networks | Code | 1
A Knowledge Distillation Framework For Enhancing Ear-EEG Based Sleep Staging With Scalp-EEG Data | Code | 1
Improving Continual Relation Extraction by Distinguishing Analogous Semantics | Code | 1
Implicit Chain of Thought Reasoning via Knowledge Distillation | Code | 1
FedACK: Federated Adversarial Contrastive Knowledge Distillation for Cross-Lingual and Cross-Model Social Bot Detection | Code | 1
FedDAT: An Approach for Foundation Model Finetuning in Multi-Modal Heterogeneous Federated Learning | Code | 1
FedCL: Federated Multi-Phase Curriculum Learning to Synchronously Correlate User Heterogeneity | Code | 1
Curriculum Learning for Dense Retrieval Distillation | Code | 1
Creating Something from Nothing: Unsupervised Knowledge Distillation for Cross-Modal Hashing | Code | 1
Federated Knowledge Distillation | Code | 1
KNOT: Knowledge Distillation using Optimal Transport for Solving NLP Tasks | Code | 1
FedMD: Heterogenous Federated Learning via Model Distillation | Code | 1
Improve Cross-Architecture Generalization on Dataset Distillation | Code | 1
Page 16 of 85

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T:BEiT-L S:ViT-B/14) | Top-1 accuracy (%) | 86.43 | - | Unverified
2 | ScaleKD (T:Swin-L S:ViT-B/16) | Top-1 accuracy (%) | 85.53 | - | Unverified
3 | ScaleKD (T:Swin-L S:ViT-S/16) | Top-1 accuracy (%) | 83.93 | - | Unverified
4 | ScaleKD (T:Swin-L S:Swin-T) | Top-1 accuracy (%) | 83.8 | - | Unverified
5 | KD++ (T:regnety-16GF S:ViT-B) | Top-1 accuracy (%) | 83.6 | - | Unverified
6 | VkD (T:RegNety 160 S:DeiT-S) | Top-1 accuracy (%) | 82.9 | - | Unverified
7 | SpectralKD (T:Swin-S S:Swin-T) | Top-1 accuracy (%) | 82.7 | - | Unverified
8 | ScaleKD (T:Swin-L S:ResNet-50) | Top-1 accuracy (%) | 82.55 | - | Unverified
9 | DiffKD (T:Swin-L S:Swin-T) | Top-1 accuracy (%) | 82.5 | - | Unverified
10 | DIST (T:Swin-L S:Swin-T) | Top-1 accuracy (%) | 82.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T:resnet-32x4 S:shufflenet-v2) | Top-1 accuracy (%) | 79.86 | - | Unverified
2 | shufflenet-v2 (T:resnet-32x4 S:shufflenet-v2) | Top-1 accuracy (%) | 78.76 | - | Unverified
3 | MV-MR (T:CLIP/ViT-B-16 S:resnet50) | Top-1 accuracy (%) | 78.6 | - | Unverified
4 | resnet8x4 (T:resnet32x4 S:resnet8x4) | Top-1 accuracy (%) | 78.28 | - | Unverified
5 | resnet8x4 (T:resnet32x4 S:resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | - | Unverified
6 | ReviewKD++ (T:resnet-32x4 S:shufflenet-v2) | Top-1 accuracy (%) | 77.93 | - | Unverified
7 | ReviewKD++ (T:resnet-32x4 S:shufflenet-v1) | Top-1 accuracy (%) | 77.68 | - | Unverified
8 | resnet8x4 (T:resnet32x4 S:resnet8x4) | Top-1 accuracy (%) | 77.5 | - | Unverified
9 | resnet8x4 (T:resnet32x4 S:resnet8x4) | Top-1 accuracy (%) | 76.68 | - | Unverified
10 | resnet8x4 (T:resnet32x4 S:resnet8x4) | Top-1 accuracy (%) | 76.31 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T:ResNet101 S:ResNet50) | mAP | 93.17 | - | Unverified
2 | LSHFM (T:ResNet101 S:MobileNetV2) | mAP | 90.14 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T:Adabins S:MobileNetV2) | RMSE | 2.43 | - | Unverified