Knowledge Distillation

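Knowledge distillation is the process of transferring knowledge from a large model (the teacher, abbreviated T in the benchmark tables) to a smaller one (the student, abbreviated S). Large models, such as very deep neural networks or ensembles of many models, have higher knowledge capacity than small models, but this capacity might not be fully utilized, so a smaller model can often be trained to reproduce much of the large model's behavior.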
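As an illustration only, here is a minimal sketch of one common way to set up such a transfer: the small model is trained to match the large model's temperature-softened output distribution in addition to the true label. The soft-target formulation, the function names (`softmax`, `distillation_loss`), and the default `temperature` and `alpha` values are assumptions made for this example. They are not taken from this page or from any specific paper listed on it.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax; a higher temperature gives a softer distribution."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, label,
                      temperature=4.0, alpha=0.5):
    """Weighted sum of a soft-target term and a hard-label term (illustrative).

    temperature and alpha are hypothetical defaults chosen for this sketch.
    """
    # Soft-target term: KL(teacher || student) on softened distributions,
    # scaled by temperature**2 so its gradient magnitude stays comparable.
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    kl = sum(pt * math.log(pt / ps)
             for pt, ps in zip(p_teacher, p_student) if pt > 0)
    soft_term = temperature ** 2 * kl

    # Hard-label term: ordinary cross-entropy at temperature 1.
    hard_term = -math.log(softmax(student_logits)[label])

    return alpha * soft_term + (1 - alpha) * hard_term


if __name__ == "__main__":
    teacher = [8.0, 2.0, 0.5]          # made-up logits for a 3-class problem
    close_student = [6.0, 2.0, 1.0]    # roughly agrees with the teacher
    far_student = [0.5, 2.0, 8.0]      # disagrees with the teacher
    print(distillation_loss(close_student, teacher, label=0))
    print(distillation_loss(far_student, teacher, label=0))
```

In this sketch a student whose logits agree with the teacher's receives a smaller loss than one that disagrees. The `alpha` weight controls how much the student follows the teacher versus the ground-truth label.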

Papers

Showing 401–450 of 4240 papers

Title	Date	Tasks	Status	Hype
Decoupled Kullback-Leibler Divergence Loss	May 23, 2023	Adversarial Defense, Adversarial Robustness	Code Available	1
Is Synthetic Data From Diffusion Models Ready for Knowledge Distillation?	May 22, 2023	Data-free Knowledge Distillation, Few-Shot Learning	Code Available	1
DisCo: Distilled Student Models Co-training for Semi-supervised Text Mining	May 20, 2023	Extractive Summarization, Knowledge Distillation	Code Available	1
Lifting the Curse of Capacity Gap in Distilling Language Models	May 20, 2023	Knowledge Distillation, Mixture-of-Experts	Code Available	1
Cross-modality Data Augmentation for End-to-End Sign Language Translation	May 18, 2023	Data Augmentation, Knowledge Distillation	Code Available	1
AD-KD: Attribution-Driven Knowledge Distillation for Language Model Compression	May 17, 2023	Knowledge Distillation, Language Modeling	Code Available	1
Tailoring Instructions to Student's Learning Levels Boosts Knowledge Distillation	May 16, 2023	Knowledge Distillation, text-classification	Code Available	1
Bridging the Domain Gap: Self-Supervised 3D Scene Understanding with Foundation Models	May 15, 2023	3D Object Detection, Image Captioning	Code Available	1
Serial Contrastive Knowledge Distillation for Continual Few-shot Relation Extraction	May 11, 2023	Contrastive Learning, Knowledge Distillation	Code Available	1
Improving Continual Relation Extraction by Distinguishing Analogous Semantics	May 11, 2023	Continual Relation Extraction, Knowledge Distillation	Code Available	1
FedNoRo: Towards Noise-Robust Federated Learning by Addressing Class Imbalance and Label Noise Heterogeneity	May 9, 2023	Federated Learning, Knowledge Distillation	Code Available	1
SUR-adapter: Enhancing Text-to-Image Pre-trained Diffusion Models with Large Language Models	May 9, 2023	Image Generation, Knowledge Distillation	Code Available	1
Distilling Script Knowledge from Large Language Models for Constrained Language Planning	May 9, 2023	Knowledge Distillation	Code Available	1
Avatar Knowledge Distillation: Self-ensemble Teacher Paradigm with Uncertainty	May 4, 2023	Knowledge Distillation, object-detection	Code Available	1
SCOTT: Self-Consistent Chain-of-Thought Distillation	May 3, 2023	counterfactual, Counterfactual Reasoning	Code Available	1
DeepAqua: Self-Supervised Semantic Segmentation of Wetland Surface Water Extent with SAR Images using Knowledge Distillation	May 2, 2023	Knowledge Distillation, Semantic Segmentation	Code Available	1
A Symmetric Dual Encoding Dense Retrieval Framework for Knowledge-Intensive Visual Question Answering	Apr 26, 2023	Decoder, Knowledge Distillation	Code Available	1
Class Attention Transfer Based Knowledge Distillation	Apr 25, 2023	Knowledge Distillation, Model Compression	Code Available	1
Knowledge Distillation from 3D to Bird's-Eye-View for LiDAR Semantic Segmentation	Apr 22, 2023	Autonomous Driving, Knowledge Distillation	Code Available	1
Train Your Own GNN Teacher: Graph-Aware Distillation on Textual Graphs	Apr 20, 2023	Knowledge Distillation, Node Classification	Code Available	1
Attention Weighted Local Descriptors	Apr 19, 2023	3D Reconstruction, Homography Estimation	Code Available	1
OVTrack: Open-Vocabulary Multiple Object Tracking	Apr 17, 2023	Denoising, Hallucination	Code Available	1
Robust Cross-Modal Knowledge Distillation for Unconstrained Videos	Apr 16, 2023	Action Recognition, Audio Tagging	Code Available	1
Multi-Mode Online Knowledge Distillation for Self-Supervised Visual Representation Learning	Apr 13, 2023	Knowledge Distillation, Representation Learning	Code Available	1
Continual Learning for LiDAR Semantic Segmentation: Class-Incremental and Coarse-to-Fine strategies on Sparse Data	Apr 8, 2023	class-incremental learning, Class Incremental Learning	Code Available	1
DiGA: Distil to Generalize and then Adapt for Domain Adaptive Semantic Segmentation	Apr 5, 2023	Data Augmentation, Knowledge Distillation	Code Available	1
Selective Knowledge Sharing for Privacy-Preserving Federated Distillation without A Good Teacher	Apr 4, 2023	Federated Learning, Knowledge Distillation	Code Available	1
Knowledge Distillation for Feature Extraction in Underwater VSLAM	Mar 31, 2023	Binarization, Knowledge Distillation	Code Available	1
Kaizen: Practical Self-supervised Continual Learning with Continual Fine-tuning	Mar 30, 2023	Continual Learning, Knowledge Distillation	Code Available	1
SimDistill: Simulated Multi-modal Distillation for BEV 3D Object Detection	Mar 29, 2023	3D geometry, 3D Object Detection	Code Available	1
DisWOT: Student Architecture Search for Distillation WithOut Training	Mar 28, 2023	Knowledge Distillation	Code Available	1
HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models	Mar 28, 2023	Decoder, Human-Object Interaction Detection	Code Available	1
Dice Semimetric Losses: Optimizing the Dice Score with Soft Labels	Mar 28, 2023	Knowledge Distillation	Code Available	1
UniDistill: A Universal Cross-Modality Knowledge Distillation Framework for 3D Object Detection in Bird's-Eye View	Mar 27, 2023	3D Object Detection, Autonomous Driving	Code Available	1
Preserving Linear Separability in Continual Learning by Backward Feature Projection	Mar 26, 2023	Continual Learning, Knowledge Distillation	Code Available	1
Supervised Masked Knowledge Distillation for Few-Shot Transformers	Mar 25, 2023	Few-Shot Learning, Inductive Bias	Code Available	1
CCL: Continual Contrastive Learning for LiDAR Place Recognition	Mar 24, 2023	Autonomous Driving, Continual Learning	Code Available	1
Decoupled Multimodal Distilling for Emotion Recognition	Mar 24, 2023	Emotion Recognition, Knowledge Distillation	Code Available	1
Understanding the Role of the Projector in Knowledge Distillation	Mar 20, 2023	image-classification, Image Classification	Code Available	1
AdaptGuard: Defending Against Universal Attacks for Model Adaptation	Mar 19, 2023	Knowledge Distillation, model	Code Available	1
Channel-Aware Distillation Transformer for Depth Estimation on Nano Drones	Mar 18, 2023	Autonomous Navigation, Depth Estimation	Code Available	1
Prototype Knowledge Distillation for Medical Segmentation with Missing Modality	Mar 17, 2023	Image Segmentation, Knowledge Distillation	Code Available	1
TeSLA: Test-Time Self-Learning With Automatic Adversarial Augmentation	Mar 17, 2023	Knowledge Distillation, Self-Learning	Code Available	1
Global Knowledge Calibration for Fast Open-Vocabulary Segmentation	Mar 16, 2023	Knowledge Distillation, Open Vocabulary Semantic Segmentation	Code Available	1
Action knowledge for video captioning with graph neural networks	Mar 16, 2023	Action Recognition, Graph Neural Network	Code Available	1
DualFair: Fair Representation Learning at Both Group and Individual Levels via Contrastive Self-supervision	Mar 15, 2023	counterfactual, Fairness	Code Available	1
Graph-less Collaborative Filtering	Mar 15, 2023	Collaborative Filtering, Contrastive Learning	Code Available	1
Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Reinforcement	Mar 15, 2023	Data Augmentation, Knowledge Distillation	Code Available	1
SCPNet: Semantic Scene Completion on Point Cloud	Mar 13, 2023	3D Semantic Scene Completion, Knowledge Distillation	Code Available	1
Extending global-local view alignment for self-supervised learning with remote sensing imagery	Mar 12, 2023	Change Detection, Contrastive Learning	Code Available	1

Datasets: ImageNet, CIFAR-100, COCO (Common Objects in Context), COCO 2017 val, PASCAL VOC, KITTI

Benchmark Results

Dataset: ImageNet
#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

Dataset: CIFAR-100
#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

Dataset: COCO (Common Objects in Context)
#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

Dataset: COCO 2017 val
#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

Dataset: PASCAL VOC
#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

Dataset: KITTI
#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified