Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.
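The page does not say which training objective is used for this transfer. As a minimal sketch, assuming the widely used soft-target formulation (a temperature-softened KL term between teacher and student outputs, mixed with ordinary cross-entropy on the true label), the loss for a single example could look like the following. The names `log_softmax`, `distillation_loss`, `temperature`, and `alpha` are illustrative choices, not something taken from this page or from any listed paper.

```python
import math


def log_softmax(logits, temperature=1.0):
    """Numerically stable log-softmax of logits divided by a temperature."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    log_sum_exp = m + math.log(sum(math.exp(s - m) for s in scaled))
    return [s - log_sum_exp for s in scaled]


def distillation_loss(student_logits, teacher_logits, label,
                      temperature=4.0, alpha=0.5):
    """Soft-target distillation loss for one example (illustrative sketch).

    alpha weights the teacher-matching term; (1 - alpha) weights the
    cross-entropy on the ground-truth label. Both defaults are assumptions.
    """
    log_p_teacher = log_softmax(teacher_logits, temperature)
    log_p_student = log_softmax(student_logits, temperature)

    # KL(teacher || student) on the temperature-softened distributions.
    kl = sum(math.exp(lt) * (lt - ls)
             for lt, ls in zip(log_p_teacher, log_p_student))

    # Standard cross-entropy on the hard label, at temperature 1.
    ce = -log_softmax(student_logits)[label]

    # The T^2 factor keeps the soft-term gradients on a comparable scale
    # when the temperature changes.
    return alpha * temperature ** 2 * kl + (1.0 - alpha) * ce


if __name__ == "__main__":
    teacher = [2.0, 1.0, 0.1]   # hypothetical teacher logits
    student = [1.5, 0.8, 0.3]   # hypothetical student logits
    print(distillation_loss(student, teacher, label=0))
```

When the student reproduces the teacher's logits exactly, the KL term vanishes and only the label cross-entropy remains; a higher `temperature` exposes more of the teacher's relative preferences among the non-target classes.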

Papers


Showing 551–600 of 4240 papers

Title	Date	Tasks	Status	Hype
Generative Adversarial Super-Resolution at the Edge with Knowledge Distillation	Sep 7, 2022	CPU, Generative Adversarial Network	Code Available	1
Domain Generalization for Prostate Segmentation in Transrectal Ultrasound Images: A Multi-center Study	Sep 5, 2022	Domain Adaptation, Domain Generalization	Code Available	1
A New Knowledge Distillation Network for Incremental Few-Shot Surface Defect Detection	Sep 1, 2022	Defect Detection, Knowledge Distillation	Code Available	1
Membership Inference Attacks by Exploiting Loss Trajectory	Aug 31, 2022	Knowledge Distillation	Code Available	1
Progressive Self-Distillation for Ground-to-Aerial Perception Knowledge Transfer	Aug 29, 2022	Autonomous Driving, Knowledge Distillation	Code Available	1
Disentangle and Remerge: Interventional Knowledge Distillation for Few-Shot Object Detection from A Conditional Causal Perspective	Aug 26, 2022	Few-Shot Learning, Few-Shot Object Detection	Code Available	1
CMD: Self-supervised 3D Action Representation Learning with Cross-modal Mutual Distillation	Aug 26, 2022	3D Action Recognition, Action Recognition	Code Available	1
Masked Autoencoders Enable Efficient Knowledge Distillers	Aug 25, 2022	Knowledge Distillation	Code Available	1
Semi-supervised Semantic Segmentation with Mutual Knowledge Distillation	Aug 24, 2022	Diversity, Knowledge Distillation	Code Available	1
PANDA: Prompt Transfer Meets Knowledge Distillation for Efficient Model Adaptation	Aug 22, 2022	General Knowledge, Knowledge Distillation	Code Available	1
LTE4G: Long-Tail Experts for Graph Neural Networks	Aug 22, 2022	Knowledge Distillation, Node Classification	Code Available	1
Multi-Granularity Distillation Scheme Towards Lightweight Semi-Supervised Semantic Segmentation	Aug 22, 2022	Knowledge Distillation, Semantic Segmentation	Code Available	1
A semi-supervised Teacher-Student framework for surgical tool detection and localization	Aug 21, 2022	Knowledge Distillation, Pseudo Label	Code Available	1
Mind the Gap in Distilling StyleGANs	Aug 18, 2022	Knowledge Distillation	Code Available	1
PA-Seg: Learning from Point Annotations for 3D Medical Image Segmentation using Contextual Regularization and Cross Knowledge Distillation	Aug 11, 2022	Brain Tumor Segmentation, Image Segmentation	Code Available	1
MixSKD: Self-Knowledge Distillation from Mixup for Image Recognition	Aug 11, 2022	Data Augmentation, image-classification	Code Available	1
Distributional Correlation-Aware Knowledge Distillation for Stock Trading Volume Prediction	Aug 4, 2022	Knowledge Distillation, Prediction	Code Available	1
KD-SCFNet: Towards More Accurate and Efficient Salient Object Detection via Knowledge Distillation	Aug 3, 2022	Knowledge Distillation, object-detection	Code Available	1
Generative Bias for Robust Visual Question Answering	Aug 1, 2022	Knowledge Distillation, Question Answering	Code Available	1
Aggretriever: A Simple Approach to Aggregate Textual Representations for Robust Dense Passage Retrieval	Jul 31, 2022	Knowledge Distillation, Language Modeling	Code Available	1
Chinese grammatical error correction based on knowledge distillation	Jul 31, 2022	Grammatical Error Correction, Knowledge Distillation	Code Available	1
Meta-Learning based Degradation Representation for Blind Super-Resolution	Jul 28, 2022	Blind Super-Resolution, Knowledge Distillation	Code Available	1
Black-box Few-shot Knowledge Distillation	Jul 25, 2022	image-classification, Image Classification	Code Available	1
Online Knowledge Distillation via Mutual Contrastive Learning for Visual Recognition	Jul 23, 2022	Contrastive Learning, image-classification	Code Available	1
Hyper-Representations for Pre-Training and Transfer Learning	Jul 22, 2022	Knowledge Distillation, Neural Architecture Search	Code Available	1
KD-MVS: Knowledge Distillation Based Self-supervised Learning for Multi-view Stereo	Jul 21, 2022	Knowledge Distillation, Self-Supervised Learning	Code Available	1
Informative knowledge distillation for image anomaly segmentation	Jul 19, 2022	Anomaly Detection, Anomaly Segmentation	Code Available	1
FedX: Unsupervised Federated Learning with Cross Knowledge Distillation	Jul 19, 2022	Contrastive Learning, Federated Learning	Code Available	1
Class-incremental Novel Class Discovery	Jul 18, 2022	Incremental Learning, Knowledge Distillation	Code Available	1
Rethinking Data Augmentation for Robust Visual Question Answering	Jul 18, 2022	Data Augmentation, Knowledge Distillation	Code Available	1
Multi-Level Branched Regularization for Federated Learning	Jul 14, 2022	Federated Learning, Knowledge Distillation	Code Available	1
Large-scale Knowledge Distillation with Elastic Heterogeneous Computing Resources	Jul 14, 2022	Knowledge Distillation	Code Available	1
Re2G: Retrieve, Rerank, Generate	Jul 13, 2022	Fact Checking, Fact Verification	Code Available	1
Contrastive Deep Supervision	Jul 12, 2022	Contrastive Learning, Fine-Grained Image Classification	Code Available	1
Knowledge Condensation Distillation	Jul 12, 2022	Knowledge Distillation	Code Available	1
HEAD: HEtero-Assists Distillation for Heterogeneous Object Detectors	Jul 12, 2022	Knowledge Distillation, Object	Code Available	1
Fast-Vid2Vid: Spatial-Temporal Compression for Video-to-Video Synthesis	Jul 11, 2022	GPU, Knowledge Distillation	Code Available	1
FairDistillation: Mitigating Stereotyping in Language Models	Jul 10, 2022	Knowledge Distillation	Code Available	1
Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer	Jul 5, 2022	Image-text matching, Knowledge Distillation	Code Available	1
FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning	Jul 1, 2022	Knowledge Distillation, Phoneme Recognition	Code Available	1
Revisiting Label Smoothing and Knowledge Distillation Compatibility: What was Missing?	Jun 29, 2022	image-classification, Image Classification	Code Available	1
The Modality Focusing Hypothesis: Towards Understanding Crossmodal Knowledge Distillation	Jun 13, 2022	Knowledge Distillation, Transfer Learning	Code Available	1
itKD: Interchange Transfer-based Knowledge Distillation for 3D Object Detection	May 31, 2022	3D Object Detection, Cloud Detection	Code Available	1
Towards Efficient 3D Object Detection with Knowledge Distillation	May 30, 2022	3D Object Detection, Knowledge Distillation	Code Available	1
RLx2: Training a Sparse Deep Reinforcement Learning Model from Scratch	May 30, 2022	Continuous Control, Deep Reinforcement Learning	Code Available	1
Divide to Adapt: Mitigating Confirmation Bias for Domain Adaptation of Black-Box Predictors	May 28, 2022	Domain Adaptation, Knowledge Distillation	Code Available	1
Geometer: Graph Few-Shot Class-Incremental Learning via Prototype Representation	May 27, 2022	class-incremental learning, Class Incremental Learning	Code Available	1
Continual evaluation for lifelong learning: Identifying the stability gap	May 26, 2022	Continual Learning, Incremental Learning	Code Available	1
Compressing Deep Graph Neural Networks via Adversarial Knowledge Distillation	May 24, 2022	Graph Classification, Knowledge Distillation	Code Available	1
Optimizing Performance of Federated Person Re-identification: Benchmarking and Analysis	May 24, 2022	Benchmarking, Federated Learning	Code Available	1


Datasets: ImageNet, CIFAR-100, COCO (Common Objects in Context), COCO 2017 val, PASCAL VOC, KITTI

Benchmark Results

ImageNet
#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

CIFAR-100
#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

COCO (Common Objects in Context)
#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

COCO 2017 val
#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

PASCAL VOC
#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

KITTI
#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified