
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized; a compact student model trained to mimic the large teacher can therefore often retain much of its accuracy at a fraction of the inference cost.
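
To make the mechanics concrete, below is a minimal sketch of the classic soft-target distillation loss of Hinton et al. (2015) in PyTorch. The temperature T and mixing weight alpha are illustrative hyperparameters, not values drawn from any paper listed on this page.

    import torch
    import torch.nn.functional as F

    def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
        """Soft-target knowledge distillation loss (Hinton et al., 2015)."""
        # Soften both output distributions with temperature T.
        soft_teacher = F.softmax(teacher_logits / T, dim=-1)
        log_soft_student = F.log_softmax(student_logits / T, dim=-1)

        # KL term between teacher and student, scaled by T^2 so gradient
        # magnitudes stay comparable as T changes, per the original paper.
        kd_term = F.kl_div(log_soft_student, soft_teacher,
                           reduction="batchmean") * (T * T)

        # Standard cross-entropy against the ground-truth labels.
        ce_term = F.cross_entropy(student_logits, labels)

        return alpha * kd_term + (1.0 - alpha) * ce_term

In practice the teacher runs in eval mode with gradients disabled, and only the student's parameters are updated through this loss.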

Papers

Showing 3201–3250 of 4240 papers

Title | Status | Hype
Domain Adaptation for Dense Retrieval through Self-Supervision by Pseudo-Relevance Labeling | - | 0
Domain Adaptive Hand Keypoint and Pixel Localization in the Wild | - | 0
Domain-Agnostic Clustering with Self-Distillation | - | 0
Domain Discrepancy Aware Distillation for Model Aggregation in Federated Learning | - | 0
Domain Generalization on Efficient Acoustic Scene Classification using Residual Normalization | - | 0
Domain-invariant Feature Exploration for Domain Generalization | - | 0
Domain-invariant Progressive Knowledge Distillation for UAV-based Object Detection | - | 0
Domain Knowledge Distillation from Large Language Model: An Empirical Study in the Autonomous Driving Domain | - | 0
Domain-specific knowledge distillation yields smaller and better models for conversational commerce | - | 0
Domain-Specific Translation with Open-Source Large Language Models: Resource-Oriented Analysis | - | 0
DONNAv2 -- Lightweight Neural Architecture Search for Vision tasks | - | 0
Do Not Blindly Imitate the Teacher: Using Perturbed Loss for Knowledge Distillation | - | 0
Do Not Forget to Attend to Uncertainty while Mitigating Catastrophic Forgetting | - | 0
Don't be picky, all students in the right family can learn from good teachers | - | 0
Don't Throw Away Data: Better Sequence Knowledge Distillation | - | 0
DONUT-hole: DONUT Sparsification by Harnessing Knowledge and Optimizing Learning Efficiency | - | 0
Doodle It Yourself: Class Incremental Learning by Drawing a Few Sketches | - | 0
Double Reverse Regularization Network Based on Self-Knowledge Distillation for SAR Object Classification | - | 0
Double Similarity Distillation for Semantic Image Segmentation | - | 0
Do We Really Need a Complex Agent System? Distill Embodied Agent into a Single Model | - | 0
DreamTeacher: Pretraining Image Backbones with Deep Generative Models | - | 0
DRKF: Distilled Rotated Kernel Fusion for Efficient Rotation Invariant Descriptors in Local Feature Matching | - | 0
DS3-Net: Difficulty-perceived Common-to-T1ce Semi-Supervised Multimodal MRI Synthesis Network | - | 0
DSFormer: Effective Compression of Text-Transformers by Dense-Sparse Weight Factorization | - | 0
DSPNet: Towards Slimmable Pretrained Networks based on Discriminative Self-supervised Learning | - | 0
DST: Dynamic Substitute Training for Data-free Black-box Attack | - | 0
DS-ViT: Dual-Stream Vision Transformer for Cross-Task Distillation in Alzheimer's Early Diagnosis | - | 0
DTCM: Deep Transformer Capsule Mutual Distillation for Multivariate Time Series Classification | - | 0
Dual Discriminator Adversarial Distillation for Data-free Model Compression | - | 0
Dual Embodied-Symbolic Concept Representations for Deep Learning | - | 0
Dual-Head Knowledge Distillation: Enhancing Logits Utilization with an Auxiliary Head | - | 0
Dual Knowledge Distillation for Efficient Sound Event Detection | - | 0
Dual-Modeling Decouple Distillation for Unsupervised Anomaly Detection | - | 0
Dual Scale-aware Adaptive Masked Knowledge Distillation for Object Detection | - | 0
Dual-Student Knowledge Distillation Networks for Unsupervised Anomaly Detection | - | 0
Dual-Teacher Class-Incremental Learning With Data-Free Generative Replay | - | 0
Dual-Teacher: Integrating Intra-domain and Inter-domain Teachers for Annotation-efficient Cardiac Segmentation | - | 0
Dual Teacher Knowledge Distillation with Domain Alignment for Face Anti-spoofing | - | 0
DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion | - | 0
DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding | - | 0
DuckSegmentation: A segmentation model based on the AnYue Hemp Duck Dataset | - | 0
DVFL: A Vertical Federated Learning Method for Dynamic Data | - | 0
DyLiN: Making Light Field Networks Dynamic | - | 0
Dynamic Activation with Knowledge Distillation for Energy-Efficient Spiking NN Ensembles | - | 0
Dynamically pruning segformer for efficient semantic segmentation | - | 0
DynamicKD: An Effective Knowledge Distillation via Dynamic Entropy Correction-Based Distillation for Gap Optimizing | - | 0
Dynamic Knowledge Distillation for Black-box Hypothesis Transfer Learning | - | 0
Dynamic Knowledge Distillation With Noise Elimination for RGB-D Salient Object Detection | - | 0
Dynamic Low-Resolution Distillation for Cost-Efficient End-to-End Text Spotting | - | 0
Dynamics-Adaptive Continual Reinforcement Learning via Progressive Contextualization | - | 0
Page 65 of 85

Benchmark Results

Each entry below names a distillation method together with its teacher (T:) and student (S:) architectures. The Claimed column is the number reported by the authors; the Verified column is empty for results that have not yet been independently reproduced (Status: Unverified).

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | - | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | - | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | - | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | - | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | - | Unverified
6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | - | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | - | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | - | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | - | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | - | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | - | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | - | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | - | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | - | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | - | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | - | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | - | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | - | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | - | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | - | Unverified
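
For context on the metrics above, here is a minimal sketch of how a claimed Top-1 accuracy figure would typically be re-computed during verification, assuming a PyTorch model and a validation DataLoader; the function name and arguments are illustrative, not part of any listed method.

    import torch

    @torch.no_grad()
    def top1_accuracy(model, loader, device="cpu"):
        # Fraction of samples whose highest-scoring class matches the label.
        model.eval()
        correct, total = 0, 0
        for images, labels in loader:
            images, labels = images.to(device), labels.to(device)
            preds = model(images).argmax(dim=-1)
            correct += (preds == labels).sum().item()
            total += labels.numel()
        return 100.0 * correct / total

A claim such as 82.3 Top-1 is then checked by running the released student checkpoint through this kind of loop on the benchmark's validation split.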