SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity is often not fully utilized, so a compact student trained to mimic the large teacher can retain much of its accuracy at a fraction of the inference cost.
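
The benchmark tables further down compare distillation methods by the accuracy the student reaches, but the page itself does not show how the transfer is trained. As a minimal sketch, the classic soft-target formulation of Hinton et al. (2015) trains the student against a temperature-softened copy of the teacher's output distribution alongside the usual hard labels; the temperature, weighting factor, and tensor shapes below are illustrative assumptions, not values taken from any paper listed here.

```python
# Minimal sketch of the classic soft-target distillation loss
# (Hinton et al., 2015). T, alpha, and the batch/class sizes are
# illustrative assumptions, not values from this page.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Combine softened teacher targets with the usual hard-label loss."""
    # Soft targets: KL divergence between temperature-scaled distributions.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)  # rescale so soft-target gradients match the hard-label scale
    # Hard targets: standard cross-entropy against ground-truth labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard

# Usage: logits from a frozen teacher and a trainable student on one batch.
teacher_logits = torch.randn(8, 100)                       # e.g. 100-class output
student_logits = torch.randn(8, 100, requires_grad=True)
labels = torch.randint(0, 100, (8,))
loss = distillation_loss(student_logits, teacher_logits, labels)
loss.backward()
```

The T * T factor follows the original paper: it keeps the gradient magnitude of the soft-target term comparable to the hard-label term when the temperature is large.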

Papers

Showing 1251–1300 of 4240 papers

Title | Status | Hype
UB-FineNet: Urban Building Fine-grained Classification Network for Open-access Satellite Images | | 0
PowerSkel: A Device-Free Framework Using CSI Signal for Human Skeleton Estimation in Power Station | Code | 0
A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement | | 0
Align-to-Distill: Trainable Attention Alignment for Knowledge Distillation in Neural Machine Translation | Code | 0
Logit Standardization in Knowledge Distillation | Code | 3
Hyperspectral Image Analysis in Single-Modal and Multimodal setting using Deep Learning Techniques | | 0
On the Road to Portability: Compressing End-to-End Motion Planner for Autonomous Driving | Code | 2
Teaching MLP More Graph Information: A Three-stage Multitask Knowledge Distillation Framework | | 0
Distilling Text Style Transfer With Self-Explanation From LLMs | | 0
Differentially Private Knowledge Distillation via Synthetic Text Generation | Code | 0
Data-efficient Event Camera Pre-training via Disentangled Masked Modeling | | 0
Direct Alignment of Draft Model for Speculative Decoding with Chat-Fine-Tuned LLMs | | 0
A Cognitive-Based Trajectory Prediction Approach for Autonomous Driving | Code | 2
Weakly Supervised Monocular 3D Detection with a Single-View Image | | 0
MIKO: Multimodal Intention Knowledge Distillation from Large Language Models for Social-Media Commonsense Discovery | | 0
A Lightweight Low-Light Image Enhancement Network via Channel Prior and Gamma Correction | | 0
3MVRD: Multimodal Multi-task Multi-teacher Visually-Rich Form Document Understanding | Code | 0
Sunshine to Rainstorm: Cross-Weather Knowledge Distillation for Robust 3D Object Detection | Code | 1
Gradient Reweighting: Towards Imbalanced Class-Incremental Learning | | 0
Sinkhorn Distance Minimization for Knowledge Distillation | Code | 2
PromptMM: Multi-Modal Knowledge Distillation for Recommendation with Prompt-Tuning | Code | 2
Structural Teacher-Student Normality Learning for Multi-Class Anomaly Detection and Localization | | 0
SDDGR: Stable Diffusion-based Deep Generative Replay for Class Incremental Object Detection | | 0
MCF-VC: Mitigate Catastrophic Forgetting in Class-Incremental Learning for Multimodal Video Captioning | | 0
DTCM: Deep Transformer Capsule Mutual Distillation for Multivariate Time Series Classification | | 0
m2mKD: Module-to-Module Knowledge Distillation for Modular Transformers | Code | 0
SKILL: Similarity-aware Knowledge distILLation for Speech Self-Supervised Learning | | 0
LLM-based Privacy Data Augmentation Guided by Knowledge Distillation with a Distribution Tutor for Medical Text Classification | | 0
LLM Inference Unveiled: Survey and Roofline Model Insights | Code | 4
Distilling Adversarial Robustness Using Heterogeneous Teachers | | 0
Practical Insights into Knowledge Distillation for Pre-Trained Models | | 0
TIE-KD: Teacher-Independent and Explainable Knowledge Distillation for Monocular Depth Estimation | Code | 0
Rethinking Invariance Regularization in Adversarial Training to Improve Robustness-Accuracy Trade-off | | 0
Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic | | 0
PaCKD: Pattern-Clustered Knowledge Distillation for Compressing Memory Access Prediction Models | Code | 0
Wisdom of Committee: Distilling from Foundation Model to Specialized Application Model | | 0
In-Distribution Consistency Regularization Improves the Generalization of Quantization-Aware Training | | 0
Unsupervised Text Style Transfer via LLMs and Attention Masking with Multi-way Interactions | | 0
PIRB: A Comprehensive Benchmark of Polish Dense and Hybrid Text Retrieval Methods | | 0
PromptKD: Distilling Student-Friendly Knowledge for Generative Language Models via Prompt Tuning | Code | 1
FGAD: Self-boosted Knowledge Distillation for An Effective Federated Graph Anomaly Detection Framework | | 0
A Survey on Knowledge Distillation of Large Language Models | Code | 5
Improve Cross-Architecture Generalization on Dataset Distillation | Code | 1
ELAD: Explanation-Guided Large Language Models Active Distillation | | 0
Induced Model Matching: How Restricted Models Can Help Larger Ones | Code | 0
Towards Cross-Tokenizer Distillation: the Universal Logit Distillation Loss for LLMs | Code | 4
Revisiting Knowledge Distillation for Autoregressive Language Models | Code | 0
On the Byzantine-Resilience of Distillation-Based Federated Learning | Code | 0
Teacher as a Lenient Expert: Teacher-Agnostic Data-Free Knowledge Distillation | Code | 0
GraphKD: Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation | Code | 1

Benchmark Results

Each entry lists a distillation method with the teacher (T:) and student (S:) architectures, the metric, the value claimed in the paper, and whether that claim has been independently verified.

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T:BEiT-L S:ViT-B/14) | Top-1 accuracy (%) | 86.43 | | Unverified
2 | ScaleKD (T:Swin-L S:ViT-B/16) | Top-1 accuracy (%) | 85.53 | | Unverified
3 | ScaleKD (T:Swin-L S:ViT-S/16) | Top-1 accuracy (%) | 83.93 | | Unverified
4 | ScaleKD (T:Swin-L S:Swin-T) | Top-1 accuracy (%) | 83.8 | | Unverified
5 | KD++ (T: regnety-16GF S:ViT-B) | Top-1 accuracy (%) | 83.6 | | Unverified
6 | VkD (T:RegNety 160 S:DeiT-S) | Top-1 accuracy (%) | 82.9 | | Unverified
7 | SpectralKD (T:Swin-S S:Swin-T) | Top-1 accuracy (%) | 82.7 | | Unverified
8 | ScaleKD (T:Swin-L S:ResNet-50) | Top-1 accuracy (%) | 82.55 | | Unverified
9 | DiffKD (T:Swin-L S: Swin-T) | Top-1 accuracy (%) | 82.5 | | Unverified
10 | DIST (T: Swin-L S: Swin-T) | Top-1 accuracy (%) | 82.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T:resnet-32x4, S:shufflenet-v2) | Top-1 Accuracy (%) | 79.86 | | Unverified
2 | shufflenet-v2 (T:resnet-32x4, S:shufflenet-v2) | Top-1 Accuracy (%) | 78.76 | | Unverified
3 | MV-MR (T: CLIP/ViT-B-16 S: resnet50) | Top-1 Accuracy (%) | 78.6 | | Unverified
4 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 Accuracy (%) | 78.28 | | Unverified
5 | resnet8x4 (T: resnet32x4 S: resnet8x4 [modified]) | Top-1 Accuracy (%) | 78.08 | | Unverified
6 | ReviewKD++ (T:resnet-32x4, S:shufflenet-v2) | Top-1 Accuracy (%) | 77.93 | | Unverified
7 | ReviewKD++ (T:resnet-32x4, S:shufflenet-v1) | Top-1 Accuracy (%) | 77.68 | | Unverified
8 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 Accuracy (%) | 77.5 | | Unverified
9 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 Accuracy (%) | 76.68 | | Unverified
10 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 Accuracy (%) | 76.31 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101 S: ResNet50) | mAP | 93.17 | | Unverified
2 | LSHFM (T: ResNet101 S: MobileNetV2) | mAP | 90.14 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins S: MobileNetV2) | RMSE | 2.43 | | Unverified