
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model (the teacher) to a smaller one (the student). While large models, such as very deep neural networks or ensembles of many models, have higher knowledge capacity than small models, that capacity may not be fully utilized; a student trained to reproduce the teacher's outputs can often recover much of the teacher's accuracy at a fraction of the inference cost.
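As a concrete reference point, the classic recipe (Hinton et al., 2015) trains the student on a weighted combination of the usual hard-label cross-entropy and a KL-divergence term that matches the student's temperature-softened output distribution to the teacher's. Below is a minimal PyTorch sketch of that loss; `teacher`, `student`, and the hyperparameter values are illustrative assumptions, not drawn from any paper listed on this page.

```python
# Minimal sketch of soft-target knowledge distillation (Hinton et al., 2015).
# Assumes PyTorch and that `teacher` and `student` are classifiers with the
# same number of output classes -- both names are hypothetical placeholders.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.9):
    # Soften both output distributions with the temperature so the teacher's
    # "dark knowledge" (relative probabilities of the wrong classes) is
    # visible to the student.
    soft_targets = F.softmax(teacher_logits / temperature, dim=-1)
    log_student = F.log_softmax(student_logits / temperature, dim=-1)
    # The T^2 factor keeps the soft-target gradient magnitude comparable
    # to the hard-label term as the temperature changes.
    kd_term = F.kl_div(log_student, soft_targets,
                       reduction="batchmean") * temperature ** 2
    ce_term = F.cross_entropy(student_logits, labels)
    return alpha * kd_term + (1.0 - alpha) * ce_term

# Typical use inside a training step: the teacher is frozen and only the
# student receives gradients.
# with torch.no_grad():
#     teacher_logits = teacher(images)
# loss = distillation_loss(student(images), teacher_logits, labels)
# loss.backward()
```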

Papers

Showing 1801–1850 of 4240 papers

Title | Status | Hype
Intermediate Distillation: Data-Efficient Distillation from Black-Box LLMs for Information Retrieval | – | 0
STEVE Series: Step-by-Step Construction of Agent Systems in Minecraft | – | 0
Mutual Learning for Finetuning Click-Through Rate Prediction Models | – | 0
Graph Knowledge Distillation to Mixture of Experts | Code | 0
NLDF: Neural Light Dynamic Fields for Efficient 3D Talking Head Generation | – | 0
Knowledge Distillation in Federated Learning: a Survey on Long Lasting Challenges and New Solutions | – | 0
Self-Knowledge Distillation for Learning Ambiguity | – | 0
PC-LoRA: Low-Rank Adaptation for Progressive Model Compression with Knowledge Distillation | – | 0
Contextual Distillation Model for Diversified Recommendation | – | 0
DistilDoc: Knowledge Distillation for Visually-Rich Document Applications | – | 0
Adaptive Teaching with Shared Classifier for Knowledge Distillation | Code | 0
Guiding Frame-Level CTC Alignments Using Self-knowledge Distillation | Code | 0
GenDistiller: Distilling Pre-trained Language Models based on an Autoregressive Generative Model | – | 0
Unveiling Incomplete Modality Brain Tumor Segmentation: Leveraging Masked Predicted Auto-Encoder and Divergence Learning | – | 0
Low-Complexity Acoustic Scene Classification Using Parallel Attention-Convolution Network | Code | 0
Self-Distillation Learning Based on Temporal-Spatial Consistency for Spiking Neural Networks | – | 0
FastAST: Accelerating Audio Spectrogram Transformer via Token Merging and Cross-Model Knowledge Distillation | Code | 0
TernaryLLM: Ternarized Large Language Model | – | 0
Teaching with Uncertainty: Unleashing the Potential of Knowledge Distillation in Object Detection | – | 0
Weighted KL-Divergence for Document Ranking Model Refinement | – | 0
BS-PLCNet 2: Two-stage Band-split Packet Loss Concealment Network with Intra-model Knowledge Distillation | – | 0
Online Policy Distillation with Decision-Attention | – | 0
Teaching-Assistant-in-the-Loop: Improving Knowledge Distillation from Imperfect Teacher Models in Low-Budget Scenarios | – | 0
Data-Free Generative Replay for Class-Incremental Learning on Imbalanced Data | Code | 0
IOR: Inversed Objects Replay for Incremental Object Detection | – | 0
To Distill or Not to Distill? On the Robustness of Robust Knowledge Distillation | Code | 0
Step Out and Seek Around: On Warm-Start Training with Incremental Data | – | 0
Mutual Information Guided Backdoor Mitigation for Pre-trained Encoders | – | 0
Decision Boundary-aware Knowledge Consolidation Generates Better Instance-Incremental Learner | – | 0
Tiny models from tiny data: Textual and null-text inversion for few-shot distillation | Code | 0
Adversarial Moment-Matching Distillation of Large Language Models | Code | 0
PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs | – | 0
RKLD: Reverse KL-Divergence-based Knowledge Distillation for Unlearning Personal Information in Large Language Models | – | 0
Optimal Transport Guided Correlation Assignment for Multimodal Entity Linking | Code | 0
DL-KDD: Dual-Light Knowledge Distillation for Action Recognition in the Dark | – | 0
Decoupled Alignment for Robust Plug-and-Play Adaptation | – | 0
Toward Efficient Deep Spiking Neuron Networks: A Survey On Compression | – | 0
Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection | – | 0
Robust Knowledge Distillation Based on Feature Variance Against Backdoored Teacher Model | Code | 0
Vision-Language Meets the Skeleton: Progressively Distillation with Cross-Modal Knowledge for 3D Action Representation Learning | Code | 0
Adv-KD: Adversarial Knowledge Distillation for Faster Diffusion Sampling | Code | 0
Multi-label Class Incremental Emotion Decoding with Augmented Emotional Semantics Learning | – | 0
WebUOT-1M: Advancing Deep Underwater Object Tracking with A Million-Scale Benchmark | – | 0
GKT: A Novel Guidance-Based Knowledge Transfer Framework For Efficient Cloud-edge Collaboration LLM Deployment | Code | 0
Estimating Human Poses Across Datasets: A Unified Skeleton and Multi-Teacher Distillation Approach | – | 0
Relation Modeling and Distillation for Learning with Noisy Labels | – | 0
Distribution Aligned Semantics Adaption for Lifelong Person Re-Identification | Code | 0
Scalable Detection of Salient Entities in News Articles | – | 0
BLSP-KD: Bootstrapping Language-Speech Pre-training via Knowledge Distillation | – | 0
Forward-Backward Knowledge Distillation for Continual Clustering | – | 0

Benchmark Results

In the tables below, "T:" denotes the teacher model and "S:" the student model; a "–" in the Verified column means no verified result has been recorded yet.

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | – | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | – | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | – | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | – | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | – | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | – | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | – | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | – | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | – | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | – | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 79.86 | – | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 78.76 | – | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 Accuracy (%) | 78.6 | – | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 78.28 | – | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 Accuracy (%) | 78.08 | – | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 77.93 | – | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 Accuracy (%) | 77.68 | – | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 77.5 | – | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.68 | – | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.31 | – | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | – | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | – | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: AdaBins, S: MobileNetV2) | RMSE | 2.43 | – | Unverified