SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have a higher knowledge capacity than small models, this capacity may not be fully utilized. Distillation therefore trains a compact student model to reproduce the behavior of a larger teacher, retaining much of the teacher's accuracy at a fraction of the inference cost.
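For readers new to the technique, below is a minimal sketch of the classic soft-target distillation loss of Hinton et al. (2015), assuming PyTorch. The temperature T, the weight alpha, and the synthetic batch at the end are illustrative choices only, not details drawn from any paper or result listed on this page.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Soft-target knowledge distillation loss (Hinton et al., 2015).

    Blends the KL divergence between temperature-softened teacher and
    student distributions with ordinary cross-entropy on the hard labels.
    T=4.0 and alpha=0.9 are common illustrative defaults, not fixed values.
    """
    # Soften both distributions with temperature T; scaling by T**2 keeps
    # the soft-target gradients comparable in magnitude across temperatures.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T ** 2)
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard

# Example on one synthetic batch (shapes only; real use would take the
# logits from a frozen teacher and a trainable student network).
teacher_logits = torch.randn(8, 100)
student_logits = torch.randn(8, 100, requires_grad=True)
labels = torch.randint(0, 100, (8,))
distillation_loss(student_logits, teacher_logits, labels).backward()
```

Many of the papers indexed below build on this basic recipe, for example by matching intermediate features or adapting the temperature.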

Papers

Showing 1701–1750 of 4240 papers (page 35 of 85)

Title | Status | Hype
Edge Bias in Federated Learning and its Solution by Buffered Knowledge Distillation |  | 0
Galileo at SemEval-2020 Task 12: Multi-lingual Learning for Offensive Language Identification using Pre-trained Language Models |  | 0
ActivityCLIP: Enhancing Group Activity Recognition by Mining Complementary Information from Text to Supplement Image Modality |  | 0
GAN-Knowledge Distillation for one-stage Object Detection |  | 0
Gap Preserving Distillation by Building Bidirectional Mappings with A Dynamic Teacher |  | 0
GazeGen: Gaze-Driven User Interaction for Visual Content Generation |  | 0
End-to-End Automatic Speech Recognition with Deep Mutual Learning |  | 0
Endpoints Weight Fusion for Class Incremental Semantic Segmentation |  | 0
EncodeNet: A Framework for Boosting DNN Accuracy with Entropy-driven Generalized Converting Autoencoder |  | 0
Enabling Weak Client Participation via On-device Knowledge Distillation in Heterogenous Federated Learning |  | 0
Compositional Data Augmentation for Abstractive Conversation Summarization |  | 0
Asynchronous Convergence in Multi-Task Learning via Knowledge Distillation from Converged Tasks |  | 0
Enabling Multimodal Generation on CLIP via Vision-Language Knowledge Distillation |  | 0
Generalized Continual Zero-Shot Learning |  | 0
Data Efficient Acoustic Scene Classification using Teacher-Informed Confusing Class Instruction |  | 0
Generalized Uncertainty of Deep Neural Networks: Taxonomy and Applications |  | 0
Data-efficient Event Camera Pre-training via Disentangled Masked Modeling |  | 0
Asymmetric Temperature Scaling Makes Larger Networks Teach Well Again |  | 0
General Purpose Text Embeddings from Pre-trained Language Models for Scalable Inference |  | 0
Generate, Annotate, and Learn: Generative Models Advance Self-Training and Knowledge Distillation |  | 0
Generating Long Financial Report using Conditional Variational Autoencoders with Knowledge Distillation |  | 0
Complex Emotion Recognition System using basic emotions via Facial Expression, EEG, and ECG Signals: a review |  | 0
Generation and Consolidation of Recollections for Efficient Deep Lifelong Learning |  | 0
Generation-Distillation for Efficient Natural Language Understanding in Low-Data Settings |  | 0
Generative Adversarial Simulator |  | 0
Empowering Knowledge Distillation via Open Set Recognition for Robust 3D Point Cloud Classification |  | 0
AfroXLMR-Comet: Multilingual Knowledge Distillation with Attention Matching for Low-Resource languages |  | 0
I2D2: Inductive Knowledge Distillation with NeuroLogic and Self-Imitation |  | 0
I^2KD-SLU: An Intra-Inter Knowledge Distillation Framework for Zero-Shot Cross-Lingual Spoken Language Understanding |  | 0
Empowering Dual-Encoder with Query Generator for Cross-Lingual Dense Retrieval |  | 0
Empirical Evaluation of Knowledge Distillation from Transformers to Subquadratic Language Models |  | 0
Complete-to-Partial 4D Distillation for Self-Supervised Point Cloud Sequence Representation Learning |  | 0
Knowledge distillation for optimization of quantized deep neural networks |  | 0
Emo Pillars: Knowledge Distillation to Support Fine-Grained Context-Aware and Context-Less Emotion Classification |  | 0
A Framework for Double-Blind Federated Adaptation of Foundation Models |  | 0
Embracing the Dark Knowledge: Domain Generalization Using Regularized Knowledge Distillation |  | 0
EmbedDistill: A Geometric Knowledge Distillation for Information Retrieval |  | 0
Completely Heterogeneous Federated Learning |  | 0
GhostNetV3: Exploring the Training Strategies for Compact Models |  | 0
Embedding Compression for Teacher-to-Student Knowledge Transfer |  | 0
On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes |  | 0
Asymmetric Image Retrieval with Cross Model Compatible Ensembles |  | 0
ABKD: Graph Neural Network Compression with Attention-Based Knowledge Distillation |  | 0
Embedded Knowledge Distillation in Depth-Level Dynamic Neural Network |  | 0
ELiTe: Efficient Image-to-LiDAR Knowledge Transfer for Semantic Segmentation |  | 0
Comparison of Soft and Hard Target RNN-T Distillation for Large-scale ASR |  | 0
Global Intervention and Distillation for Federated Out-of-Distribution Generalization |  | 0
ADPS: Asymmetric Distillation Post-Segmentation for Image Anomaly Detection |  | 0
VizECGNet: Visual ECG Image Network for Cardiovascular Diseases Classification with Multi-Modal Training and Knowledge Distillation |  | 0

Benchmark Results

In each table, T denotes the teacher model and S the student. No result has been independently verified, so the Verified column is empty and every entry is marked Unverified.

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 |  | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 |  | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 |  | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 |  | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 |  | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 |  | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 |  | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 |  | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 |  | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 |  | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 |  | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 |  | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 |  | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 |  | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 |  | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 |  | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 |  | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 |  | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 |  | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: AdaBins, S: MobileNetV2) | RMSE | 2.43 |  | Unverified