
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, that capacity may not be fully utilized, so a well-trained student can often recover most of the teacher's accuracy at a fraction of the inference cost. In the classic soft-target formulation (Hinton et al., 2015), the student is trained to match the teacher's temperature-softened output distribution in addition to the ground-truth labels, as sketched below.
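A minimal PyTorch sketch of that soft-target loss follows. It illustrates the general technique only, not the method of any specific paper listed below; the temperature T, mixing weight alpha, and the toy linear models are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Soft-target distillation loss (Hinton et al., 2015).

    T and alpha are illustrative hyperparameters, not values taken from
    any paper on this page.
    """
    # KL divergence between the temperature-softened teacher and student
    # distributions; the T*T factor restores the gradient scale that the
    # 1/T softening would otherwise shrink.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    # Ordinary cross-entropy against the ground-truth hard labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard

# Toy usage: distill a frozen "teacher" into a smaller "student".
teacher = torch.nn.Linear(32, 10).eval()   # stand-in for a large model
student = torch.nn.Linear(32, 10)          # stand-in for a small model
x, y = torch.randn(8, 32), torch.randint(0, 10, (8,))
with torch.no_grad():
    t_logits = teacher(x)
loss = distillation_loss(student(x), t_logits, y)
loss.backward()
```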

Papers

Showing 2901–2950 of 4240 papers

Title | Status | Hype
Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies | — | 0
Class Similarity Weighted Knowledge Distillation for Continual Semantic Segmentation | — | 0
Class Token and Knowledge Distillation for Multi-head Self-Attention Speaker Verification Systems | — | 0
Collaborative Learning for Enhanced Unsupervised Domain Adaptation | — | 0
CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation | — | 0
CLFace: A Scalable and Resource-Efficient Continual Learning Framework for Lifelong Face Recognition | — | 0
Client Clustering Meets Knowledge Sharing: Enhancing Privacy and Robustness in Personalized Peer-to-Peer Learning | — | 0
CLIP-CID: Efficient CLIP Distillation via Cluster-Instance Discrimination | — | 0
CLIPPING: Distilling CLIP-Based Models With a Student Base for Video-Language Retrieval | — | 0
CLoCKDistill: Consistent Location-and-Context-aware Knowledge Distillation for DETRs | — | 0
Closing the Gap between Client and Global Model Performance in Heterogeneous Federated Learning | — | 0
Cloud-Device Collaborative Learning for Multimodal Large Language Models | — | 0
CL-ReKD: Cross-lingual Knowledge Distillation for Multilingual Retrieval Question Answering | — | 0
ClST: A Convolutional Transformer Framework for Automatic Modulation Recognition by Knowledge Distillation | — | 0
CMU’s IWSLT 2022 Dialect Speech Translation System | — | 0
CoCo DistillNet: a Cross-layer Correlation Distillation Network for Pathological Gastric Cancer Segmentation | — | 0
CoDERT: Distilling Encoder Representations with Co-learning for Transducer-based Speech Recognition | — | 0
Cold & Warm Net: Addressing Cold-Start Users in Recommender Systems | — | 0
Collaborative Distillation for Top-N Recommendation | — | 0
Collaborative Distillation in the Parameter and Spectrum Domains for Video Action Recognition | — | 0
Collaborative Inter-agent Knowledge Distillation for Reinforcement Learning | — | 0
Collaborative Learning for Deep Neural Networks | — | 0
Collaborative Multi-Teacher Knowledge Distillation for Learning Low Bit-width Deep Neural Networks | — | 0
Collaborative Teacher-Student Learning via Multiple Knowledge Transfer | — | 0
CoLLD: Contrastive Layer-to-layer Distillation for Compressing Multilingual Pre-trained Speech Encoders | — | 0
Collective Knowledge Graph Completion with Mutual Knowledge Distillation | — | 0
Collective Wisdom: Improving Low-resource Neural Machine Translation using Adaptive Knowledge Distillation | — | 0
Combining Compressions for Multiplicative Size Scaling on Natural Language Tasks | — | 0
Combining Curriculum Learning and Knowledge Distillation for Dialogue Generation | — | 0
CoMBO: Conflict Mitigation via Branched Optimization for Class Incremental Segmentation | — | 0
ComKD-CLIP: Comprehensive Knowledge Distillation for Contrastive Language-Image Pre-traning Model | — | 0
Batch Selection and Communication for Active Learning with Edge Labeling | — | 0
Compact CNN Models for On-device Ocular-based User Recognition in Mobile Devices | — | 0
Compact CNN Structure Learning by Knowledge Distillation | — | 0
Compacting Deep Neural Networks for Internet of Things: Methods and Applications | — | 0
Compact Speaker Embedding: lrx-vector | — | 0
Comparing Fisher Information Regularization with Distillation for DNN Quantization | — | 0
Comparison of Soft and Hard Target RNN-T Distillation for Large-scale ASR | — | 0
Completely Heterogeneous Federated Learning | — | 0
Complete-to-Partial 4D Distillation for Self-Supervised Point Cloud Sequence Representation Learning | — | 0
Complex Emotion Recognition System using basic emotions via Facial Expression, EEG, and ECG Signals: a review | — | 0
Compositional Data Augmentation for Abstractive Conversation Summarization | — | 0
Comprehensive Pathological Image Segmentation via Teacher Aggregation for Tumor Microenvironment Analysis | — | 0
Comprehensive Study on Performance Evaluation and Optimization of Model Compression: Bridging Traditional Deep Learning and Large Language Models | — | 0
Comprehensive Survey of Model Compression and Speed up for Vision Transformers | — | 0
Compressed Meta-Optical Encoder for Image Classification | — | 0
Compressing Deep Image Super-resolution Models | — | 0
Compressing GANs using Knowledge Distillation | — | 0
Compressing Image-to-Image Translation GANs Using Local Density Structures on Their Learned Manifold | — | 0
Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging | — | 0
Page 59 of 85

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | — | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | — | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | — | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | — | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | — | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | — | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | — | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | — | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | — | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | — | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | — | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | — | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | — | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | — | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | — | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | — | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | — | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | — | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | — | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | — | Unverified