SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized; distillation aims to compress what the large model has learned into a smaller model that is cheaper to deploy, ideally with little loss in accuracy.
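
As a concrete illustration, the classic logit-matching formulation (Hinton et al., 2015) trains the student on a weighted mix of the usual cross-entropy loss and a KL-divergence term that pulls the student's temperature-softened output distribution toward the teacher's. The PyTorch-style sketch below is illustrative only; the temperature `T`, weight `alpha`, and the placeholder names in the usage comment are example assumptions, not settings taken from any paper listed here.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits: torch.Tensor,
                      teacher_logits: torch.Tensor,
                      labels: torch.Tensor,
                      T: float = 4.0,
                      alpha: float = 0.9) -> torch.Tensor:
    """Hinton-style knowledge distillation loss (illustrative sketch)."""
    # Soft targets: KL divergence between temperature-softened distributions.
    # Scaling by T^2 keeps gradient magnitudes comparable across temperatures.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    # Hard targets: ordinary cross-entropy against the ground-truth labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard

# Typical usage (hypothetical `teacher`, `student`, `images`, `labels`):
# the teacher is run without gradients; only the student is updated.
# with torch.no_grad():
#     teacher_logits = teacher(images)
# loss = distillation_loss(student(images), teacher_logits, labels)
```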

Papers

Showing 1851–1900 of 4240 papers

Asymmetric Decision-Making in Online Knowledge Distillation: Unifying Consensus and Divergence
Integrated Multi-Level Knowledge Distillation for Enhanced Speaker Verification
HoverFast: an accurate, high-throughput, clinically deployable nuclear segmentation tool for brightfield digital pathology images
Integrating Arithmetic Learning Improves Mathematical Reasoning in Smaller Models
EfficientViT-SAM: Accelerated Segment Anything Model Without Accuracy Loss
How Does Distilled Data Complexity Impact the Quality and Confidence of Non-Autoregressive Machine Translation?
Deep Neural Network Models Compression
How many Observations are Enough? Knowledge Distillation for Trajectory Forecasting
Compact Speaker Embedding: lrx-vector
How to Backdoor the Knowledge Distillation
Efficient Video Segmentation Models with Per-frame Inference
How to Prune Your Language Model: Recovering Accuracy on the "Sparsity May Cry" Benchmark
How to Select One Among All? An Empirical Study Towards the Robustness of Knowledge Distillation in Natural Language Understanding
Efficient Verified Machine Unlearning For Distillation
Discovery of novel antimicrobial peptides with notable antibacterial potency by a LLM-based foundation model
Amortized Noisy Channel Neural Machine Translation
Integrating ChatGPT into Secure Hospital Networks: A Case Study on Improving Radiology Report Analysis
HRPose: Real-Time High-Resolution 6D Pose Estimation Network Using Knowledge Distillation
Efficient Transformer Knowledge Distillation: A Performance Review
Human-Centered Prior-Guided and Task-Dependent Multi-Task Representation Learning for Action Recognition Pre-Training
Efficient Transformer-based Large Scale Language Representations using Hardware-friendly Block Structured Pruning
Compacting Deep Neural Networks for Internet of Things: Methods and Applications
Deep versus Wide: An Analysis of Student Architectures for Task-Agnostic Knowledge Distillation of Self-Supervised Speech Models
Human in the Latent Loop (HILL): Interactively Guiding Model Training Through Human Intuition
Efficient training of lightweight neural networks using Online Self-Acquired Knowledge Distillation
Compact CNN Structure Learning by Knowledge Distillation
HW-TSC’s Participation in the WMT 2020 News Translation Shared Task
HW-TSC’s Participation in the WMT 2021 Large-Scale Multilingual Translation Task
A Survey on Transformer Compression
Compact CNN Models for On-device Ocular-based User Recognition in Mobile Devices
Efficient Technical Term Translation: A Knowledge Distillation Approach for Parenthetical Terminology Translation
Hybrid Paradigm-based Brain-Computer Interface for Robotic Arm Control
HYDRA-FL: Hybrid Knowledge Distillation for Robust and Accurate Federated Learning
A Survey on Symbolic Knowledge Distillation of Large Language Models
A Flexible Multi-Task Model for BERT Serving
In Teacher We Trust: Learning Compressed Models for Pedestrian Detection
Integration of Pre-trained Networks with Continuous Token Interface for End-to-End Spoken Language Understanding
Efficient speech detection in environmental audio using acoustic recognition and knowledge distillation
I2CKD: Intra- and Inter-Class Knowledge Distillation for Semantic Segmentation
I2D2: Inductive Knowledge Distillation with NeuroLogic and Self-Imitation
I^2KD-SLU: An Intra-Inter Knowledge Distillation Framework for Zero-Shot Cross-Lingual Spoken Language Understanding
A Survey on Recent Teacher-student Learning Studies
IAG: Induction-Augmented Generation Framework for Answering Reasoning Questions
ICD-Face: Intra-class Compactness Distillation for Face Recognition
Efficient Speech Command Recognition Leveraging Spiking Neural Network and Curriculum Learning-based Knowledge Distillation
Batch Selection and Communication for Active Learning with Edge Labeling
Cross-resolution Face Recognition via Identity-Preserving Network and Knowledge Distillation
If At First You Don't Succeed: Test Time Re-ranking for Zero-shot, Cross-domain Retrieval
Active Large Language Model-based Knowledge Distillation for Session-based Recommendation
Efficient Point Cloud Classification via Offline Distillation Framework and Negative-Weight Self-Distillation Technique

Benchmark Results

In the tables below, T: denotes the teacher model and S: the student model; the Verified column is empty where no independent verification has been recorded.

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy % | 86.43 | — | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy % | 85.53 | — | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy % | 83.93 | — | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy % | 83.8 | — | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy % | 83.6 | — | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy % | 82.9 | — | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy % | 82.7 | — | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy % | 82.55 | — | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy % | 82.5 | — | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy % | 82.3 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 79.86 | — | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 78.76 | — | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 Accuracy (%) | 78.6 | — | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 78.28 | — | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 Accuracy (%) | 78.08 | — | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 77.93 | — | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 Accuracy (%) | 77.68 | — | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 77.5 | — | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.68 | — | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.31 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | — | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | — | Unverified