Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.
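
The description above does not say how the transfer is carried out. As a minimal sketch, assuming the common soft-target formulation (Hinton et al., 2015), the student is trained to match the teacher's temperature-softened output distribution. The function names, the example logits, and the temperature value are illustrative assumptions, not taken from this page or from any listed paper.

```python
import math

def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by T^2."""
    p = softmax(teacher_logits, temperature)  # teacher soft targets
    q = softmax(student_logits, temperature)  # student predictions
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return kl * temperature ** 2

# Hypothetical logits for a 3-class problem.
teacher = [4.0, 1.0, 0.5]
student_close = [3.5, 1.2, 0.4]  # roughly agrees with the teacher
student_far = [0.5, 1.0, 4.0]    # disagrees with the teacher

print(distillation_loss(student_close, teacher))
print(distillation_loss(student_far, teacher))
```

In practice this term is usually combined with an ordinary cross-entropy loss on the ground-truth labels. A higher temperature spreads probability mass over the non-target classes, which is where the teacher's extra information about class similarity sits.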

Papers

Showing 2701–2750 of 4240 papers

Title	Date	Tasks	Status
Simultaneous Reward Distillation and Preference Learning: Get You a Language Model Who Can Do Both	Oct 11, 2024	Knowledge Distillation, Language Modeling	Unverified
Single image calibration using knowledge distillation approaches	Dec 5, 2022	Camera Calibration, Incremental Learning	Unverified
Single Snapshot Distillation for Phase Coded Mask Design in Phase Retrieval	May 23, 2025	global-optimization, Knowledge Distillation	Unverified
Single-stage TTS with Masked Audio Token Modeling and Semantic Knowledge Distillation	Sep 17, 2024	Knowledge Distillation, Speech Synthesis	Unverified
SKDBERT: Compressing BERT via Stochastic Knowledge Distillation	Nov 26, 2022	Knowledge Distillation, Language Modeling	Unverified
Sketch Down the FLOPs: Towards Efficient Networks for Human Sketch	May 29, 2025	Image Retrieval, Knowledge Distillation	Unverified
SKILL: Similarity-aware Knowledge distILLation for Speech Self-Supervised Learning	Feb 26, 2024	Knowledge Distillation, Self-Supervised Learning	Unverified
SLaM: Student-Label Mixing for Distillation with Unlabeled Examples	Feb 8, 2023	Knowledge Distillation	Unverified
SlimSeg: Slimmable Semantic Segmentation with Boundary Supervision	Jul 13, 2022	Knowledge Distillation, Segmentation	Unverified
Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling	Aug 29, 2024	Diversity, Knowledge Distillation	Unverified
Small Language Models are Equation Reasoners	Sep 19, 2024	Arithmetic Reasoning, Knowledge Distillation	Unverified
Small Object Detection: A Comprehensive Survey on Challenges, Techniques and Real-World Applications	Mar 26, 2025	Articles, Data Augmentation	Unverified
Small Vision-Language Models: A Survey on Compact Architectures and Techniques	Mar 9, 2025	Computational Efficiency, Knowledge Distillation	Unverified
Smart Inference for Multidigit Convolutional Neural Network based Barcode Decoding	Apr 14, 2020	Knowledge Distillation	Unverified
SMOC-Net: Leveraging Camera Pose for Self-Supervised Monocular Object Pose Estimation	Jan 1, 2023	6D Pose Estimation using RGB, Knowledge Distillation	Unverified
Smoothing Out Hallucinations: Mitigating LLM Hallucination with Smoothed Knowledge Distillation	Feb 16, 2025	Hallucination, Knowledge Distillation	Unverified
SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training	Dec 12, 2024	Knowledge Distillation, Text-to-Image Generation	Unverified
SNN-PAR: Energy Efficient Pedestrian Attribute Recognition via Spiking Neural Networks	Oct 10, 2024	Attribute, Knowledge Distillation	Unverified
SoccerKDNet: A Knowledge Distillation Framework for Action Recognition in Soccer Videos	Jul 15, 2023	Action Recognition, Knowledge Distillation	Unverified
Soft Knowledge Distillation with Multi-Dimensional Cross-Net Attention for Image Restoration Models Compression	Jan 16, 2025	Contrastive Learning, Deblurring	Unverified
Soft Prompt Decoding for Multilingual Dense Retrieval	May 15, 2023	Cross-Lingual Information Retrieval, Information Retrieval	Unverified
Solvable Model for Inheriting the Regularization through Knowledge Distillation	Dec 1, 2020	Knowledge Distillation, Transfer Learning	Unverified
SonoSAMTrack -- Segment and Track Anything on Ultrasound Images	Oct 25, 2023	Knowledge Distillation	Unverified
Sorbet: A Neuromorphic Hardware-Compatible Transformer-Based Spiking Language Model	Sep 4, 2024	Knowledge Distillation, Language Modeling	Unverified
Toward Student-Oriented Teacher Network Training For Knowledge Distillation	Jun 14, 2022	Data Augmentation, Knowledge Distillation	Unverified
Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation	Apr 13, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Source-Target Unified Knowledge Distillation for Memory-Efficient Federated Domain Adaptation on Edge Devices	Sep 29, 2021	Domain Adaptation, Knowledge Distillation	Unverified
Space-Time Distillation for Video Super-Resolution	Jun 19, 2021	Knowledge Distillation, Super-Resolution	Unverified
Sparse Progressive Distillation: Resolving Overfitting under Pretrain-and-Finetune Paradigm	Oct 15, 2021	Knowledge Distillation	Unverified
Sparse Progressive Distillation: Resolving Overfitting under Pretrain-and-Finetune Paradigm	Nov 16, 2021	Knowledge Distillation	Unverified
Spatial Knowledge Distillation to aid Visual Reasoning	Dec 10, 2018	Diagnostic, Knowledge Distillation	Unverified
Spatial Likelihood Voting with Self-Knowledge Distillation for Weakly Supervised Object Detection	Apr 14, 2022	Knowledge Distillation, Multiple Instance Learning	Unverified
Spatio-Temporal Attention Mechanism and Knowledge Distillation for Lip Reading	Aug 7, 2021	Audio-Visual Speech Recognition, Knowledge Distillation	Unverified
Spatio-Temporal Graph for Video Captioning with Knowledge Distillation	Mar 31, 2020	Knowledge Distillation, Object	Unverified
Spatiotemporal Knowledge Distillation for Efficient Estimation of Aerial Video Saliency	Apr 10, 2019	GPU, Knowledge Distillation	Unverified
Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling	Oct 15, 2024	Instruction Following, Knowledge Distillation	Unverified
Speech Emotion: Investigating Model Representations, Multi-Task Learning and Knowledge Distillation	Jul 2, 2022	Knowledge Distillation, Multi-Task Learning	Unverified
Speech Emotion Recognition with Distilled Prosodic and Linguistic Affect Representations	Sep 9, 2023	Emotion Recognition, Knowledge Distillation	Unverified
Speech Translation with Foundation Models and Optimal Transport: UPC at IWSLT23	Jun 2, 2023	Knowledge Distillation, Machine Translation	Unverified
Spiking CenterNet: A Distillation-boosted Spiking Neural Network for Object Detection	Feb 2, 2024	Decoder, Knowledge Distillation	Unverified
Spirit Distillation: A Model Compression Method with Multi-domain Knowledge Transfer	Apr 29, 2021	General Knowledge, Knowledge Distillation	Unverified
Spirit Distillation: Precise Real-time Semantic Segmentation of Road Scenes with Insufficient Data	Mar 25, 2021	Autonomous Driving, Few-Shot Learning	Unverified
Split Knowledge Distillation for Large Models in IoT: Architecture, Challenges, and Solutions	Dec 17, 2024	Knowledge Distillation, Management	Unverified
Squeezing nnU-Nets with Knowledge Distillation for On-Board Cloud Detection	Jun 16, 2023	Cloud Detection, Knowledge Distillation	Unverified
SRIL: Selective Regularization for Class-Incremental Learning	May 9, 2023	class-incremental learning, Class Incremental Learning	Unverified
SSKD: Self-Supervised Knowledge Distillation for Cross Domain Adaptive Person Re-Identification	Sep 13, 2020	Clustering, Domain Adaptive Person Re-Identification	Unverified
SSMTL++: Revisiting Self-Supervised Multi-Task Learning for Video Anomaly Detection	Jul 16, 2022	Anomaly Detection, Knowledge Distillation	Unverified
SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning	May 18, 2025	Knowledge Distillation, Spatial Reasoning	Unverified
Stacked Acoustic-and-Textual Encoding: Integrating the Pre-trained Models into Speech Translation Encoders	May 12, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Static Word Embeddings for Sentence Semantic Representation	Jun 5, 2025	Contrastive Learning, Knowledge Distillation	Unverified

Datasets: ImageNet, CIFAR-100, COCO (Common Objects in Context), COCO 2017 val, PASCAL VOC, KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified