
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.
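The standard response-based recipe (Hinton et al., 2015) trains the student to match the teacher's temperature-softened output distribution alongside the usual hard labels. Below is a minimal sketch in PyTorch; the temperature T and mixing weight alpha are illustrative hyperparameters, not values taken from any paper listed on this page.

import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    # Soften both output distributions with temperature T so the
    # teacher's "dark knowledge" about non-target classes is exposed.
    soft_teacher = F.softmax(teacher_logits / T, dim=-1)
    log_soft_student = F.log_softmax(student_logits / T, dim=-1)
    # KL divergence between the softened distributions; the T^2 factor
    # keeps gradient magnitudes comparable across temperatures.
    kd_term = F.kl_div(log_soft_student, soft_teacher,
                       reduction="batchmean") * (T * T)
    # Ordinary cross-entropy against the ground-truth hard labels.
    ce_term = F.cross_entropy(student_logits, labels)
    return alpha * kd_term + (1.0 - alpha) * ce_term

In training, teacher_logits come from the frozen large model run in eval mode with gradients disabled; only the student's parameters are updated.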

Papers

Showing 1151–1200 of 4240 papers (page 24 of 85)

Title | Status | Hype
Data Techniques For Online End-to-end Speech Recognition | - | 0
Beyond Task Vectors: Selective Task Arithmetic Based on Importance Metrics | - | 0
Mining Data Impressions from Deep Models as Substitute for the Unavailable Training Data | - | 0
A Closer Look at Deep Learning Heuristics: Learning rate restarts, Warmup and Distillation | - | 0
Enhancing Modality-Agnostic Representations via Meta-Learning for Brain Tumor Segmentation | - | 0
Enhancing Romanian Offensive Language Detection through Knowledge Distillation, Multi-Task Learning, and Data Augmentation | - | 0
Data-Free Knowledge Transfer: A Survey | - | 0
Data-Free Knowledge Distillation with Soft Targeted Transfer Set Synthesis | - | 0
Data-Free Knowledge Distillation Using Adversarially Perturbed OpenGL Shader Images | - | 0
Adaptive Knowledge Distillation for Classification of Hand Images using Explainable Vision Transformers | - | 0
A Classifier-Free Incremental Learning Framework for Scalable Medical Image Segmentation | - | 0
All You Need in Knowledge Distillation Is a Tailored Coordinate System | - | 0
Advancing Multiple Instance Learning with Continual Learning for Whole Slide Imaging | - | 0
Beyond Classification: Knowledge Distillation using Multi-Object Impressions | - | 0
Enhancing Few-shot Keyword Spotting Performance through Pre-Trained Self-supervised Speech Models | - | 0
Enhancing Generalization in Chain of Thought Reasoning for Smaller Models | - | 0
Alleviating LLM-based Generative Retrieval Hallucination in Alipay Search | - | 0
Adaptive Knowledge Distillation between Text and Speech Pre-trained Models | - | 0
Data-Free Federated Class Incremental Learning with Diffusion-Based Generative Memory | - | 0
Data-free Distillation with Degradation-prompt Diffusion for Multi-weather Image Restoration | - | 0
Enhancing CTC-Based Visual Speech Recognition | - | 0
Data-Free Distillation of Language Model by Text-to-Text Transfer | - | 0
Dense Depth Distillation with Out-of-Distribution Simulated Images | - | 0
Data-Free Adversarial Knowledge Distillation for Graph Neural Networks | - | 0
Alleviating Catastrophic Forgetting of Incremental Object Detection via Within-Class and Between-Class Knowledge Distillation | - | 0
Enhancing Data-Free Adversarial Distillation with Activation Regularization and Virtual Interpolation | - | 0
Enhancing Scalability in Recommender Systems through Lottery Ticket Hypothesis and Knowledge Distillation-based Neural Network Pruning | - | 0
Enhancing Action Recognition from Low-Quality Skeleton Data via Part-Level Knowledge Distillation | - | 0
Accurate Knowledge Distillation with n-best Reranking | - | 0
Enhancing Adversarial Training with Prior Knowledge Distillation for Robust Image Compression | - | 0
Data-Efficient Ranking Distillation for Image Retrieval | - | 0
Adaptive Instance Distillation for Object Detection in Autonomous Driving | - | 0
Data-efficient Event Camera Pre-training via Disentangled Masked Modeling | - | 0
Data Efficient Acoustic Scene Classification using Teacher-Informed Confusing Class Instruction | - | 0
Better Knowledge Enhancement for Privacy-Preserving Cross-Project Defect Prediction | - | 0
Data-Driven Compression of Convolutional Neural Networks | - | 0
Alignment Knowledge Distillation for Online Streaming Attention-based Speech Recognition | - | 0
Enhancing Chinese Multi-Label Text Classification Performance with Response-based Knowledge Distillation | - | 0
DASECount: Domain-Agnostic Sample-Efficient Wireless Indoor Crowd Counting via Few-shot Learning | - | 0
BeSound: Bluetooth-Based Position Estimation Enhancing with Cross-Modality Distillation | - | 0
Adaptive Group Robust Ensemble Knowledge Distillation | - | 0
BERT Learns to Teach: Knowledge Distillation with Meta Learning | - | 0
DAKD: Data Augmentation and Knowledge Distillation using Diffusion Models for SAR Oil Spill Segmentation | - | 0
DaFKD: Domain-Aware Federated Knowledge Distillation | - | 0
BERM: Training the Balanced and Extractable Representation for Matching to Improve Generalization Ability of Dense Retrieval | - | 0
Enhancing Accuracy and Parameter-Efficiency of Neural Representations for Network Parameterization | - | 0
Enhancing Content Representation for AR Image Quality Assessment Using Knowledge Distillation | - | 0
Energy-efficient Knowledge Distillation for Spiking Neural Networks | - | 0
StyleRF-VolVis: Style Transfer of Neural Radiance Fields for Expressive Volume Visualization | - | 0
Enhanced Multimodal Representation Learning with Cross-modal KD | - | 0

Benchmark Results

In each table below, T: denotes the teacher model and S: the student model. Claimed values are as reported by the authors; no entry currently has an independently Verified value.

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | - | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | - | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | - | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | - | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | - | Unverified
6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | - | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | - | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | - | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | - | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 79.86 | - | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 78.76 | - | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 Accuracy (%) | 78.6 | - | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 78.28 | - | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 Accuracy (%) | 78.08 | - | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 77.93 | - | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 Accuracy (%) | 77.68 | - | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 77.5 | - | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.68 | - | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.31 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | - | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE (lower is better) | 2.43 | - | Unverified
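
Top-1 accuracy, the metric used in most of the tables above, is the fraction of evaluation samples whose highest-scoring predicted class matches the ground-truth label. A minimal sketch of how it is computed, assuming PyTorch and a generic evaluation loader (the model, loader, and device names here are illustrative):

import torch

@torch.no_grad()
def top1_accuracy(model, loader, device="cuda"):
    # Counts samples whose argmax prediction equals the label.
    model.eval()
    correct, total = 0, 0
    for images, labels in loader:
        images, labels = images.to(device), labels.to(device)
        predictions = model(images).argmax(dim=-1)
        correct += (predictions == labels).sum().item()
        total += labels.numel()
    return 100.0 * correct / total

A claimed figure such as 82.3 for DIST (T: Swin-L, S: Swin-T) corresponds to this quantity computed on the benchmark's held-out evaluation split; the mAP and RMSE entries in the last two tables are different metrics (mean average precision and root-mean-square error, respectively).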