Knowledge Distillation
Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized; distillation trains a compact student model to reproduce the behavior of a larger teacher, retaining much of the teacher's accuracy at a fraction of its inference cost.
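As a concrete illustration, the classic logit-matching formulation (Hinton et al., 2015) trains the student on a weighted sum of two terms: a cross-entropy loss on the ground-truth labels and a KL-divergence loss that pulls the student's temperature-softened predictions toward the teacher's. The sketch below is a minimal PyTorch version; the temperature `T`, the weight `alpha`, and the function name are illustrative choices, not taken from any specific entry in the tables below.

```python
import torch
import torch.nn.functional as F


def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Hinton-style KD loss: soft-target KL term plus hard-label cross-entropy.

    T (temperature) and alpha (soft/hard weighting) are illustrative defaults.
    """
    # KL divergence between temperature-softened distributions. The T**2
    # factor keeps soft-target gradient magnitudes comparable across
    # temperatures (Hinton et al., 2015).
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=1),
        F.softmax(teacher_logits / T, dim=1),
        reduction="batchmean",
    ) * (T * T)
    # Standard cross-entropy against the ground-truth labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard


# Toy usage: batch of 8 examples, 100 classes; the teacher is frozen,
# so its logits are computed without tracking gradients.
student_logits = torch.randn(8, 100, requires_grad=True)
with torch.no_grad():
    teacher_logits = torch.randn(8, 100)
labels = torch.randint(0, 100, (8,))
loss = distillation_loss(student_logits, teacher_logits, labels)
loss.backward()
```

Many of the methods in the tables below (feature-mimicking, review-based, and diffusion-based variants) replace or augment this logit-matching term, but the teacher/student training setup is the same.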
Benchmark Results
Each table below is a separate leaderboard; in each entry, T denotes the teacher model and S the student model distilled from it.

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | — | Unverified |
| 2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | — | Unverified |
| 3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | — | Unverified |
| 4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | — | Unverified |
| 5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | — | Unverified |
| 6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | — | Unverified |
| 7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | — | Unverified |
| 8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | — | Unverified |
| 9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | — | Unverified |
| 10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | — | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SRD (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | — | Unverified |
| 2 | shufflenet-v2 (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | — | Unverified |
| 3 | MV-MR (T: CLIP ViT-B/16, S: ResNet-50) | Top-1 accuracy (%) | 78.6 | — | Unverified |
| 4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | — | Unverified |
| 5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | — | Unverified |
| 6 | ReviewKD++ (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | — | Unverified |
| 7 | ReviewKD++ (T: resnet32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | — | Unverified |
| 8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | — | Unverified |
| 9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | — | Unverified |
| 10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | — | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 77.16 | — | Unverified |
| 2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 73.73 | — | Unverified |
| 3 | ADLIK-Faster (T: Faster R-CNN (ViT-Base), S: Faster R-CNN (DeiT-Small)) | box AP | 47.6 | — | Unverified |
| 4 | ADLIK-Mask (T: Mask R-CNN (ViT-Base), S: Mask R-CNN (DeiT-Small)) | mask AP | 42.4 | — | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ReviewKD++ (T: Faster R-CNN (ResNet101), S: Faster R-CNN (ResNet50)) | AP@0.5 | 61.8 | — | Unverified |
| 2 | ReviewKD++ (T: Faster R-CNN (ResNet101), S: Faster R-CNN (ResNet18)) | AP@0.5 | 57.96 | — | Unverified |
| 3 | ReviewKD++ (T: Faster R-CNN (ResNet101), S: Faster R-CNN (MobileNetV2)) | AP@0.5 | 55.18 | — | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | — | Unverified |
| 2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | — | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | TIE-KD (T: AdaBins, S: MobileNetV2) | RMSE | 2.43 | — | Unverified |