Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.
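In its classic form (Hinton et al., 2015), the student is trained to match the teacher's temperature-softened output distribution in addition to the usual hard-label objective. The sketch below shows that loss in PyTorch; it is a minimal illustration with assumed hyperparameters (`temperature`, `alpha`) and function names, not the specific method of any paper listed on this page.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.5):
    """Hinton-style KD loss: blend a soft-target KL term with hard-label CE.

    `temperature` and `alpha` are illustrative defaults, not values
    prescribed by any paper above.
    """
    # Soften both distributions; the KL term compares the student's
    # log-probabilities against the teacher's probabilities, scaled by T^2
    # to keep gradient magnitudes comparable across temperatures.
    soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    kd_term = F.kl_div(soft_student, soft_teacher, reduction="batchmean")
    kd_term = kd_term * temperature ** 2

    # Standard cross-entropy against the ground-truth labels.
    ce_term = F.cross_entropy(student_logits, labels)

    return alpha * kd_term + (1.0 - alpha) * ce_term
```

In a typical training step, `teacher_logits` are computed under `torch.no_grad()` so that only the student receives gradients.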

Papers

Showing 51–100 of 4240 papers

| Title | Status | Hype |
| --- | --- | --- |
| Analyzing the Importance of Blank for CTC-Based Knowledge Distillation | — | 0 |
| Feature Fusion and Knowledge-Distilled Multi-Modal Multi-Target Detection | — | 0 |
| Fine-tune Before Structured Pruning: Towards Compact and Accurate Self-Supervised Models for Speaker Diarization | — | 0 |
| Revisiting Cross-Modal Knowledge Distillation: A Disentanglement Approach for RGBD Semantic Segmentation | Code | 0 |
| Progressive Class-level Distillation | — | 0 |
| CL-LoRA: Continual Low-Rank Adaptation for Rehearsal-Free Class-Incremental Learning | Code | 1 |
| Proactive Guidance of Multi-Turn Conversation in Industrial Search | — | 0 |
| CREFT: Sequential Multi-Agent LLM for Character Relation Extraction | — | 0 |
| A Simple Linear Patch Revives Layer-Pruned Large Language Models | — | 0 |
| Sketch Down the FLOPs: Towards Efficient Networks for Human Sketch | — | 0 |
| Knowledge Distillation for Reservoir-based Classifier: Human Activity Recognition | — | 0 |
| CAST: Contrastive Adaptation and Distillation for Semi-Supervised Instance Segmentation | — | 0 |
| Improving Respiratory Sound Classification with Architecture-Agnostic Knowledge Distillation from Ensembles | Code | 0 |
| Multi-MLLM Knowledge Distillation for Out-of-Context News Detection | — | 0 |
| EasyDistill: A Comprehensive Toolkit for Effective Knowledge Distillation of Large Language Models | — | 0 |
| Light distillation for Incremental Graph Convolution Collaborative Filtering | — | 0 |
| Model Stitching by Functional Latent Alignment | — | 0 |
| Efficient Speech Translation through Model Compression and Knowledge Distillation | Code | 0 |
| From Data to Modeling: Fully Open-vocabulary Scene Graph Generation | — | 0 |
| DOGe: Defensive Output Generation for LLM Protection Against Knowledge Distillation | Code | 0 |
| ESLM: Risk-Averse Selective Language Modeling for Efficient Pretraining | — | 0 |
| Optimizing edge AI models on HPC systems with the edge in the loop | Code | 0 |
| Mosaic: Data-Free Knowledge Distillation via Mixture-of-Experts for Heterogeneous Distributed Environments | Code | 0 |
| Online Knowledge Distillation with Reward Guidance | — | 0 |
| Remote Sensing Image Classification with Decoupled Knowledge Distillation | — | 0 |
| Holistic White-light Polyp Classification via Alignment-free Dense Distillation of Auxiliary Optical Chromoendoscopy | Code | 0 |
| Tokenizing Electron Cloud in Protein-Ligand Interaction Learning | — | 0 |
| Knowledge Grafting of Large Language Models | Code | 0 |
| C3R: Channel Conditioned Cell Representations for unified evaluation in microscopy imaging | — | 0 |
| Single Snapshot Distillation for Phase Coded Mask Design in Phase Retrieval | — | 0 |
| ToDi: Token-wise Distillation via Fine-Grained Divergence Control | — | 0 |
| On Multilingual Encoder Language Model Compression for Low-Resource Languages | — | 0 |
| SEDD-PCC: A Single Encoder-Dual Decoder Framework For End-To-End Learned Point Cloud Compression | — | 0 |
| MentalMAC: Enhancing Large Language Models for Detecting Mental Manipulation via Multi-Task Anti-Curriculum Distillation | — | 0 |
| Deliberation on Priors: Trustworthy Reasoning of Large Language Models on Knowledge Graphs | Code | 1 |
| On the Generalization vs Fidelity Paradox in Knowledge Distillation | Code | 0 |
| An Efficient Private GPT Never Autoregressively Decodes | — | 0 |
| DeepKD: A Deeply Decoupled and Denoised Knowledge Distillation Trainer | Code | 1 |
| UWSAM: Segment Anything Model Guided Underwater Instance Segmentation and A Large-scale Benchmark Dataset | Code | 1 |
| Bridge the Gap between Past and Future: Siamese Model Optimization for Context-Aware Document Ranking | — | 0 |
| Intra-class Patch Swap for Self-Distillation | Code | 0 |
| Interpretable Traces, Unexpected Outcomes: Investigating the Disconnect in Trace-Based Knowledge Distillation | — | 0 |
| Ground-V: Teaching VLMs to Ground Complex Instructions in Pixels | — | 0 |
| Improved Methods for Model Pruning and Knowledge Distillation | — | 0 |
| Bridging the Modality Gap: Enhancing Channel Prediction with Semantically Aligned LLMs and Knowledge Distillation | — | 0 |
| SMOTExT: SMOTE meets Large Language Models | Code | 0 |
| A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone | Code | 1 |
| Towards Low-Latency Event Stream-based Visual Object Tracking: A Slow-Fast Approach | Code | 0 |
| Robust Multimodal Segmentation with Representation Regularization and Hybrid Prototype Distillation | Code | 0 |
| ORQA: A Benchmark and Foundation Model for Holistic Operating Room Modeling | — | 0 |
Page 2 of 85

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | — | Unverified |
| 2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | — | Unverified |
| 3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | — | Unverified |
| 4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | — | Unverified |
| 5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | — | Unverified |
| 6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | — | Unverified |
| 7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | — | Unverified |
| 8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | — | Unverified |
| 9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | — | Unverified |
| 10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | — | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | — | Unverified |
| 2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | — | Unverified |
| 3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | — | Unverified |
| 4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | — | Unverified |
| 5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | — | Unverified |
| 6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | — | Unverified |
| 7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | — | Unverified |
| 8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | — | Unverified |
| 9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | — | Unverified |
| 10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | — | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | — | Unverified |
| 2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | — | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | — | Unverified |