Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1101–1150 of 4240 papers

Title	Date	Tasks	Status
Swapped Logit Distillation via Bi-level Teacher Alignment	Apr 27, 2025	image-classificationImage Classification	CodeCode Available
Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs	Apr 24, 2025	Image-text RetrievalInstruction Following	—Unverified
Unified Attacks to Large Language Model Watermarks: Spoofing and Scrubbing in Unauthorized Knowledge Distillation	Apr 24, 2025	Knowledge DistillationLanguage Modeling	—Unverified
Does Knowledge Distillation Matter for Large Language Model based Bundle Generation?	Apr 24, 2025	In-Context LearningKnowledge Distillation	—Unverified
Emo Pillars: Knowledge Distillation to Support Fine-Grained Context-Aware and Context-Less Emotion Classification	Apr 23, 2025	Emotion ClassificationGPU	—Unverified
Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis	Apr 20, 2025	2kKnowledge Distillation	—Unverified
Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions	Apr 20, 2025	Dataset DistillationDiversity	—Unverified
Empirical Evaluation of Knowledge Distillation from Transformers to Subquadratic Language Models	Apr 19, 2025	Knowledge DistillationState Space Models	—Unverified
Feature Alignment and Representation Transfer in Knowledge Distillation for Large Language Models	Apr 18, 2025	image-classificationImage Classification	—Unverified
From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs	Apr 18, 2025	Knowledge DistillationModel Compression	—Unverified
Scaling Laws for Data-Efficient Visual Transfer Learning	Apr 17, 2025	Knowledge DistillationTransfer Learning	—Unverified
Transferable Deployment of Semantic Edge Inference Systems via Unsupervised Domain Adaption	Apr 16, 2025	DecoderDomain Adaptation	—Unverified
Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning	Apr 15, 2025	Knowledge DistillationLanguage Modeling	—Unverified
Digital Staining with Knowledge Distillation: A Unified Framework for Unpaired and Paired-But-Misaligned Data	Apr 14, 2025	ColorizationKnowledge Distillation	CodeCode Available
Optimizing Multi-Gateway LoRaWAN via Cloud-Edge Collaboration and Knowledge Distillation	Apr 13, 2025	Decision MakingKnowledge Distillation	—Unverified
Can LLMs Revolutionize the Design of Explainable and Efficient TinyML Models?	Apr 13, 2025	Computational EfficiencyEfficient Neural Network	—Unverified
Knowledge Distillation for Underwater Feature Extraction and Matching via GAN-synthesized Images	Apr 11, 2025	General KnowledgeKnowledge Distillation	—Unverified
Knowledge Distillation for Multimodal Egocentric Action Recognition Robust to Missing Modalities	Apr 11, 2025	Action RecognitionKnowledge Distillation	—Unverified
Proxy-Anchor and EVT-Driven Continual Learning Method for Generalized Category Discovery	Apr 11, 2025	Continual LearningKnowledge Distillation	CodeCode Available
Towards Unconstrained 2D Pose Estimation of the Human Spine	Apr 10, 2025	2D Pose EstimationActive Learning	—Unverified
WK-Pnet: FM-Based Positioning via Wavelet Packet Decomposition and Knowledge Distillation	Apr 10, 2025	Knowledge DistillationPosition	—Unverified
ThermoStereoRT: Thermal Stereo Matching in Real Time via Knowledge Distillation and Attention-based Refinement	Apr 10, 2025	Knowledge DistillationStereo Matching	CodeCode Available
Distilling Knowledge from Heterogeneous Architectures for Semantic Segmentation	Apr 10, 2025	Knowledge DistillationSemantic Segmentation	—Unverified
Teaching pathology foundation models to accurately predict gene expression with parameter efficient knowledge transfer	Apr 9, 2025	Knowledge Distillationparameter-efficient fine-tuning	—Unverified
Resource-Efficient Beam Prediction in mmWave Communications with Multimodal Realistic Simulation Framework	Apr 7, 2025	Autonomous DrivingBeam Prediction	—Unverified
GOTHAM: Graph Class Incremental Learning Framework under Weak Supervision	Apr 7, 2025	Attributeclass-incremental learning	CodeCode Available
A Novel Algorithm for Personalized Federated Learning: Knowledge Distillation with Weighted Combination Loss	Apr 6, 2025	Federated LearningKnowledge Distillation	—Unverified
Corrected with the Latest Version: Make Robust Asynchronous Federated Learning Possible	Apr 5, 2025	Federated LearningKnowledge Distillation	—Unverified
UNDO: Understanding Distillation as Optimization	Apr 3, 2025	Knowledge Distillation	—Unverified
Beyond Conventional Transformers: The Medical X-ray Attention (MXA) Block for Improved Multi-Label Diagnosis Using Knowledge Distillation	Apr 3, 2025	Anomaly DetectionKnowledge Distillation	CodeCode Available
Marine Saliency Segmenter: Object-Focused Conditional Diffusion with Region-Level Semantic Knowledge Distillation	Apr 3, 2025	Knowledge DistillationSegmentation	—Unverified
Agglomerating Large Vision Encoders via Distillation for VFSS Segmentation	Apr 3, 2025	Image SegmentationKnowledge Distillation	—Unverified
Causal Self-supervised Pretrained Frontend with Predictive Code for Speech Separation	Apr 3, 2025	DecoderKnowledge Distillation	—Unverified
FlowDistill: Scalable Traffic Flow Prediction via Distillation from LLMs	Apr 2, 2025	Knowledge DistillationPrediction	CodeCode Available
KD^2M: An unifying framework for feature knowledge distillation	Apr 2, 2025	Knowledge Distillation	—Unverified
Random Conditioning with Distillation for Data-Efficient Diffusion Model Compression	Apr 2, 2025	DenoisingKnowledge Distillation	—Unverified
Style over Substance: Distilled Language Models Reason Via Stylistic Replication	Apr 2, 2025	Knowledge Distillation	—Unverified
A Novel Approach To Implementing Knowledge Distillation In Tsetlin Machines	Apr 2, 2025	Knowledge Distillationtext-classification	—Unverified
Global Intervention and Distillation for Federated Out-of-Distribution Generalization	Apr 1, 2025	AttributeData Augmentation	—Unverified
OccludeNeRF: Geometric-aware 3D Scene Inpainting with Collaborative Score Distillation in NeRF	Apr 1, 2025	DenoisingKnowledge Distillation	—Unverified
Adversarial Curriculum Graph-Free Knowledge Distillation for Graph Neural Networks	Apr 1, 2025	Data-free Knowledge DistillationKnowledge Distillation	—Unverified
Is LLM the Silver Bullet to Low-Resource Languages Machine Translation?	Mar 31, 2025	ArticlesKnowledge Distillation	—Unverified
Crossmodal Knowledge Distillation with WordNet-Relaxed Text Embeddings for Robust Image Classification	Mar 31, 2025	image-classificationImage Classification	—Unverified
A Plasticity-Aware Method for Continual Self-Supervised Learning in Remote Sensing	Mar 31, 2025	Continual Self-Supervised LearningKnowledge Distillation	—Unverified
Unimodal-driven Distillation in Multimodal Emotion Recognition with Dynamic Fusion	Mar 31, 2025	Emotion RecognitionKnowledge Distillation	—Unverified
Efficient Verified Machine Unlearning For Distillation	Mar 28, 2025	Knowledge DistillationMachine Unlearning	—Unverified
Intrinsic Image Decomposition for Robust Self-supervised Monocular Depth Estimation on Reflective Surfaces	Mar 28, 2025	Depth EstimationDepth Prediction	—Unverified
Delving Deep into Semantic Relation Distillation	Mar 27, 2025	Knowledge DistillationModel Compression	—Unverified
Alleviating LLM-based Generative Retrieval Hallucination in Alipay Search	Mar 27, 2025	HallucinationKnowledge Distillation	—Unverified
DuckSegmentation: A segmentation model based on the AnYue Hemp Duck Dataset	Mar 27, 2025	Knowledge DistillationObject Recognition	—Unverified

Show:10 25 50

← PrevPage 23 of 85Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified