SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity may not be fully utilized; distillation compresses that knowledge into a compact model that is cheaper to run while retaining as much of the large model's accuracy as possible.
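Many of the methods listed below extend the classic soft-target objective of Hinton et al. (2015): the student is trained to match the teacher's temperature-softened output distribution in addition to the hard ground-truth labels. Here is a minimal PyTorch sketch for illustration; the function name and the temperature/weight defaults are illustrative choices, not values taken from any paper on this page:

```python
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Soft-target knowledge distillation loss (Hinton et al., 2015).

    Blends the KL divergence between temperature-softened teacher and
    student distributions with ordinary cross-entropy on the hard labels.
    T and alpha are common illustrative defaults, not tuned values.
    """
    # Soften both distributions with temperature T. The T*T factor keeps
    # the soft-target gradient magnitude comparable to the hard-label term.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard
```

In a typical training loop the teacher runs in eval mode with gradients disabled, and only the student's parameters are updated by this loss.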

Papers

Showing 1251–1300 of 4240 papers

Title | Status | Hype
Contrastive Representation Distillation via Multi-Scale Feature Decoupling | - | 0
Demystifying Catastrophic Forgetting in Two-Stage Incremental Object Detector | - | 0
ATLAS: Autoformalizing Theorems through Lifting, Augmentation, and Synthesis of Data | - | 0
Revisiting Intermediate-Layer Matching in Knowledge Distillation: Layer-Selection Strategy Doesn't Matter (Much) | - | 0
BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation | - | 0
Multilingual Non-Autoregressive Machine Translation without Knowledge Distillation | Code | 0
A Unified Knowledge-Distillation and Semi-Supervised Learning Framework to Improve Industrial Ads Delivery Systems | - | 0
Training an LLM-as-a-Judge Model: Pipeline, Insights, and Practical Lessons | - | 0
MIND: Modality-Informed Knowledge Distillation Framework for Multimodal Clinical Prediction Tasks | - | 0
A Framework for Double-Blind Federated Adaptation of Foundation Models | - | 0
A method for estimating forest carbon storage distribution density via artificial intelligence generated content model | - | 0
VLM-Assisted Continual learning for Visual Question Answering in Self-Driving | - | 0
FedHPD: Heterogeneous Federated Reinforcement Learning via Policy Distillation | Code | 0
Role of Mixup in Topological Persistence Based Knowledge Distillation for Wearable Sensor Data | - | 0
Robust Knowledge Distillation in Federated Learning: Counteracting Backdoor Attacks | Code | 0
Rethinking the Upsampling Layer in Hyperspectral Image Super Resolution | - | 0
Mini-ResEmoteNet: Leveraging Knowledge Distillation for Human-Centered Design | - | 0
RL-based Query Rewriting with Distilled LLM for online E-Commerce Systems | - | 0
Distilling Knowledge for Designing Computational Imaging Systems | Code | 0
Efficient Knowledge Distillation of SAM for Medical Image Segmentation | - | 0
Heterogeneity-aware Personalized Federated Learning via Adaptive Dual-Agent Reinforcement Learning | - | 0
FedEFM: Federated Endovascular Foundation Model with Unseen Data | - | 0
TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models | - | 0
Target-driven Self-Distillation for Partial Observed Trajectories Forecasting | - | 0
A Contrastive Teacher-Student Framework for Novelty Detection under Style Shifts | - | 0
Efficient Logit-based Knowledge Distillation of Deep Spiking Neural Networks for Full-Range Timestep Deployment | Code | 0
PISCO: Pretty Simple Compression for Retrieval-Augmented Generation | - | 0
MimicGait: A Model Agnostic approach for Occluded Gait Recognition using Correlational Knowledge Distillation | Code | 0
Scaling Large Vision-Language Models for Enhanced Multimodal Comprehension In Biomedical Image Analysis | - | 0
Graph-Based Cross-Domain Knowledge Distillation for Cross-Dataset Text-to-Image Person Retrieval | - | 0
Pre-trained Model Guided Mixture Knowledge Distillation for Adversarial Federated Learning | - | 0
On Accelerating Edge AI: Optimizing Resource-Constrained Environments | - | 0
Multimodal Prescriptive Deep Learning | - | 0
Remining Hard Negatives for Generative Pseudo Labeled Domain Adaptation | - | 0
Multi-aspect Knowledge Distillation with Large Language Model | Code | 0
Unlearning Clients, Features and Samples in Vertical Federated Learning | - | 0
Toward Model-centric Heterogeneous Federated Graph Learning: A Knowledge-driven Approach | - | 0
EchoLM: Accelerating LLM Serving with Real-time Knowledge Distillation | - | 0
LiT: Delving into a Simplified Linear Diffusion Transformer for Image Generation | - | 0
Extracting General-use Transformers for Low-resource Languages via Knowledge Distillation | - | 0
Efficient Lung Ultrasound Severity Scoring Using Dedicated Feature Extractor | Code | 0
Learning to reconstruct signals with inexact sensing operator via knowledge distillation | - | 0
DNA 1.0 Technical Report | - | 0
Enhancing Generalization in Chain of Thought Reasoning for Smaller Models | - | 0
Soft Knowledge Distillation with Multi-Dimensional Cross-Net Attention for Image Restoration Models Compression | - | 0
Class Incremental Fault Diagnosis under Limited Fault Data via Supervised Contrastive Knowledge Distillation | Code | 0
Knowledge Distillation for Image Restoration : Simultaneous Learning from Degraded and Clean Images | - | 0
Feature-based One-For-All: A Universal Framework for Heterogeneous Knowledge Distillation | - | 0
VECT-GAN: A variationally encoded generative model for overcoming data scarcity in pharmaceutical science | Code | 0
Induced Model Matching: Restricted Models Help Train Full-Featured Models | Code | 0

Benchmark Results

In the tables below, T denotes the teacher model and S the student model.

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | - | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | - | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | - | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | - | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | - | Unverified
6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | - | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | - | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | - | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | - | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | - | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | - | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | - | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | - | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | - | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | - | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | - | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | - | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | - | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | - | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | - | Unverified