
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity may not be fully utilized; a compact "student" model trained to reproduce the behavior of a large "teacher" can therefore often retain much of its accuracy at a fraction of the compute and memory cost.
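
Many of the methods listed below extend the classic soft-target recipe of Hinton et al. (2015): the student is trained to match the teacher's temperature-softened output distribution in addition to the ordinary hard labels. Below is a minimal PyTorch sketch for illustration; the function name and the `temperature`/`alpha` values are illustrative defaults, not taken from any specific paper on this page.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.9):
    """Soft-target KD loss: KL to the teacher plus hard-label cross-entropy.

    teacher_logits should be computed under torch.no_grad() so no
    gradients flow into the teacher.
    """
    # Soften both distributions with the same temperature T.
    soft_targets = F.softmax(teacher_logits / temperature, dim=-1)
    log_student = F.log_softmax(student_logits / temperature, dim=-1)
    # The KL term is scaled by T^2 so its gradient magnitude stays
    # comparable to the hard-label term as T varies (Hinton et al., 2015).
    kd = F.kl_div(log_student, soft_targets,
                  reduction="batchmean") * temperature ** 2
    ce = F.cross_entropy(student_logits, labels)
    # Blend the two objectives; alpha weights the distillation term.
    return alpha * kd + (1.0 - alpha) * ce
```

In practice the temperature and the blend weight are tuned per task; a higher temperature exposes more of the teacher's "dark knowledge" about relative class similarities, which is what the student exploits.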

Papers

Showing 2051–2100 of 4240 papers

Title | Status | Hype
On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes | — | 0
GhostNetV3: Exploring the Training Strategies for Compact Models | — | 0
GHOST: Grounded Human Motion Generation with Open Vocabulary Scene-and-Text Contexts | — | 0
Data-Free Federated Class Incremental Learning with Diffusion-Based Generative Memory | — | 0
GeoMask3D: Geometrically Informed Mask Selection for Self-Supervised Point Cloud Learning in 3D | — | 0
GenURL: A General Framework for Unsupervised Representation Learning | — | 0
Data-free Distillation with Degradation-prompt Diffusion for Multi-weather Image Restoration | — | 0
Alleviating LLM-based Generative Retrieval Hallucination in Alipay Search | — | 0
Data-Free Distillation of Language Model by Text-to-Text Transfer | — | 0
Generative Negative Text Replay for Continual Vision-Language Pretraining | — | 0
Dense Depth Distillation with Out-of-Distribution Simulated Images | — | 0
Generative Dataset Distillation Based on Self-knowledge Distillation | — | 0
Data-Free Adversarial Knowledge Distillation for Graph Neural Networks | — | 0
Alleviating Catastrophic Forgetting of Incremental Object Detection via Within-Class and Between-Class Knowledge Distillation | — | 0
Adaptive Knowledge Distillation between Text and Speech Pre-trained Models | — | 0
A Classifier-Free Incremental Learning Framework for Scalable Medical Image Segmentation | — | 0
Advancing Multiple Instance Learning with Continual Learning for Whole Slide Imaging | — | 0
Generative Adversarial Simulator | — | 0
Generation-Distillation for Efficient Natural Language Understanding in Low-Data Settings | — | 0
Generation and Consolidation of Recollections for Efficient Deep Lifelong Learning | — | 0
Generating Synthetic Fair Syntax-agnostic Data by Learning and Distilling Fair Representation | — | 0
Generating Long Financial Report using Conditional Variational Autoencoders with Knowledge Distillation | — | 0
Generate, Annotate, and Learn: Generative Models Advance Self-Training and Knowledge Distillation | — | 0
General Purpose Text Embeddings from Pre-trained Language Models for Scalable Inference | — | 0
Data-Efficient Ranking Distillation for Image Retrieval | — | 0
Generalized Uncertainty of Deep Neural Networks: Taxonomy and Applications | — | 0
Data-efficient Event Camera Pre-training via Disentangled Masked Modeling | — | 0
Data Efficient Acoustic Scene Classification using Teacher-Informed Confusing Class Instruction | — | 0
Better Knowledge Enhancement for Privacy-Preserving Cross-Project Defect Prediction | — | 0
Generalized Continual Zero-Shot Learning | — | 0
Generalization in birdsong classification: impact of transfer learning methods and dataset characteristics | — | 0
Data-Driven Compression of Convolutional Neural Networks | — | 0
Alignment Knowledge Distillation for Online Streaming Attention-based Speech Recognition | — | 0
Adaptive Instance Distillation for Object Detection in Autonomous Driving | — | 0
GenDistiller: Distilling Pre-trained Language Models based on an Autoregressive Generative Model | — | 0
GenDistiller: Distilling Pre-trained Language Models based on Generative Models | — | 0
G-DetKD: Towards General Distillation Framework for Object Detectors via Contrastive and Semantic-guided Feature Imitation | — | 0
GazeGen: Gaze-Driven User Interaction for Visual Content Generation | — | 0
Gap Preserving Distillation by Building Bidirectional Mappings with A Dynamic Teacher | — | 0
GAN-Knowledge Distillation for one-stage Object Detection | — | 0
GAML-BERT: Improving BERT Early Exiting by Gradient Aligned Mutual Learning | — | 0
DASECount: Domain-Agnostic Sample-Efficient Wireless Indoor Crowd Counting via Few-shot Learning | — | 0
BeSound: Bluetooth-Based Position Estimation Enhancing with Cross-Modality Distillation | — | 0
Galileo at SemEval-2020 Task 12: Multi-lingual Learning for Offensive Language Identification using Pre-trained Language Models | — | 0
GAI-Enabled Explainable Personalized Federated Semi-Supervised Learning | — | 0
BERT Learns to Teach: Knowledge Distillation with Meta Learning | — | 0
Fuzzy Knowledge Distillation from High-Order TSK to Low-Order TSK | — | 0
Future-Guided Incremental Transformer for Simultaneous Translation | — | 0
DAKD: Data Augmentation and Knowledge Distillation using Diffusion Models for SAR Oil Spill Segmentation | — | 0
Fusing Bidirectional Chains of Thought and Reward Mechanisms A Method for Enhancing Question-Answering Capabilities of Large Language Models for Chinese Intangible Cultural Heritage | — | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | — | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | — | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | — | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | — | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | — | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | — | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | — | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | — | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | — | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | — | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | — | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | — | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | — | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | — | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | — | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | — | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | — | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | — | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | — | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: AdaBins, S: MobileNetV2) | RMSE | 2.43 | — | Unverified