Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1501–1550 of 4240 papers

Title	Date	Tasks	Status
Improving Pronunciation and Accent Conversion through Knowledge Distillation And Synthetic Ground-Truth from Native TTS	Oct 19, 2024	Knowledge Distillation	—Unverified
DiSCo: LLM Knowledge Distillation for Efficient Sparse Retrieval in Conversational Search	Oct 18, 2024	Conversational Information AccessConversational Search	CodeCode Available
Interpreting Microbiome Relative Abundance Data Using Symbolic Regression	Oct 18, 2024	DiagnosticKnowledge Distillation	CodeCode Available
Unlearning Backdoor Attacks for LLMs with Weak-to-Strong Knowledge Distillation	Oct 18, 2024	Backdoor AttackKnowledge Distillation	CodeCode Available
Preview-based Category Contrastive Learning for Knowledge Distillation	Oct 18, 2024	Contrastive LearningKnowledge Distillation	—Unverified
An Active Learning Framework for Inclusive Generation by Large Language Models	Oct 17, 2024	Active LearningClustering	—Unverified
Towards Satellite Non-IID Imagery: A Spectral Clustering-Assisted Federated Learning Approach	Oct 17, 2024	Earth ObservationFederated Learning	—Unverified
FTSmartAudit: A Knowledge Distillation-Enhanced Framework for Automated Smart Contract Auditing Using Fine-Tuned LLMs	Oct 17, 2024	Dataset GenerationKnowledge Distillation	—Unverified
CAKD: A Correlation-Aware Knowledge Distillation Framework Based on Decoupling Kullback-Leibler Divergence	Oct 17, 2024	Binary ClassificationKnowledge Distillation	—Unverified
Optimizing YOLOv5s Object Detection through Knowledge Distillation algorithm	Oct 16, 2024	Knowledge DistillationObject	—Unverified
TAS: Distilling Arbitrary Teacher and Student via a Hybrid Assistant	Oct 16, 2024	Knowledge DistillationTransfer Learning	—Unverified
SAM-Guided Masked Token Prediction for 3D Scene Understanding	Oct 16, 2024	3D Object DetectionKnowledge Distillation	—Unverified
Proactive Detection and Calibration of Seasonal Advertisements with Multimodal Large Language Models	Oct 16, 2024	Knowledge Distillation	—Unverified
Learning from Imperfect Data: Towards Efficient Knowledge Distillation of Autoregressive Language Models for Text-to-SQL	Oct 15, 2024	Knowledge DistillationText to SQL	—Unverified
Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling	Oct 15, 2024	Instruction FollowingKnowledge Distillation	—Unverified
MoE-Pruner: Pruning Mixture-of-Experts Large Language Model using the Hints from Its Router	Oct 15, 2024	Knowledge DistillationLanguage Modeling	—Unverified
Temperature-Centric Investigation of Speculative Decoding with Knowledge Distillation	Oct 14, 2024	Knowledge Distillation	CodeCode Available
REHRSeg: Unleashing the Power of Self-Supervised Super-Resolution for Resource-Efficient 3D MRI Segmentation	Oct 14, 2024	Knowledge DistillationMedical Image Analysis	CodeCode Available
ROSAR: An Adversarial Re-Training Framework for Robust Side-Scan Sonar Object Detection	Oct 14, 2024	Knowledge Distillationobject-detection	CodeCode Available
Large Model for Small Data: Foundation Model for Cross-Modal RF Human Activity Recognition	Oct 13, 2024	Activity RecognitionFew-Shot Learning	—Unverified
Declarative Knowledge Distillation from Large Language Models for Visual Question Answering Datasets	Oct 12, 2024	Knowledge DistillationQuestion Answering	CodeCode Available
Distilling Invariant Representations with Dual Augmentation	Oct 12, 2024	Knowledge Distillation	—Unverified
Simultaneous Reward Distillation and Preference Learning: Get You a Language Model Who Can Do Both	Oct 11, 2024	Knowledge DistillationLanguage Modeling	—Unverified
GAI-Enabled Explainable Personalized Federated Semi-Supervised Learning	Oct 11, 2024	Federated LearningKnowledge Distillation	—Unverified
Transforming In-Vehicle Network Intrusion Detection: VAE-based Knowledge Distillation Meets Explainable AI	Oct 11, 2024	Autonomous VehiclesIntrusion Detection	—Unverified
A Lightweight Target-Driven Network of Stereo Matching for Inland Waterways	Oct 10, 2024	Autonomous NavigationKnowledge Distillation	CodeCode Available
Relational Diffusion Distillation for Efficient Image Generation	Oct 10, 2024	Image GenerationKnowledge Distillation	CodeCode Available
What is Left After Distillation? How Knowledge Transfer Impacts Fairness and Bias	Oct 10, 2024	Age/UnbiasedFairness	—Unverified
SNN-PAR: Energy Efficient Pedestrian Attribute Recognition via Spiking Neural Networks	Oct 10, 2024	AttributeKnowledge Distillation	—Unverified
Unlocking Real-Time Fluorescence Lifetime Imaging: Multi-Pixel Parallelism for FPGA-Accelerated Processing	Oct 9, 2024	Knowledge DistillationScheduling	—Unverified
Efficient and Robust Knowledge Distillation from A Stronger Teacher Based on Correlation Matching	Oct 9, 2024	Knowledge DistillationNeural Network Compression	—Unverified
S2HPruner: Soft-to-Hard Distillation Bridges the Discretization Gap in Pruning	Oct 9, 2024	Knowledge Distillation	—Unverified
Structure-Centric Robust Monocular Depth Estimation via Knowledge Distillation	Oct 9, 2024	Depth EstimationKnowledge Distillation	—Unverified
KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from Server	Oct 8, 2024	Federated LearningKnowledge Distillation	CodeCode Available
ReasoningRank: Teaching Student Models to Rank through Reasoning-Based Knowledge Distillation	Oct 7, 2024	Decision MakingInformation Retrieval	—Unverified
Progressive distillation induces an implicit curriculum	Oct 7, 2024	Knowledge Distillation	—Unverified
DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs	Oct 6, 2024	Domain AdaptationKnowledge Distillation	CodeCode Available
CAPEEN: Image Captioning with Early Exits and Knowledge Distillation	Oct 6, 2024	DescriptiveImage Captioning	CodeCode Available
DiDOTS: Knowledge Distillation from Large-Language-Models for Dementia Obfuscation in Transcribed Speech	Oct 5, 2024	HallucinationKnowledge Distillation	—Unverified
Accelerating Diffusion Models with One-to-Many Knowledge Distillation	Oct 5, 2024	Image GenerationKnowledge Distillation	—Unverified
Gap Preserving Distillation by Building Bidirectional Mappings with A Dynamic Teacher	Oct 5, 2024	Knowledge Distillation	—Unverified
Self-Supervised Keypoint Detection with Distilled Depth Keypoint Representation	Oct 4, 2024	Keypoint DetectionKnowledge Distillation	—Unverified
DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models	Oct 4, 2024	document understandingKnowledge Distillation	—Unverified
Dataset Distillation via Knowledge Distillation: Towards Efficient Self-Supervised Pre-Training of Deep Networks	Oct 3, 2024	Dataset DistillationKnowledge Distillation	CodeCode Available
BLEND: Behavior-guided Neural Population Dynamics Modeling via Privileged Knowledge Distillation	Oct 2, 2024	Knowledge DistillationTime Series Analysis	CodeCode Available
Foldable SuperNets: Scalable Merging of Transformers with Different Initializations and Tasks	Oct 2, 2024	Knowledge Distillation	CodeCode Available
PHI-S: Distribution Balancing for Label-Free Multi-Teacher Distillation	Oct 2, 2024	Knowledge Distillation	—Unverified
"No Matter What You Do": Purifying GNN Models via Backdoor Unlearning	Oct 2, 2024	Backdoor Attackbackdoor defense	CodeCode Available
AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge Distillation for Large Language Models in Code Generation	Oct 1, 2024	Code GenerationHumanEval	CodeCode Available
Self-Updatable Large Language Models with Parameter Integration	Oct 1, 2024	Continual LearningConversational Recommendation	—Unverified

Show:10 25 50

← PrevPage 31 of 85Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified