Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.
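
Most of the methods listed below extend the classic soft-target formulation of Hinton et al. (2015), in which the student is trained to match the teacher's temperature-softened output distribution in addition to the ground-truth labels. Below is a minimal PyTorch sketch of that loss; the temperature `T`, mixing weight `alpha`, and the function name `distillation_loss` are illustrative choices, not taken from any specific paper on this page.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Soft-target distillation loss (Hinton et al., 2015).

    T and alpha are illustrative hyperparameters, not values from any
    paper listed on this page.
    """
    # Teacher distribution softened by temperature T: a higher T exposes
    # the relative probabilities the teacher assigns to wrong classes.
    soft_targets = F.softmax(teacher_logits / T, dim=-1)
    student_log_probs = F.log_softmax(student_logits / T, dim=-1)
    # The KL term is scaled by T^2 so its gradient magnitude stays
    # comparable to the hard-label term as T varies.
    kd_loss = F.kl_div(student_log_probs, soft_targets,
                       reduction="batchmean") * (T * T)
    # Hard-label cross-entropy keeps the student anchored to ground truth.
    ce_loss = F.cross_entropy(student_logits, labels)
    return alpha * kd_loss + (1.0 - alpha) * ce_loss

# Example with random tensors standing in for real model outputs:
student_logits = torch.randn(8, 100)   # batch of 8, 100 classes
teacher_logits = torch.randn(8, 100)
labels = torch.randint(0, 100, (8,))
loss = distillation_loss(student_logits, teacher_logits, labels)
```

Feature-based and relation-based variants, including several of the papers below, replace or augment this logit-matching term with losses on intermediate representations.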

Papers

Showing 601–650 of 4240 papers

Title | Status | Hype
AlignCap: Aligning Speech Emotion Captioning to Human Preferences | - | 0
Knowledge Distillation Using Frontier Open-source LLMs: Generalizability and the Role of Synthetic Data | - | 0
SIKeD: Self-guided Iterative Knowledge Distillation for mathematical reasoning | Code | 0
High-dimensional Analysis of Knowledge Distillation: Weak-to-Strong Generalization and Scaling Laws | - | 0
Towards Active Participant-Centric Vertical Federated Learning: Some Representations May Be All You Need | - | 0
ELAICHI: Enhancing Low-resource TTS by Addressing Infrequent and Low-frequency Character Bigrams | - | 0
Towards Effective Data-Free Knowledge Distillation via Diverse Diffusion Augmentation | Code | 0
SafetyAnalyst: Interpretable, Transparent, and Steerable Safety Moderation for AI Behavior | - | 0
AttriPrompter: Auto-Prompting with Attribute Semantics for Zero-shot Nuclei Detection via Visual-Language Pre-trained Models | Code | 0
CK4Gen: A Knowledge Distillation Framework for Generating High-Utility Synthetic Survival Datasets in Healthcare | - | 0
MiniPLM: Knowledge Distillation for Pre-Training Language Models | Code | 2
Model Mimic Attack: Knowledge Distillation for Provably Transferable Adversarial Examples | - | 0
Pre-training Distillation for Large Language Models: A Design Space Exploration | - | 0
GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning | Code | 0
LLaVA-Ultra: Large Chinese Language and Vision Assistant for Ultrasound | - | 0
Improving Pronunciation and Accent Conversion through Knowledge Distillation And Synthetic Ground-Truth from Native TTS | - | 0
Interpreting Microbiome Relative Abundance Data Using Symbolic Regression | Code | 0
DiSCo: LLM Knowledge Distillation for Efficient Sparse Retrieval in Conversational Search | Code | 0
Preview-based Category Contrastive Learning for Knowledge Distillation | - | 0
Unlearning Backdoor Attacks for LLMs with Weak-to-Strong Knowledge Distillation | Code | 0
CAKD: A Correlation-Aware Knowledge Distillation Framework Based on Decoupling Kullback-Leibler Divergence | - | 0
FTSmartAudit: A Knowledge Distillation-Enhanced Framework for Automated Smart Contract Auditing Using Fine-Tuned LLMs | - | 0
An Active Learning Framework for Inclusive Generation by Large Language Models | - | 0
Towards Satellite Non-IID Imagery: A Spectral Clustering-Assisted Federated Learning Approach | - | 0
Proactive Detection and Calibration of Seasonal Advertisements with Multimodal Large Language Models | - | 0
TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration | Code | 1
Optimizing YOLOv5s Object Detection through Knowledge Distillation algorithm | - | 0
SAM-Guided Masked Token Prediction for 3D Scene Understanding | - | 0
TAS: Distilling Arbitrary Teacher and Student via a Hybrid Assistant | - | 0
MoE-Pruner: Pruning Mixture-of-Experts Large Language Model using the Hints from Its Router | - | 0
Learning from Imperfect Data: Towards Efficient Knowledge Distillation of Autoregressive Language Models for Text-to-SQL | - | 0
Breaking Modality Gap in RGBT Tracking: Coupled Knowledge Distillation | Code | 1
Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling | - | 0
Temperature-Centric Investigation of Speculative Decoding with Knowledge Distillation | Code | 0
REHRSeg: Unleashing the Power of Self-Supervised Super-Resolution for Resource-Efficient 3D MRI Segmentation | Code | 0
ROSAR: An Adversarial Re-Training Framework for Robust Side-Scan Sonar Object Detection | Code | 0
Large Model for Small Data: Foundation Model for Cross-Modal RF Human Activity Recognition | - | 0
Distilling Invariant Representations with Dual Augmentation | - | 0
Declarative Knowledge Distillation from Large Language Models for Visual Question Answering Datasets | Code | 0
Mentor-KD: Making Small Language Models Better Multi-step Reasoners | Code | 1
Transforming In-Vehicle Network Intrusion Detection: VAE-based Knowledge Distillation Meets Explainable AI | - | 0
GAI-Enabled Explainable Personalized Federated Semi-Supervised Learning | - | 0
Simultaneous Reward Distillation and Preference Learning: Get You a Language Model Who Can Do Both | - | 0
What is Left After Distillation? How Knowledge Transfer Impacts Fairness and Bias | - | 0
Relational Diffusion Distillation for Efficient Image Generation | Code | 0
SNN-PAR: Energy Efficient Pedestrian Attribute Recognition via Spiking Neural Networks | - | 0
A Lightweight Target-Driven Network of Stereo Matching for Inland Waterways | Code | 0
Unlocking Real-Time Fluorescence Lifetime Imaging: Multi-Pixel Parallelism for FPGA-Accelerated Processing | - | 0
S2HPruner: Soft-to-Hard Distillation Bridges the Discretization Gap in Pruning | - | 0
Structure-Centric Robust Monocular Depth Estimation via Knowledge Distillation | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | - | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | - | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | - | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | - | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | - | Unverified
6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | - | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | - | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | - | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | - | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 79.86 | - | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 78.76 | - | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 Accuracy (%) | 78.6 | - | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 78.28 | - | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 Accuracy (%) | 78.08 | - | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 77.93 | - | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 Accuracy (%) | 77.68 | - | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 77.5 | - | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.68 | - | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.31 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | - | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | - | Unverified