Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1451–1500 of 4240 papers

Title	Date	Tasks	Status
Dual-Head Knowledge Distillation: Enhancing Logits Utilization with an Auxiliary Head	Nov 13, 2024	AttributeKnowledge Distillation	—Unverified
UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation	Nov 13, 2024	DecoderFew-Shot Object Detection	—Unverified
Feature Interaction Fusion Self-Distillation Network For CTR Prediction	Nov 12, 2024	Click-Through Rate PredictionKnowledge Distillation	—Unverified
Query Optimization for Parametric Knowledge Refinement in Retrieval-Augmented Large Language Models	Nov 12, 2024	Knowledge DistillationQuestion Answering	—Unverified
Learning with Less: Knowledge Distillation from Large Language Models via Unlabeled Data	Nov 12, 2024	Knowledge Distillation	—Unverified
Joint Diffusion models in Continual Learning	Nov 12, 2024	Continual LearningKnowledge Distillation	—Unverified
Quantifying Knowledge Distillation Using Partial Information Decomposition	Nov 12, 2024	Knowledge DistillationTransfer Learning	—Unverified
An Efficient Memory Module for Graph Few-Shot Class-Incremental Learning	Nov 11, 2024	class-incremental learningClass Incremental Learning	CodeCode Available
CULL-MT: Compression Using Language and Layer pruning for Machine Translation	Nov 10, 2024	Knowledge DistillationMachine Translation	—Unverified
Over-parameterized Student Model via Tensor Decomposition Boosted Knowledge Distillation	Nov 10, 2024	Knowledge DistillationTensor Decomposition	CodeCode Available
Dynamic Textual Prompt For Rehearsal-free Lifelong Person Re-identification	Nov 9, 2024	Knowledge DistillationPerson Re-Identification	—Unverified
Multi-Document Financial Question Answering using LLMs	Nov 8, 2024	Knowledge DistillationKnowledge Graphs	—Unverified
Knowledge Distillation Neural Network for Predicting Car-following Behaviour of Human-driven and Autonomous Vehicles	Nov 8, 2024	Autonomous VehiclesDescriptive	—Unverified
Towards Lifelong Few-Shot Customization of Text-to-Image Diffusion	Nov 8, 2024	Data-free Knowledge DistillationKnowledge Distillation	—Unverified
Asterisk*: Keep it Simple	Nov 8, 2024	ClassificationKnowledge Distillation	—Unverified
Mitigating Hallucination with ZeroG: An Advanced Knowledge Management Engine	Nov 8, 2024	Computational EfficiencyHallucination	—Unverified
Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale	Nov 7, 2024	Active LearningBenchmarking	—Unverified
GazeGen: Gaze-Driven User Interaction for Visual Content Generation	Nov 7, 2024	Gaze EstimationKnowledge Distillation	—Unverified
Towards Personalized Federated Learning via Comprehensive Knowledge Distillation	Nov 6, 2024	Federated LearningKnowledge Distillation	—Unverified
Multimodal Commonsense Knowledge Distillation for Visual Question Answering	Nov 5, 2024	Knowledge DistillationQuestion Answering	—Unverified
Transformer-Based Fault-Tolerant Control for Fixed-Wing UAVs Using Knowledge Distillation and In-Context Adaptation	Nov 5, 2024	Fault DetectionIn-Context Learning	—Unverified
Centerness-based Instance-aware Knowledge Distillation with Task-wise Mutual Lifting for Object Detection on Drone Imagery	Nov 5, 2024	Knowledge Distillationobject-detection	—Unverified
Training on the Test Model: Contamination in Ranking Distillation	Nov 4, 2024	Knowledge Distillation	CodeCode Available
Decoupling Dark Knowledge via Block-wise Logit Distillation for Feature-level Alignment	Nov 3, 2024	Knowledge DistillationPhilosophy	—Unverified
Towards Building Secure UAV Navigation with FHE-aware Knowledge Distillation	Nov 1, 2024	Knowledge DistillationReinforcement Learning (RL)	—Unverified
Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation	Nov 1, 2024	EpidemiologyKnowledge Distillation	—Unverified
On the Impact of White-box Deployment Strategies for Edge AI on Latency and Model Performance	Nov 1, 2024	Knowledge Distillation	—Unverified
Semantic Knowledge Distillation for Onboard Satellite Earth Observation Image Classification	Oct 31, 2024	Earth Observationimage-classification	CodeCode Available
IP-MOT: Instance Prompt Learning for Cross-Domain Multi-Object Tracking	Oct 30, 2024	Knowledge DistillationLanguage Modelling	—Unverified
The Graph's Apprentice: Teaching an LLM Low Level Knowledge for Circuit Quality Estimation	Oct 30, 2024	Knowledge Distillation	—Unverified
Unsupervised Training of a Dynamic Context-Aware Deep Denoising Framework for Low-Dose Fluoroscopic Imaging	Oct 29, 2024	DenoisingDiagnostic	CodeCode Available
Deep Learning for Medical Text Processing: BERT Model Fine-Tuning and Comparative Study	Oct 28, 2024	Knowledge Distillation	—Unverified
Knowledge Distillation for Real-Time Classification of Early Media in Voice Communications	Oct 28, 2024	Audio TaggingClassification	—Unverified
Unveiling Context-Aware Criteria in Self-Assessing LLMs	Oct 28, 2024	Knowledge Distillation	—Unverified
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA	Oct 28, 2024	Knowledge Distillation	—Unverified
SWITCH: Studying with Teacher for Knowledge Distillation of Large Language Models	Oct 25, 2024	Instruction FollowingKnowledge Distillation	—Unverified
Knowledge Distillation Using Frontier Open-source LLMs: Generalizability and the Role of Synthetic Data	Oct 24, 2024	Knowledge DistillationNatural Language Understanding	—Unverified
AlignCap: Aligning Speech Emotion Captioning to Human Preferences	Oct 24, 2024	Knowledge DistillationLanguage Modeling	—Unverified
SIKeD: Self-guided Iterative Knowledge Distillation for mathematical reasoning	Oct 24, 2024	Knowledge DistillationMathematical Reasoning	CodeCode Available
High-dimensional Analysis of Knowledge Distillation: Weak-to-Strong Generalization and Scaling Laws	Oct 24, 2024	Knowledge Distillationregression	—Unverified
Towards Effective Data-Free Knowledge Distillation via Diverse Diffusion Augmentation	Oct 23, 2024	Data-free Knowledge DistillationDiversity	CodeCode Available
ELAICHI: Enhancing Low-resource TTS by Addressing Infrequent and Low-frequency Character Bigrams	Oct 23, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Towards Active Participant-Centric Vertical Federated Learning: Some Representations May Be All You Need	Oct 23, 2024	AllFederated Learning	—Unverified
AttriPrompter: Auto-Prompting with Attribute Semantics for Zero-shot Nuclei Detection via Visual-Language Pre-trained Models	Oct 22, 2024	AttributeKnowledge Distillation	CodeCode Available
SafetyAnalyst: Interpretable, Transparent, and Steerable Safety Moderation for AI Behavior	Oct 22, 2024	Knowledge Distillation	—Unverified
CK4Gen: A Knowledge Distillation Framework for Generating High-Utility Synthetic Survival Datasets in Healthcare	Oct 22, 2024	Data AugmentationKnowledge Distillation	—Unverified
Pre-training Distillation for Large Language Models: A Design Space Exploration	Oct 21, 2024	Knowledge Distillation	—Unverified
Model Mimic Attack: Knowledge Distillation for Provably Transferable Adversarial Examples	Oct 21, 2024	Knowledge Distillation	—Unverified
GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning	Oct 20, 2024	Image RetrievalImage-text Retrieval	CodeCode Available
LLaVA-Ultra: Large Chinese Language and Vision Assistant for Ultrasound	Oct 19, 2024	Instruction FollowingKnowledge Distillation	—Unverified

Show:10 25 50

← PrevPage 30 of 85Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified