Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 651–700 of 4240 papers

Title	Date	Tasks	Status	Hype
Efficient and Robust Knowledge Distillation from A Stronger Teacher Based on Correlation Matching	Oct 9, 2024	Knowledge DistillationNeural Network Compression	—Unverified	0
KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from Server	Oct 8, 2024	Federated LearningKnowledge Distillation	CodeCode Available	0
Progressive distillation induces an implicit curriculum	Oct 7, 2024	Knowledge Distillation	—Unverified	0
ReasoningRank: Teaching Student Models to Rank through Reasoning-Based Knowledge Distillation	Oct 7, 2024	Decision MakingInformation Retrieval	—Unverified	0
DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs	Oct 6, 2024	Domain AdaptationKnowledge Distillation	CodeCode Available	0
CAPEEN: Image Captioning with Early Exits and Knowledge Distillation	Oct 6, 2024	DescriptiveImage Captioning	CodeCode Available	0
DiDOTS: Knowledge Distillation from Large-Language-Models for Dementia Obfuscation in Transcribed Speech	Oct 5, 2024	HallucinationKnowledge Distillation	—Unverified	0
Distillation-Free One-Step Diffusion for Real-World Image Super-Resolution	Oct 5, 2024	Image Super-ResolutionKnowledge Distillation	CodeCode Available	2
Accelerating Diffusion Models with One-to-Many Knowledge Distillation	Oct 5, 2024	Image GenerationKnowledge Distillation	—Unverified	0
Gap Preserving Distillation by Building Bidirectional Mappings with A Dynamic Teacher	Oct 5, 2024	Knowledge Distillation	—Unverified	0
Self-Supervised Keypoint Detection with Distilled Depth Keypoint Representation	Oct 4, 2024	Keypoint DetectionKnowledge Distillation	—Unverified	0
Learning from Committee: Reasoning Distillation from a Mixture of Teachers with Peer-Review	Oct 4, 2024	Knowledge DistillationLogical Reasoning	CodeCode Available	2
DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models	Oct 4, 2024	document understandingKnowledge Distillation	—Unverified	0
Dataset Distillation via Knowledge Distillation: Towards Efficient Self-Supervised Pre-Training of Deep Networks	Oct 3, 2024	Dataset DistillationKnowledge Distillation	CodeCode Available	0
BLEND: Behavior-guided Neural Population Dynamics Modeling via Privileged Knowledge Distillation	Oct 2, 2024	Knowledge DistillationTime Series Analysis	CodeCode Available	0
PairDistill: Pairwise Relevance Distillation for Dense Retrieval	Oct 2, 2024	Information RetrievalKnowledge Distillation	CodeCode Available	1
PHI-S: Distribution Balancing for Label-Free Multi-Teacher Distillation	Oct 2, 2024	Knowledge Distillation	—Unverified	0
"No Matter What You Do": Purifying GNN Models via Backdoor Unlearning	Oct 2, 2024	Backdoor Attackbackdoor defense	CodeCode Available	0
Foldable SuperNets: Scalable Merging of Transformers with Different Initializations and Tasks	Oct 2, 2024	Knowledge Distillation	CodeCode Available	0
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models	Oct 2, 2024	Data AugmentationKnowledge Distillation	CodeCode Available	1
Self-Updatable Large Language Models with Parameter Integration	Oct 1, 2024	Continual LearningConversational Recommendation	—Unverified	0
Local-to-Global Self-Supervised Representation Learning for Diabetic Retinopathy Grading	Oct 1, 2024	Diabetic Retinopathy Gradingimage-classification	—Unverified	0
AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge Distillation for Large Language Models in Code Generation	Oct 1, 2024	Code GenerationHumanEval	CodeCode Available	0
Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging	Oct 1, 2024	Computational EfficiencyKnowledge Distillation	—Unverified	0
Efficient Technical Term Translation: A Knowledge Distillation Approach for Parenthetical Terminology Translation	Oct 1, 2024	Knowledge DistillationMachine Translation	—Unverified	0
Advancing Medical Radiograph Representation Learning: A Hybrid Pre-training Paradigm with Multilevel Semantic Granularity	Oct 1, 2024	DecoderKnowledge Distillation	—Unverified	0
Enhancing Romanian Offensive Language Detection through Knowledge Distillation, Multi-Task Learning, and Data Augmentation	Sep 30, 2024	Data AugmentationKnowledge Distillation	—Unverified	0
Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies	Sep 30, 2024	2D Human Pose Estimationimage-classification	—Unverified	0
HYDRA-FL: Hybrid Knowledge Distillation for Robust and Accurate Federated Learning	Sep 30, 2024	Federated LearningKnowledge Distillation	—Unverified	0
Linear Projections of Teacher Embeddings for Few-Class Distillation	Sep 30, 2024	Binary ClassificationKnowledge Distillation	—Unverified	0
Domain Consistency Representation Learning for Lifelong Person Re-Identification	Sep 30, 2024	AttributeKnowledge Distillation	CodeCode Available	1
Tailored Federated Learning: Leveraging Direction Regulation & Knowledge Distillation	Sep 29, 2024	Federated LearningKnowledge Distillation	—Unverified	0
InfantCryNet: A Data-driven Framework for Intelligent Analysis of Infant Cries	Sep 29, 2024	Knowledge DistillationModel Compression	—Unverified	0
Mind the Gap: Promoting Missing Modality Brain Tumor Segmentation with Alignment	Sep 28, 2024	Brain Tumor SegmentationKnowledge Distillation	—Unverified	0
Multi-modal Cross-domain Self-supervised Pre-training for fMRI and EEG Fusion	Sep 27, 2024	Data AugmentationEEG	—Unverified	0
Semi-Supervised Bone Marrow Lesion Detection from Knee MRI Segmentation Using Mask Inpainting Models	Sep 27, 2024	Anomaly DetectionKnowledge Distillation	—Unverified	0
Student-Oriented Teacher Knowledge Refinement for Knowledge Distillation	Sep 27, 2024	Knowledge DistillationTransfer Learning	—Unverified	0
Towards Diverse Device Heterogeneous Federated Learning via Task Arithmetic Knowledge Integration	Sep 27, 2024	Federated LearningKnowledge Distillation	CodeCode Available	0
MiniVLN: Efficient Vision-and-Language Navigation by Progressive Knowledge Distillation	Sep 27, 2024	Knowledge DistillationVision and Language Navigation	—Unverified	0
Harmonizing knowledge Transfer in Neural Network with Unified Distillation	Sep 27, 2024	Knowledge DistillationTransfer Learning	—Unverified	0
Kendall's τ Coefficient for Logits Distillation	Sep 26, 2024	Knowledge Distillation	—Unverified	0
Weak-to-Strong Backdoor Attack for Large Language Models	Sep 26, 2024	Backdoor AttackKnowledge Distillation	—Unverified	0
Shape-intensity knowledge distillation for robust medical image segmentation	Sep 26, 2024	Image SegmentationKnowledge Distillation	CodeCode Available	0
MT2KD: Towards A General-Purpose Encoder for Speech, Speaker, and Audio Events	Sep 25, 2024	Audio TaggingAutomatic Speech Recognition	—Unverified	0
Adverse Weather Optical Flow: Cumulative Homogeneous-Heterogeneous Adaptation	Sep 25, 2024	Domain AdaptationKnowledge Distillation	—Unverified	0
SelectiveKD: A semi-supervised framework for cancer detection in DBT through Knowledge Distillation and Pseudo-labeling	Sep 25, 2024	Cancer ClassificationKnowledge Distillation	—Unverified	0
Privacy Evaluation Benchmarks for NLP Models	Sep 24, 2024	Knowledge Distillation	CodeCode Available	0
AIM 2024 Challenge on UHD Blind Photo Quality Assessment	Sep 24, 2024	4kComputational Efficiency	CodeCode Available	1
Twin Network Augmentation: A Novel Training Strategy for Improved Spiking Neural Networks and Efficient Weight Quantization	Sep 24, 2024	Knowledge DistillationQuantization	—Unverified	0
TS-HTFA: Advancing Time Series Forecasting via Hierarchical Text-Free Alignment with Large Language Models	Sep 23, 2024	Contrastive Learningcross-modal alignment	—Unverified	0

Show:10 25 50

← PrevPage 14 of 85Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified