SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized, so a well-trained small model can often recover much of the large model's performance by learning to mimic its outputs.
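
The standard recipe (Hinton et al., 2015) trains the student against a weighted mix of the usual cross-entropy loss on ground-truth labels and a KL-divergence term that matches the student's temperature-softened output distribution to the teacher's. A minimal PyTorch sketch follows; the temperature T and weight alpha are illustrative choices, not values taken from any paper below.

```python
# Minimal soft-target distillation loss (Hinton et al., 2015).
# T and alpha are illustrative hyperparameters, not from any listed paper.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.7):
    # Hard-label term: ordinary cross-entropy against the ground truth.
    ce = F.cross_entropy(student_logits, labels)
    # Soft-label term: KL divergence between temperature-softened teacher
    # and student distributions, scaled by T^2 to keep gradient magnitudes
    # comparable across temperatures.
    kd = F.kl_div(
        F.log_softmax(student_logits / T, dim=1),
        F.softmax(teacher_logits / T, dim=1),
        reduction="batchmean",
    ) * (T * T)
    return alpha * kd + (1.0 - alpha) * ce
```

In training, the teacher runs frozen in eval mode (its logits computed under torch.no_grad()) and only the student's parameters receive gradients.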

Papers

Showing 1151–1200 of 4240 papers

Title | Status | Hype
Okay, Let's Do This! Modeling Event Coreference with Generated Rationales and Knowledge Distillation | Code | 0
On the Surprising Efficacy of Distillation as an Alternative to Pre-Training Small Models | Code | 0
Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought | - | 0
Improve Knowledge Distillation via Label Revision and Data Selection | - | 0
Knowledge Distillation with Multi-granularity Mixture of Priors for Image Super-Resolution | - | 0
Rethinking Kullback-Leibler Divergence in Knowledge Distillation for Large Language Models | Code | 1
Adaptive Affinity-Based Generalization For MRI Imaging Segmentation Across Resource-Limited Settings | - | 0
Foundation Models for Structural Health Monitoring | Code | 0
Rethinking Pruning for Vision-Language Models: Strategies for Effective Sparsity and Performance Restoration | Code | 1
Federated Distillation: A Survey | - | 0
Task Integration Distillation for Object Detectors | - | 0
Class-Incremental Few-Shot Event Detection | - | 0
TSCM: A Teacher-Student Model for Vision Place Recognition Using Cross-Metric Knowledge Distillation | Code | 1
Towards Scalable & Efficient Interaction-Aware Planning in Autonomous Vehicles using Knowledge Distillation | - | 0
Pre-trained Vision and Language Transformers Are Few-Shot Incremental Learners | Code | 2
A Comprehensive Review of Knowledge Distillation in Computer Vision | - | 0
LLM-RadJudge: Achieving Radiologist-Level Evaluation for X-Ray Report Generation | - | 0
PDF: A Probability-Driven Framework for Open World 3D Point Cloud Semantic Segmentation | Code | 1
SUGAR: Pre-training 3D Visual Representations for Robotics | - | 0
Weak-to-Strong 3D Object Detection with X-Ray Distillation | Code | 0
DMSSN: Distilled Mixed Spectral-Spatial Network for Hyperspectral Salient Object Detection | Code | 0
Orchestrate Latent Expertise: Advancing Online Continual Learning with Multi-Level Supervision and Reverse Self-Distillation | Code | 1
ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning | Code | 2
GOLD: Generalized Knowledge Distillation via Out-of-Distribution-Guided Language Data Generation | - | 0
De-confounded Data-free Knowledge Distillation for Handling Distribution Shifts | - | 0
CRKD: Enhanced Camera-Radar Object Detection with Cross-modality Knowledge Distillation | - | 0
I2CKD : Intra- and Inter-Class Knowledge Distillation for Semantic Segmentation | - | 0
Enhancing Metaphor Detection through Soft Labels and Target Word Prediction | - | 0
Is Modularity Transferable? A Case Study through the Lens of Knowledge Distillation | Code | 0
Oh! We Freeze: Improving Quantized Knowledge Distillation via Signal Propagation Analysis for Large Language Models | - | 0
KDMCSE: Knowledge Distillation Multimodal Sentence Embeddings with Adaptive Angular margin Contrastive Learning | Code | 1
Order of Compression: A Systematic and Optimal Sequence to Combinationally Compress CNN | - | 0
From Two-Stream to One-Stream: Efficient RGB-T Tracking via Mutual Prompt Learning and Knowledge Distillation | - | 0
ToXCL: A Unified Framework for Toxic Speech Detection and Explanation | Code | 1
Configurable Holography: Towards Display and Scene Adaptation | - | 0
iDAT: inverse Distillation Adapter-Tuning | Code | 1
Learning to Project for Cross-Task Knowledge Distillation | - | 0
MMIDR: Teaching Large Language Model to Interpret Multimodal Misinformation via Knowledge Distillation | Code | 1
Fed-RAC: Resource-Aware Clustering for Tackling Heterogeneity of Participants in Federated Learning | Code | 0
Facilitating Pornographic Text Detection for Open-Domain Dialogue Systems via Knowledge Distillation of Large Language Models | Code | 0
Instruction Multi-Constraint Molecular Generation Using a Teacher-Student Large Language Model | Code | 1
Scale Decoupled Distillation | Code | 2
TransformMix: Learning Transformation and Mixing Strategies from Data | - | 0
Scheduled Knowledge Acquisition on Lightweight Vector Symbolic Architectures for Brain-Computer Interfaces | - | 0
KnFu: Effective Knowledge Fusion | - | 0
HVDistill: Transferring Knowledge from Images to Point Clouds via Unsupervised Hybrid-View Distillation | Code | 0
TTT-KD: Test-Time Training for 3D Semantic Segmentation through Knowledge Distillation from Foundation Models | - | 0
Multiple Teachers-Meticulous Student: A Domain Adaptive Meta-Knowledge Distillation Model for Medical Image Classification | Code | 0
Self-Supervised Quantization-Aware Knowledge Distillation | Code | 1
FlyKD: Graph Knowledge Distillation on the Fly with Curriculum Learning | - | 0
Page 24 of 85

Benchmark Results

Each table below reports claimed results for a different benchmark; a dash in the Verified column means no independently verified value has been recorded yet, matching the Unverified status.

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | - | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | - | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | - | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | - | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | - | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | - | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | - | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | - | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | - | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | - | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | - | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | - | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | - | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | - | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | - | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | - | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | - | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | - | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | - | Unverified
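
For reference, the Top-1 accuracy used in the two tables above simply measures how often the class with the highest logit matches the ground-truth label. A minimal sketch (tensor shapes are assumptions):

```python
import torch

def top1_accuracy(logits: torch.Tensor, labels: torch.Tensor) -> float:
    # logits: (N, num_classes); labels: (N,). Returns accuracy in percent.
    return 100.0 * (logits.argmax(dim=1) == labels).float().mean().item()
```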
# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | - | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: AdaBins, S: MobileNetV2) | RMSE | 2.43 | - | Unverified
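
Unlike the accuracy and mAP figures above, RMSE is an error metric, so lower is better. A minimal sketch of RMSE between predicted and ground-truth depth maps (shapes and units are assumptions):

```python
import torch

def depth_rmse(pred: torch.Tensor, target: torch.Tensor) -> float:
    # pred, target: (N, H, W) depth maps in the same units (e.g. metres).
    return torch.sqrt(torch.mean((pred - target) ** 2)).item()
```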