Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1051–1075 of 4240 papers

Title	Date	Tasks	Status	Hype
Joint Optimization of Streaming and Non-Streaming Automatic Speech Recognition with Multi-Decoder and Knowledge Distillation	May 22, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
HoverFast: an accurate, high-throughput, clinically deployable nuclear segmentation tool for brightfield digital pathology images	May 22, 2024	GPUKnowledge Distillation	—Unverified	0
Why Not Transform Chat Large Language Models to Non-English?	May 22, 2024	Knowledge Distillation	CodeCode Available	0
Exploring Dark Knowledge under Various Teacher Capacities and Addressing Capacity Mismatch	May 21, 2024	Knowledge Distillation	—Unverified	0
AMFD: Distillation via Adaptive Multimodal Fusion for Multispectral Pedestrian Detection	May 21, 2024	Knowledge DistillationPedestrian Detection	CodeCode Available	1
Active Object Detection with Knowledge Aggregation and Distillation from Large Models	May 21, 2024	Active Object DetectionDecision Making	CodeCode Available	0
CLRKDNet: Speeding up Lane Detection with Knowledge Distillation	May 21, 2024	Autonomous DrivingKnowledge Distillation	CodeCode Available	1
TinyM^2Net-V3: Memory-Aware Compressed Multimodal Deep Neural Networks for Sustainable Edge Deployment	May 20, 2024	Knowledge DistillationModel Compression	—Unverified	0
GeoMask3D: Geometrically Informed Mask Selection for Self-Supervised Point Cloud Learning in 3D	May 20, 2024	Knowledge DistillationSelf-Supervised Learning	—Unverified	0
Efficiency optimization of large-scale language models based on deep learning in natural language processing tasks	May 20, 2024	Inference OptimizationKnowledge Distillation	—Unverified	0
Federated Learning for Time-Series Healthcare Sensing with Incomplete Modalities	May 20, 2024	Computational EfficiencyFederated Learning	CodeCode Available	0
Distill-then-prune: An Efficient Compression Framework for Real-time Stereo Matching Network on Edge Devices	May 20, 2024	Knowledge DistillationStereo Matching	—Unverified	0
Evolving Storytelling: Benchmarks and Methods for New Character Customization with Diffusion Models	May 20, 2024	Knowledge DistillationStory Generation	—Unverified	0
Stereo-Knowledge Distillation from dpMV to Dual Pixels for Light Field Video Reconstruction	May 20, 2024	Autonomous DrivingKnowledge Distillation	—Unverified	0
Hierarchical Selective Classification	May 19, 2024	ClassificationKnowledge Distillation	—Unverified	0
Nickel and Diming Your GAN: A Dual-Method Approach to Enhancing GAN Efficiency via Knowledge Distillation	May 19, 2024	Knowledge Distillation	—Unverified	0
Cross-Domain Knowledge Distillation for Low-Resolution Human Pose Estimation	May 19, 2024	Knowledge DistillationPose Estimation	—Unverified	0
Overcoming Data and Model Heterogeneities in Decentralized Federated Learning via Synthetic Anchors	May 19, 2024	Domain AdaptationFederated Learning	CodeCode Available	1
INDUS: Effective and Efficient Language Models for Scientific Applications	May 17, 2024	Contrastive LearningInformation Retrieval	—Unverified	0
Densely Distilling Cumulative Knowledge for Continual Learning	May 16, 2024	AllContinual Learning	—Unverified	0
Distilling Implicit Multimodal Knowledge into Large Language Models for Zero-Resource Dialogue Generation	May 16, 2024	Dialogue GenerationKnowledge Distillation	CodeCode Available	0
QCRD: Quality-guided Contrastive Rationale Distillation for Large Language Models	May 14, 2024	Contrastive LearningDenoising	—Unverified	0
GLiRA: Black-Box Membership Inference Attack via Knowledge Distillation	May 13, 2024	image-classificationImage Classification	CodeCode Available	0
Meta-Learned Modality-Weighted Knowledge Distillation for Robust Multi-Modal Learning with Missing Data	May 12, 2024	Brain Tumor SegmentationClassification	CodeCode Available	0
AdaKD: Dynamic Knowledge Distillation of ASR models using Adaptive Loss Weighting	May 11, 2024	Knowledge DistillationModel Compression	—Unverified	0

Show:10 25 50

← PrevPage 43 of 170Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified