Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1776–1800 of 4240 papers

Title	Date	Tasks	Status
On Reducing Activity with Distillation and Regularization for Energy Efficient Spiking Neural Networks	Jun 26, 2024	Knowledge Distillation	—Unverified
Sequential Editing for Lifelong Training of Speech Recognition Models	Jun 25, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Towards Optimal Trade-offs in Knowledge Distillation for CNNs and Vision Transformers at the Edge	Jun 25, 2024	Knowledge Distillation	—Unverified
Preserving Node Distinctness in Graph Autoencoders via Similarity Distillation	Jun 25, 2024	DecoderKnowledge Distillation	—Unverified
WAVE: Weight Template for Adaptive Initialization of Variable-sized Models	Jun 25, 2024	Knowledge DistillationTransfer Learning	—Unverified
Knowledge Distillation in Automated Annotation: Supervised Text Classification with LLM-Generated Training Labels	Jun 25, 2024	ArticlesIn-Context Learning	—Unverified
Highly Constrained Coded Aperture Imaging Systems Design Via a Knowledge Distillation Approach	Jun 25, 2024	Image ReconstructionKnowledge Distillation	—Unverified
InFiConD: Interactive No-code Fine-tuning with Concept-based Knowledge Distillation	Jun 25, 2024	Knowledge Distillation	—Unverified
Leveraging Knowledge Distillation for Lightweight Skin Cancer Classification: Balancing Accuracy and Computational Efficiency	Jun 24, 2024	Cancer ClassificationComputational Efficiency	—Unverified
Exploring compressibility of transformer based text-to-music (TTM) models	Jun 24, 2024	DecoderFAD	—Unverified
Enhancing OOD Detection Using Latent Diffusion	Jun 24, 2024	Contrastive LearningKnowledge Distillation	CodeCode Available
The Privileged Students: On the Value of Initialization in Multilingual Knowledge Distillation	Jun 24, 2024	Knowledge Distillation	—Unverified
Continual Learning with Diffusion-based Generative Replay for Industrial Streaming Data	Jun 22, 2024	Continual LearningKnowledge Distillation	—Unverified
Fair Text to Medical Image Diffusion Model with Subgroup Distribution Aligned Tuning	Jun 21, 2024	Knowledge Distillation	—Unverified
Reinforced Knowledge Distillation for Time Series Regression	Jun 21, 2024	Knowledge DistillationModel Compression	CodeCode Available
Failure-Resilient Distributed Inference with Model Compression over Heterogeneous Edge Devices	Jun 20, 2024	Knowledge DistillationModel Compression	—Unverified
Factual Dialogue Summarization via Learning from Large Language Models	Jun 20, 2024	Contrastive LearningData Augmentation	—Unverified
SeCoKD: Aligning Large Language Models for In-Context Learning with Fewer Shots	Jun 20, 2024	In-Context LearningKnowledge Distillation	—Unverified
Apprenticeship-Inspired Elegance: Synergistic Knowledge Distillation Empowers Spiking Neural Networks for Efficient Single-Eye Emotion Recognition	Jun 20, 2024	Emotion RecognitionKnowledge Distillation	—Unverified
Multi-Stage Balanced Distillation: Addressing Long-Tail Challenges in Sequence-Level Knowledge Distillation	Jun 19, 2024	Knowledge Distillation	CodeCode Available
WaterMono: Teacher-Guided Anomaly Masking and Enhancement Boosting for Robust Underwater Self-Supervised Monocular Depth Estimation	Jun 19, 2024	Depth EstimationImage Enhancement	CodeCode Available
Can Low-Rank Knowledge Distillation in LLMs be Useful for Microelectronic Reasoning?	Jun 19, 2024	Knowledge Distillation	—Unverified
Federated Learning with a Single Shared Image	Jun 18, 2024	Federated LearningKnowledge Distillation	CodeCode Available
Enhancing Single-Slice Segmentation with 3D-to-2D Unpaired Scan Distillation	Jun 18, 2024	Computed Tomography (CT)Knowledge Distillation	—Unverified
Vernacular? I Barely Know Her: Challenges with Style Control and Stereotyping	Jun 18, 2024	Knowledge Distillation	—Unverified

Show:10 25 50

← PrevPage 72 of 170Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified