SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.
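In its most common form (soft-target distillation, Hinton et al., 2015), the student is trained both on the ground-truth labels and on the teacher's temperature-softened output distribution. A minimal PyTorch sketch of that loss follows; the temperature T and mixing weight alpha are illustrative defaults, not values taken from any paper listed on this page:

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.5):
    """Soft-target distillation loss (Hinton et al., 2015).

    Mixes hard-label cross-entropy with a temperature-scaled KL term that
    pulls the student's softened predictions toward the teacher's.
    T and alpha are tuning knobs chosen here for illustration.
    """
    # Hard-label term: ordinary cross-entropy against the ground truth.
    ce = F.cross_entropy(student_logits, labels)
    # Soft-target term: KL divergence between temperature-softened
    # distributions. The T**2 factor keeps gradient magnitudes roughly
    # comparable across temperatures.
    kl = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    return alpha * ce + (1.0 - alpha) * kl
```

Dividing the logits by T > 1 before the softmax flattens the distributions, exposing the teacher's relative preferences among the wrong classes, which is the extra signal the student distills.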

Papers

Showing 1201–1225 of 4240 papers

Title | Status | Hype
LookALike: Human Mimicry based collaborative decision making | - | 0
Group-Mix SAM: Lightweight Solution for Industrial Assembly Line Applications | - | 0
Histo-Genomic Knowledge Distillation For Cancer Prognosis From Histopathology Whole Slide Images | Code | 1
Recurrent Drafter for Fast Speculative Decoding in Large Language Models | Code | 3
Adapting OC20-trained EquiformerV2 Models for High-Entropy Materials | - | 0
Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models | - | 0
Open-Vocabulary Object Detection with Meta Prompt Representation and Instance Contrastive Optimization | - | 0
Knowledge Distillation in YOLOX-ViT for Side-Scan Sonar Object Detection | Code | 2
SpikeReveal: Unlocking Temporal Sequences from Real Blurry Inputs with Spike Streams | Code | 1
MT-PATCHER: Selective and Extendable Knowledge Distillation from Large Language Models for Machine Translation | Code | 0
Distilling Named Entity Recognition Models for Endangered Species from Large Language Models | - | 0
Training Self-localization Models for Unseen Unfamiliar Places via Teacher-to-Student Data-Free Knowledge Transfer | - | 0
An Efficient End-to-End Approach to Noise Invariant Speech Features via Multi-Task Learning | Code | 0
CoroNetGAN: Controlled Pruning of GANs via Hypernetworks | - | 0
LIX: Implicitly Infusing Spatial Geometric Prior Knowledge into Visual Semantic Segmentation for Autonomous Driving | - | 0
eDifFIQA: Towards Efficient Face Image Quality Assessment Based On Denoising Diffusion Probabilistic Models | Code | 1
Low-Energy On-Device Personalization for MCUs | Code | 0
CALF: Aligning LLMs for Time Series Forecasting via Cross-modal Fine-Tuning | Code | 2
Continual All-in-One Adverse Weather Removal with Knowledge Replay on a Unified Network Structure | Code | 1
Distilling the Knowledge in Data Pruning | - | 0
Enhancing Adversarial Training with Prior Knowledge Distillation for Robust Image Compression | - | 0
Evolving Knowledge Distillation with Large Language Models and Active Learning | - | 0
AuG-KD: Anchor-Based Mixup Generation for Out-of-Domain Knowledge Distillation | Code | 0
One Category One Prompt: Dataset Distillation using Diffusion Models | - | 0
Enhanced Sparsification via Stimulative Training | - | 0

Page 49 of 170

Benchmark Results

In the tables below, T denotes the teacher model and S the student; Claimed is the number reported in the paper, and the Verified column stays empty (-) while a result's status is Unverified.

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | - | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | - | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | - | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | - | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | - | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | - | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | - | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | - | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | - | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | - | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | - | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | - | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | - | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | - | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | - | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | - | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | - | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | - | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | - | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | - | Unverified
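
Every entry above is currently Unverified: the Claimed values come from the papers themselves, with no independently re-measured number recorded yet. For the top-1 accuracy tables, verifying a claim amounts to re-running the released student checkpoint over the benchmark's validation split. A minimal sketch, where `model` and `loader` are placeholders for a checkpoint and evaluation DataLoader that are not provided here:

```python
import torch

@torch.no_grad()
def top1_accuracy(model, loader, device="cuda"):
    """Re-measure a claimed top-1 accuracy on a validation split.

    `model` and `loader` are assumed to be a released student checkpoint
    and the matching benchmark's eval DataLoader; both are hypothetical.
    """
    model.eval().to(device)
    correct, total = 0, 0
    for images, labels in loader:
        images, labels = images.to(device), labels.to(device)
        # Count predictions whose highest-scoring class matches the label.
        preds = model(images).argmax(dim=-1)
        correct += (preds == labels).sum().item()
        total += labels.numel()
    return 100.0 * correct / total
```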