
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity may not be fully utilized, so a compact "student" model can often be trained to approximate a large "teacher" at a fraction of the inference cost.
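
Concretely, a common recipe trains the student to match the teacher's temperature-softened output distribution while still fitting the ground-truth labels. Below is a minimal sketch of this classic objective (Hinton et al., 2015) in PyTorch; the tiny nn.Linear teacher and student are hypothetical stand-ins for real networks, and the temperature T and mixing weight alpha are illustrative defaults, not values from any paper listed here.

    # Minimal knowledge-distillation sketch, assuming PyTorch.
    # The Linear "teacher"/"student" are hypothetical stand-ins.
    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
        # Soften both distributions with temperature T; the T**2 factor keeps
        # soft-target gradients on the same scale as the hard-label term.
        soft = F.kl_div(
            F.log_softmax(student_logits / T, dim=-1),
            F.softmax(teacher_logits / T, dim=-1),
            reduction="batchmean",
        ) * (T ** 2)
        hard = F.cross_entropy(student_logits, labels)
        return alpha * soft + (1.0 - alpha) * hard

    teacher = nn.Linear(32, 10)   # stand-in for a large pretrained model
    student = nn.Linear(32, 10)   # stand-in for the compact model being trained
    x, y = torch.randn(8, 32), torch.randint(0, 10, (8,))

    teacher.eval()
    with torch.no_grad():         # the teacher only provides targets
        t_logits = teacher(x)
    loss = distillation_loss(student(x), t_logits, y)
    loss.backward()               # gradients flow into the student only

Blending the soft and hard terms lets the student learn from the teacher's inter-class similarity structure (the "dark knowledge" in near-zero probabilities) while the hard labels anchor it to the true task.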

Papers

Showing 4201–4240 of 4240 papers

Title | Status | Hype
Mutual Learning of Single- and Multi-Channel End-to-End Neural Diarization | – | 0
Mutually-paced Knowledge Distillation for Cross-lingual Temporal Knowledge Graph Reasoning | – | 0
MVKT-ECG: Efficient Single-lead ECG Classification on Multi-Label Arrhythmia by Multi-View Knowledge Transferring | – | 0
NAIST English-to-Japanese Simultaneous Translation System for IWSLT 2021 Simultaneous Text-to-text Task | – | 0
Narrowing the Coordinate-frame Gap in Behavior Prediction Models: Distillation for Efficient and Accurate Scene-centric Motion Forecasting | – | 0
NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions | – | 0
Natural Statistics of Network Activations and Implications for Knowledge Distillation | – | 0
Nearest Neighbor Knowledge Distillation for Neural Machine Translation | – | 0
Neighbourhood Distillation: On the benefits of non end-to-end distillation | – | 0
NEO-KD: Knowledge-Distillation-Based Adversarial Training for Robust Multi-Exit Neural Networks | – | 0
NestedNet: Learning Nested Sparse Structures in Deep Neural Networks | – | 0
Network-Agnostic Knowledge Transfer for Medical Image Segmentation | – | 0
Reconstructing Pruned Filters using Cheap Spatial Transformations | – | 0
Neural Architecture Search for Effective Teacher-Student Knowledge Transfer in Language Models | – | 0
Neural Architecture Search via Ensemble-based Knowledge Distillation | – | 0
Neural Collapse Inspired Knowledge Distillation | – | 0
Neural Compatibility Modeling with Attentive Knowledge Distillation | – | 0
Neural Machine Translation from Simplified Translations | – | 0
NeuroComparatives: Neuro-Symbolic Distillation of Comparative Knowledge | – | 0
New Perspective on Progressive GANs Distillation for One-class Novelty Detection | – | 0
NewsBERT: Distilling Pre-trained Language Model for Intelligent News Application | – | 0
NICEST: Noisy Label Correction and Training for Robust Scene Graph Generation | – | 0
Nickel and Diming Your GAN: A Dual-Method Approach to Enhancing GAN Efficiency via Knowledge Distillation | – | 0
NIFF: Alleviating Forgetting in Generalized Few-Shot Object Detection via Neural Instance Feature Forging | – | 0
NLDF: Neural Light Dynamic Fields for Efficient 3D Talking Head Generation | – | 0
No Forgetting Learning: Memory-free Continual Learning | – | 0
Noise-Tolerant Few-Shot Unsupervised Adapter for Vision-Language Models | – | 0
Noisy Machines: Understanding Noisy Neural Networks and Enhancing Robustness to Analog Hardware Errors Using Distillation | – | 0
Noisy Neural Network Compression for Analog Storage Devices | – | 0
Non-Autoregressive Sign Language Production via Knowledge Distillation | – | 0
Non-target Divergence Hypothesis: Toward Understanding Domain Gaps in Cross-Modal Knowledge Distillation | – | 0
No One Left Behind: Inclusive Federated Learning over Heterogeneous Devices | – | 0
Normalized Feature Distillation for Semantic Segmentation | – | 0
Not All Knowledge Is Created Equal: Mutual Distillation of Confident Knowledge | – | 0
Not All Regions are Worthy to be Distilled: Region-aware Knowledge Distillation Towards Efficient Image-to-Image Translation | – | 0
Not to Overfit or Underfit the Source Domains? An Empirical Study of Domain Generalization in Question Answering | – | 0
NovaCOMET: Open Commonsense Foundation Models with Symbolic Knowledge Distillation | – | 0
Novel Visual Category Discovery with Dual Ranking Statistics and Mutual Knowledge Distillation | – | 0
NVIDIA NeMo Neural Machine Translation Systems for English-German and English-Russian News and Biomedical Tasks at WMT21 | – | 0
NVIDIA NeMo's Neural Machine Translation Systems for English-German and English-Russian News and Biomedical Tasks at WMT21 | – | 0
Page 85 of 85

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | – | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | – | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | – | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | – | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | – | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | – | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | – | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | – | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | – | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | – | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | – | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | – | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | – | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | – | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | – | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | – | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | – | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | – | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | – | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | – | Unverified