
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model (the teacher) to a smaller one (the student). While large models, such as very deep neural networks or ensembles of many models, have a higher knowledge capacity than small models, that capacity may not be fully utilized; distillation exploits this by training a compact student to reproduce the teacher's behavior, retaining much of its accuracy at a fraction of the inference cost.
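
In its classic soft-target form (Hinton et al., 2015), the student is trained to match the teacher's temperature-softened output distribution alongside the ground-truth labels. Below is a minimal PyTorch sketch of that loss; the temperature and weighting values are illustrative defaults, not settings taken from any paper listed on this page, and the papers below each extend or replace this basic recipe.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.5):
    """Soft-target knowledge distillation loss (Hinton et al., 2015 style).

    Blends cross-entropy on the ground-truth labels with a KL term that
    pulls the student's softened predictions toward the teacher's.
    `temperature` and `alpha` are illustrative defaults, not values
    prescribed by any paper on this page.
    """
    # Temperature-softened distributions; the T^2 factor rescales the
    # KL gradients so they stay comparable to the cross-entropy term.
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    log_soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    kd_term = F.kl_div(log_soft_student, soft_teacher,
                       reduction="batchmean") * temperature ** 2
    ce_term = F.cross_entropy(student_logits, labels)
    return alpha * kd_term + (1.0 - alpha) * ce_term
```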

Papers

Showing 2651–2700 of 4240 papers

Title | Status | Hype
Digital Twin-Assisted Knowledge Distillation Framework for Heterogeneous Federated Learning | - | 0
Dynamic Y-KD: A Hybrid Approach to Continual Instance Segmentation | - | 0
Robust Knowledge Distillation from RNN-T Models With Noisy Training Labels Using Full-Sum Loss | - | 0
Learning the Wrong Lessons: Inserting Trojans During Knowledge Distillation | - | 0
NIFF: Alleviating Forgetting in Generalized Few-Shot Object Detection via Neural Instance Feature Forging | - | 0
Gradient-Guided Knowledge Distillation for Object Detectors | - | 0
Adaptive Knowledge Distillation between Text and Speech Pre-trained Models | - | 0
PreFallKD: Pre-Impact Fall Detection via CNN-ViT Knowledge Distillation | Code | 0
KDSM: An uplift modeling framework based on knowledge distillation and sample matching | - | 0
Students Parrot Their Teachers: Membership Inference on Model Distillation | - | 0
IKD+: Reliable Low Complexity Deep Models For Retinopathy Classification | - | 0
X^3KD: Knowledge Distillation Across Modalities, Tasks and Stages for Multi-Camera 3D Object Detection | - | 0
Pre-trained Model Representations and their Robustness against Noise for Speech Emotion Analysis | - | 0
Unsupervised Deep Digital Staining For Microscopic Cell Images Via Knowledge Distillation | - | 0
Letz Translate: Low-Resource Machine Translation for Luxembourgish | - | 0
Distilling Multi-Level X-vector Knowledge for Small-footprint Speaker Verification | - | 0
Weakly-supervised HOI Detection via Prior-guided Bi-level Representation Learning | - | 0
Distilled Reverse Attention Network for Open-world Compositional Zero-Shot Learning | - | 0
Backdoor for Debias: Mitigating Model Bias with Backdoor Attack-based Artificial Bias | Code | 0
Towards domain generalisation in ASR with elitist sampling and ensemble knowledge distillation | - | 0
Incremental Learning of Acoustic Scenes and Sound Events | - | 0
Learning to Retain while Acquiring: Combating Distribution-Shift in Adversarial Data-Free Knowledge Distillation | - | 0
Language-Universal Adapter Learning with Knowledge Distillation for End-to-End Multilingual Speech Recognition | Code | 0
Leveraging Angular Distributions for Improved Knowledge Distillation | - | 0
A Light-weight Deep Learning Model for Remote Sensing Image Classification | - | 0
Ensemble knowledge distillation of self-supervised speech models | - | 0
A Knowledge Distillation framework for Multi-Organ Segmentation of Medaka Fish in Tomographic Image | - | 0
Teacher Intervention: Improving Convergence of Quantization Aware Training for Ultra-Low Precision Transformers | Code | 0
Personalized Decentralized Federated Learning with Knowledge Distillation | - | 0
Exploring Social Media for Early Detection of Depression in COVID-19 Patients | Code | 0
Practical Knowledge Distillation: Using DNNs to Beat DNNs | - | 0
Debiased Distillation by Transplanting the Last Layer | - | 0
Distilling Calibrated Student from an Uncalibrated Teacher | - | 0
KS-DETR: Knowledge Sharing in Attention Learning for Detection Transformer | Code | 0
CADIS: Handling Cluster-skewed Non-IID Data in Federated Learning with Clustered Aggregation and Knowledge DIStilled Regularization | Code | 0
Two-in-one Knowledge Distillation for Efficient Facial Forgery Detection | - | 0
The Role of Masking for Efficient Supervised Knowledge Distillation of Vision Transformers | - | 0
HomoDistil: Homotopic Task-Agnostic Distillation of Pre-trained Transformers | - | 0
RobustDistiller: Compressing Universal Speech Representations for Enhanced Environment Robustness | - | 0
Fairly Predicting Graft Failure in Liver Transplant for Organ Assigning | - | 0
Explicit and Implicit Knowledge Distillation via Unlabeled Data | - | 0
Few-shot 3D LiDAR Semantic Segmentation for Autonomous Driving | - | 0
Learning From Biased Soft Labels | - | 0
Cross Modal Distillation for Flood Extent Mapping | - | 0
Fuzzy Knowledge Distillation from High-Order TSK to Low-Order TSK | - | 0
LEALLA: Learning Lightweight Language-agnostic Sentence Embeddings with Knowledge Distillation | - | 0
New Insights on Relieving Task-Recency Bias for Online Class Incremental Learning | Code | 0
ST-MFNet Mini: Knowledge Distillation-Driven Frame Interpolation | Code | 0
Offline-to-Online Knowledge Distillation for Video Instance Segmentation | - | 0
A lightweight network for photovoltaic cell defect detection in electroluminescence images based on neural architecture search and knowledge distillation | - | 0
Page 54 of 85

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | - | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | - | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | - | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | - | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | - | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | - | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | - | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | - | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | - | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | - | Unverified
2 | shufflenet-v2 (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | - | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | - | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | - | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | - | Unverified
6 | ReviewKD++ (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | - | Unverified
7 | ReviewKD++ (T: resnet32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | - | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | - | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | - | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | - | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | - | Unverified