Knowledge Distillation

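Knowledge distillation is the process of transferring knowledge from a large model (the teacher) to a smaller one (the student). Large models, such as very deep neural networks or ensembles of many models, have higher knowledge capacity than small models, but this capacity might not be fully utilized, so a smaller model can often be trained to reproduce much of the larger model's behavior at lower inference cost.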
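As a rough illustration of the idea, the sketch below implements the classic soft-target formulation of distillation: the student is penalized for diverging from the teacher's temperature-softened output distribution and, in addition, for misclassifying the ground-truth label. This is an assumption about what "transferring knowledge" means in the simplest case, not the method of any particular paper listed on this page. The function names, the example logits, and the defaults `temperature=4.0` and `alpha=0.5` are hypothetical choices made for the example.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def distillation_loss(student_logits, teacher_logits, label,
                      temperature=4.0, alpha=0.5):
    """Weighted sum of a soft-target term and a hard-label term.

    temperature and alpha are illustrative defaults (assumptions),
    not values taken from the source page.
    """
    # Soft term: match the teacher's softened distribution.
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    # The T^2 factor keeps the soft term's gradient scale comparable
    # across different temperatures.
    soft = kl_divergence(p_teacher, p_student) * temperature ** 2

    # Hard term: ordinary cross-entropy against the true label at T = 1.
    hard = -math.log(softmax(student_logits)[label] + 1e-12)

    return alpha * soft + (1.0 - alpha) * hard


# Hypothetical logits for a 3-class problem where class 0 is correct.
teacher = [8.0, 2.0, -1.0]
good_student = [7.5, 2.2, -0.8]
poor_student = [0.5, 3.0, 1.0]

print(distillation_loss(good_student, teacher, label=0))
print(distillation_loss(poor_student, teacher, label=0))
```

A student whose logits track the teacher's receives a lower loss than one that disagrees with it. In practice the same loss would be computed over mini-batches inside a deep learning framework rather than with plain Python lists.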

Papers

Showing 1901–1950 of 4240 papers

Title	Date	Tasks	Status
Wake Vision: A Tailored Dataset and Benchmark Suite for TinyML Computer Vision Applications	May 1, 2024	Human Detection, Knowledge Distillation	Unverified
Error Exponent in Agnostic PAC Learning	May 1, 2024	Binary Classification, Knowledge Distillation	Unverified
Why does Knowledge Distillation Work? Rethink its Attention and Fidelity Mechanism	Apr 30, 2024	Data Augmentation, Diversity	Code Available
Knowledge Distillation vs. Pretraining from Scratch under a Fixed (Computation) Budget	Apr 30, 2024	Knowledge Distillation, Language Modeling	Unverified
Control Policy Correction Framework for Reinforcement Learning-based Energy Arbitrage Strategies	Apr 29, 2024	Knowledge Distillation, reinforcement-learning	Unverified
Revealing the Two Sides of Data Augmentation: An Asymmetric Distillation-based Win-Win Solution for Open-Set Recognition	Apr 28, 2024	Data Augmentation, Knowledge Distillation	Unverified
Enhancing Action Recognition from Low-Quality Skeleton Data via Part-Level Knowledge Distillation	Apr 28, 2024	Action Recognition, General Knowledge	Unverified
A Novel Spike Transformer Network for Depth Estimation from Event Cameras via Cross-modality Knowledge Distillation	Apr 26, 2024	Depth Estimation, Knowledge Distillation	Unverified
Correlation-Decoupled Knowledge Distillation for Multimodal Sentiment Analysis with Incomplete Modalities	Apr 25, 2024	Disentanglement, Knowledge Distillation	Unverified
Promoting CNNs with Cross-Architecture Knowledge Distillation for Efficient Monocular Depth Estimation	Apr 25, 2024	Decoder, Depth Estimation	Unverified
BeSound: Bluetooth-Based Position Estimation Enhancing with Cross-Modality Distillation	Apr 24, 2024	Knowledge Distillation, Position	Unverified
Sentence-Level or Token-Level? A Comprehensive Study on Knowledge Distillation	Apr 23, 2024	Knowledge Distillation, Machine Translation	Unverified
Compressed Meta-Optical Encoder for Image Classification	Apr 23, 2024	Classification, image-classification	Unverified
From LLM to NMT: Advancing Low-Resource Machine Translation with Claude	Apr 22, 2024	Knowledge Distillation, Language Modeling	Unverified
FedTAD: Topology-aware Data-free Knowledge Distillation for Subgraph Federated Learning	Apr 22, 2024	Data-free Knowledge Distillation, Federated Learning	Unverified
DynaMMo: Dynamic Model Merging for Efficient Class Incremental Learning for Medical Images	Apr 22, 2024	class-incremental learning, Class Incremental Learning	Code Available
CKD: Contrastive Knowledge Distillation from A Sample-wise Perspective	Apr 22, 2024	Contrastive Learning, image-classification	Code Available
Distributed Learning for Wi-Fi AP Load Prediction	Apr 22, 2024	Federated Learning, Knowledge Distillation	Unverified
Towards Multi-Morphology Controllers with Diversity and Knowledge Distillation	Apr 22, 2024	Diversity, Knowledge Distillation	Code Available
EncodeNet: A Framework for Boosting DNN Accuracy with Entropy-driven Generalized Converting Autoencoder	Apr 21, 2024	image-classification, Image Classification	Unverified
MergeNet: Knowledge Migration across Heterogeneous Models, Tasks, and Modalities	Apr 20, 2024	Knowledge Distillation, Transfer Learning	Unverified
Parameter Efficient Diverse Paraphrase Generation Using Sequence-Level Knowledge Distillation	Apr 19, 2024	Diversity, Knowledge Distillation	Unverified
Data-free Knowledge Distillation for Fine-grained Visual Categorization	Apr 18, 2024	Data-free Knowledge Distillation, Fine-Grained Visual Categorization	Code Available
EdgeFusion: On-Device Text-to-Image Generation	Apr 18, 2024	Image Generation, Knowledge Distillation	Unverified
KDk: A Defense Mechanism Against Label Inference Attacks in Vertical Federated Learning	Apr 18, 2024	Federated Learning, Knowledge Distillation	Unverified
GhostNetV3: Exploring the Training Strategies for Compact Models	Apr 17, 2024	Image Classification, Knowledge Distillation	Unverified
LAPTOP-Diff: Layer Pruning and Normalized Distillation for Compressing Diffusion Models	Apr 17, 2024	Knowledge Distillation	Unverified
A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene	Apr 17, 2024	image-classification, Image Classification	Unverified
MK-SGN: A Spiking Graph Convolutional Network with Multimodal Fusion and Knowledge Distillation for Skeleton-based Action Recognition	Apr 16, 2024	Action Recognition, Knowledge Distillation	Unverified
Comprehensive Survey of Model Compression and Speed up for Vision Transformers	Apr 16, 2024	Computational Efficiency, Edge-computing	Unverified
AI-KD: Towards Alignment Invariant Face Image Quality Assessment Using Knowledge Distillation	Apr 15, 2024	Face Alignment, Face Image Quality	Code Available
MTKD: Multi-Teacher Knowledge Distillation for Image Super-Resolution	Apr 15, 2024	Image Super-Resolution, Knowledge Distillation	Unverified
ReffAKD: Resource-efficient Autoencoder-based Knowledge Distillation	Apr 15, 2024	Knowledge Distillation	Code Available
Weight Copy and Low-Rank Adaptation for Few-Shot Distillation of Vision Transformers	Apr 14, 2024	Knowledge Distillation	Code Available
Navigating the Landscape of Large Language Models: A Comprehensive Review and Analysis of Paradigms and Fine-Tuning Strategies	Apr 13, 2024	Few-Shot Learning, Knowledge Distillation	Code Available
Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation	Apr 11, 2024	Depth Estimation, Depth Prediction	Unverified
Remembering Transformer for Continual Learning	Apr 11, 2024	Continual Learning, Knowledge Distillation	Unverified
Edge-Efficient Deep Learning Models for Automatic Modulation Classification: A Performance Analysis	Apr 11, 2024	Knowledge Distillation, Model Optimization	Unverified
Adversarial Robustness of Distilled and Pruned Deep Learning-based Wireless Classifiers	Apr 11, 2024	Adversarial Robustness, Knowledge Distillation	Unverified
A predictive machine learning force field framework for liquid electrolyte development	Apr 10, 2024	Knowledge Distillation	Unverified
Improving Facial Landmark Detection Accuracy and Efficiency with Knowledge Distillation	Apr 9, 2024	Emotion Recognition, Facial Landmark Detection	Unverified
Robust feature knowledge distillation for enhanced performance of lightweight crack segmentation models	Apr 9, 2024	Crack Segmentation, Knowledge Distillation	Unverified
GHOST: Grounded Human Motion Generation with Open Vocabulary Scene-and-Text Contexts	Apr 8, 2024	Descriptive, Image Segmentation	Unverified
Bootstrapping Chest CT Image Understanding by Distilling Knowledge from X-ray Expert Models	Apr 7, 2024	Contrastive Learning, Diagnostic	Unverified
What Happens When Small Is Made Smaller? Exploring the Impact of Compression on Small Data Pretrained Language Models	Apr 6, 2024	Knowledge Distillation, Language Modeling	Unverified
Do We Really Need a Complex Agent System? Distill Embodied Agent into a Single Model	Apr 6, 2024	Knowledge Distillation	Unverified
Goldfish: An Efficient Federated Unlearning Framework	Apr 4, 2024	Knowledge Distillation, Machine Unlearning	Code Available
On the Surprising Efficacy of Distillation as an Alternative to Pre-Training Small Models	Apr 4, 2024	Contrastive Learning, Knowledge Distillation	Code Available
Okay, Let's Do This! Modeling Event Coreference with Generated Rationales and Knowledge Distillation	Apr 4, 2024	Clustering, coreference-resolution	Code Available
Knowledge Distillation-Based Model Extraction Attack using GAN-based Private Counterfactual Explanations	Apr 4, 2024	counterfactual, Knowledge Distillation	Code Available

Datasets: ImageNet, CIFAR-100, COCO (Common Objects in Context), COCO 2017 val, PASCAL VOC, KITTI

Benchmark Results

Dataset: ImageNet
#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

Dataset: CIFAR-100
#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

Dataset: COCO (Common Objects in Context)
#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

Dataset: COCO 2017 val
#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

Dataset: PASCAL VOC
#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

Dataset: KITTI
#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified