Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large "teacher" model to a smaller "student" model. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity is often not fully utilized, so a compact student trained to mimic the teacher can frequently recover much of its accuracy at a fraction of the inference cost.
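
The classic recipe (Hinton et al.'s "soft target" distillation) trains the student on a weighted sum of the usual hard-label loss and a KL term that matches the teacher's temperature-softened output distribution. Below is a minimal PyTorch sketch of that idea; the function name and the temperature/weighting defaults are illustrative assumptions, not values taken from any paper listed on this page.

```python
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.5):
    """Weighted sum of hard-label cross-entropy and soft-label KL distillation.

    `temperature` and `alpha` are illustrative defaults, not prescribed values.
    """
    # Hard-label term: ordinary cross-entropy on the ground-truth labels.
    hard = F.cross_entropy(student_logits, labels)
    # Soft-label term: KL divergence between temperature-softened student and
    # teacher distributions. The T^2 factor keeps gradient magnitudes
    # comparable across temperatures.
    soft = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * (temperature ** 2)
    return alpha * hard + (1.0 - alpha) * soft
```

In a typical training loop the teacher runs in eval mode under `torch.no_grad()` to produce `teacher_logits`, and only the student's parameters are updated.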

Papers

Showing 2201–2250 of 4240 papers

| Title | Status | Hype |
| --- | --- | --- |
| LayerCollapse: Adaptive compression of neural networks | – | 0 |
| The Devil is in the Data: Learning Fair Graph Neural Networks via Partial Knowledge Distillation | Code | 0 |
| Propagate & Distill: Towards Effective Graph Learners Using Propagation-Embracing MLPs | – | 0 |
| Rethinking Intermediate Layers design in Knowledge Distillation for Kidney and Liver Tumor Segmentation | Code | 0 |
| DiffusionTalker: Personalization and Acceleration for Speech-Driven 3D Face Diffuser | – | 0 |
| FedAL: Black-Box Federated Knowledge Distillation Enabled by Adversarial Learning | – | 0 |
| UFIN: Universal Feature Interaction Network for Multi-Domain Click-Through Rate Prediction | Code | 0 |
| Wired Perspectives: Multi-View Wire Art Embraces Generative AI | – | 0 |
| Unlearning via Sparse Representations | – | 0 |
| Double Reverse Regularization Network Based on Self-Knowledge Distillation for SAR Object Classification | – | 0 |
| Cosine Similarity Knowledge Distillation for Individual Class Information Transfer | – | 0 |
| Efficient Open-world Reinforcement Learning via Knowledge Distillation and Autonomous Rule Discovery | – | 0 |
| Pseudo-label Correction for Instance-dependent Noise Using Teacher-student Framework | – | 0 |
| Maximizing Discrimination Capability of Knowledge Distillation with Energy Function | – | 0 |
| Bridging Classical and Quantum Machine Learning: Knowledge Transfer From Classical to Quantum Neural Networks Using Knowledge Distillation | – | 0 |
| Efficient and Robust Jet Tagging at the LHC with Knowledge Distillation | Code | 0 |
| Robustness-Reinforced Knowledge Distillation with Correlation Distance and Network Pruning | – | 0 |
| Knowledge Distillation Based Semantic Communications For Multiple Users | – | 0 |
| Education distillation: getting student models to learn in shcools | – | 0 |
| Efficient Transformer Knowledge Distillation: A Performance Review | – | 0 |
| EA-KD: Entropy-based Adaptive Knowledge Distillation | – | 0 |
| Unveiling the Unseen Potential of Graph Learning through MLPs: Effective Graph Learners Using Propagation-Embracing MLPs | – | 0 |
| LightBTSeg: A lightweight breast tumor segmentation model using ultrasound images via dual-path joint knowledge distillation | – | 0 |
| Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers | – | 0 |
| Semi-supervised ViT knowledge distillation network with style transfer normalization for colorectal liver metastases survival prediction | – | 0 |
| A Knowledge Distillation Approach for Sepsis Outcome Prediction from Multivariate Clinical Time Series | – | 0 |
| Multistage Collaborative Knowledge Distillation from a Large Language Model for Semi-Supervised Sequence Generation | Code | 0 |
| Distilling the Unknown to Unveil Certainty | Code | 0 |
| Unlock the Power: Competitive Distillation for Multi-Modal Large Language Models | – | 0 |
| Batch Selection and Communication for Active Learning with Edge Labeling | – | 0 |
| Teach me with a Whisper: Enhancing Large Language Models for Analyzing Spoken Transcripts using Speech Embeddings | – | 0 |
| On Elastic Language Models | – | 0 |
| DONUT-hole: DONUT Sparsification by Harnessing Knowledge and Optimizing Learning Efficiency | – | 0 |
| Object-centric Cross-modal Feature Distillation for Event-based Object Detection | – | 0 |
| Text Representation Distillation via Information Bottleneck Principle | Code | 0 |
| Preference-Consistent Knowledge Distillation for Recommender System | Code | 0 |
| Bridging Dimensions: Confident Reachability for High-Dimensional Controllers | Code | 0 |
| Reducing Spatial Fitting Error in Distillation of Denoising Diffusion Models | Code | 0 |
| Supervised domain adaptation for building extraction from off-nadir aerial images | – | 0 |
| Data exploitation: multi-task learning of object detection and semantic segmentation on partially annotated data | Code | 0 |
| What is Lost in Knowledge Distillation? | – | 0 |
| Co-training and Co-distillation for Quality Improvement and Compression of Language Models | – | 0 |
| Asymmetric Masked Distillation for Pre-Training Small Foundation Models | Code | 0 |
| Comparative Knowledge Distillation | Code | 0 |
| After-Stroke Arm Paresis Detection using Kinematic Data | – | 0 |
| Data-Free Distillation of Language Model by Text-to-Text Transfer | – | 0 |
| An Efficient Detection and Control System for Underwater Docking using Machine Learning and Realistic Simulation: A Comprehensive Approach | – | 0 |
| Distilling Knowledge from CNN-Transformer Models for Enhanced Human Action Recognition | – | 0 |
| Group Distributionally Robust Knowledge Distillation | – | 0 |
| NEO-KD: Knowledge-Distillation-Based Adversarial Training for Robust Multi-Exit Neural Networks | – | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | – | Unverified |
| 2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | – | Unverified |
| 3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | – | Unverified |
| 4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | – | Unverified |
| 5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | – | Unverified |
| 6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | – | Unverified |
| 7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | – | Unverified |
| 8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | – | Unverified |
| 9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | – | Unverified |
| 10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | – | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | – | Unverified |
| 2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | – | Unverified |
| 3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | – | Unverified |
| 4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | – | Unverified |
| 5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | – | Unverified |
| 6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | – | Unverified |
| 7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | – | Unverified |
| 8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | – | Unverified |
| 9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | – | Unverified |
| 10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | – | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | – | Unverified |
| 2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | – | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | – | Unverified |