Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1501–1550 of 4240 papers

Title	Date	Tasks	Status	Hype
Wired Perspectives: Multi-View Wire Art Embraces Generative AI	Nov 26, 2023	Knowledge Distillation	—Unverified	0
Unlearning via Sparse Representations	Nov 26, 2023	Knowledge Distillation	—Unverified	0
Double Reverse Regularization Network Based on Self-Knowledge Distillation for SAR Object Classification	Nov 26, 2023	Knowledge DistillationSelf-Knowledge Distillation	—Unverified	0
Cosine Similarity Knowledge Distillation for Individual Class Information Transfer	Nov 24, 2023	Knowledge DistillationModel Compression	—Unverified	0
Maximizing Discrimination Capability of Knowledge Distillation with Energy Function	Nov 24, 2023	Data AugmentationKnowledge Distillation	—Unverified	0
Efficient Open-world Reinforcement Learning via Knowledge Distillation and Autonomous Rule Discovery	Nov 24, 2023	Deep Reinforcement LearningKnowledge Distillation	—Unverified	0
Pseudo-label Correction for Instance-dependent Noise Using Teacher-student Framework	Nov 24, 2023	Knowledge DistillationPseudo Label	—Unverified	0
Knowledge Distillation Based Semantic Communications For Multiple Users	Nov 23, 2023	DecoderKnowledge Distillation	—Unverified	0
Efficient and Robust Jet Tagging at the LHC with Knowledge Distillation	Nov 23, 2023	Inductive BiasJet Tagging	CodeCode Available	0
Some Like It Small: Czech Semantic Embedding Models for Industry Applications	Nov 23, 2023	Image RetrievalKnowledge Distillation	CodeCode Available	1
Bridging Classical and Quantum Machine Learning: Knowledge Transfer From Classical to Quantum Neural Networks Using Knowledge Distillation	Nov 23, 2023	Dimensionality ReductionImage Classification	—Unverified	0
Robustness-Reinforced Knowledge Distillation with Correlation Distance and Network Pruning	Nov 23, 2023	Data AugmentationKnowledge Distillation	—Unverified	0
Education distillation:getting student models to learn in shcools	Nov 23, 2023	Incremental LearningKnowledge Distillation	—Unverified	0
Efficient Transformer Knowledge Distillation: A Performance Review	Nov 22, 2023	Knowledge DistillationModel Compression	—Unverified	0
EA-KD: Entropy-based Adaptive Knowledge Distillation	Nov 22, 2023	image-classificationImage Classification	—Unverified	0
Point, Segment and Count: A Generalized Framework for Object Counting	Nov 21, 2023	Knowledge DistillationObject	CodeCode Available	1
HoVer-UNet: Accelerating HoVerNet with UNet-based multi-class nuclei segmentation via knowledge distillation	Nov 21, 2023	Instance SegmentationKnowledge Distillation	CodeCode Available	1
FreeKD: Knowledge Distillation via Semantic Frequency Prompt	Nov 20, 2023	Knowledge Distillation	CodeCode Available	1
Unveiling the Unseen Potential of Graph Learning through MLPs: Effective Graph Learners Using Propagation-Embracing MLPs	Nov 20, 2023	Graph LearningGraph Neural Network	—Unverified	0
Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention	Nov 18, 2023	Concept AlignmentGraph Generation	CodeCode Available	1
LightBTSeg: A lightweight breast tumor segmentation model using ultrasound images via dual-path joint knowledge distillation	Nov 18, 2023	Knowledge DistillationLesion Detection	—Unverified	0
Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers	Nov 17, 2023	Knowledge Distillation	—Unverified	0
Semi-supervised ViT knowledge distillation network with style transfer normalization for colorectal liver metastases survival prediction	Nov 17, 2023	Generative Adversarial NetworkKnowledge Distillation	—Unverified	0
A Knowledge Distillation Approach for Sepsis Outcome Prediction from Multivariate Clinical Time Series	Nov 16, 2023	Knowledge DistillationTime Series	—Unverified	0
Multistage Collaborative Knowledge Distillation from a Large Language Model for Semi-Supervised Sequence Generation	Nov 15, 2023	Constituency ParsingKnowledge Distillation	CodeCode Available	0
Unlock the Power: Competitive Distillation for Multi-Modal Large Language Models	Nov 14, 2023	Knowledge DistillationTransfer Learning	—Unverified	0
Distilling the Unknown to Unveil Certainty	Nov 14, 2023	Knowledge DistillationOut of Distribution (OOD) Detection	CodeCode Available	0
Batch Selection and Communication for Active Learning with Edge Labeling	Nov 14, 2023	Active LearningKnowledge Distillation	—Unverified	0
Teach me with a Whisper: Enhancing Large Language Models for Analyzing Spoken Transcripts using Speech Embeddings	Nov 13, 2023	Knowledge DistillationLanguage Modeling	—Unverified	0
On Elastic Language Models	Nov 13, 2023	Information RetrievalKnowledge Distillation	—Unverified	0
Quantized Distillation: Optimizing Driver Activity Recognition Models for Resource-Constrained Environments	Nov 10, 2023	Activity RecognitionAutonomous Driving	CodeCode Available	1
DONUT-hole: DONUT Sparsification by Harnessing Knowledge and Optimizing Learning Efficiency	Nov 9, 2023	document understandingKey Information Extraction	—Unverified	0
Text Representation Distillation via Information Bottleneck Principle	Nov 9, 2023	Knowledge DistillationRetrieval	CodeCode Available	0
Object-centric Cross-modal Feature Distillation for Event-based Object Detection	Nov 9, 2023	Knowledge DistillationObject	—Unverified	0
Bridging Dimensions: Confident Reachability for High-Dimensional Controllers	Nov 8, 2023	Knowledge DistillationOpenAI Gym	CodeCode Available	0
Preference-Consistent Knowledge Distillation for Recommender System	Nov 8, 2023	Knowledge DistillationRecommendation Systems	CodeCode Available	0
What is Lost in Knowledge Distillation?	Nov 7, 2023	Knowledge DistillationModel Compression	—Unverified	0
Supervised domain adaptation for building extraction from off-nadir aerial images	Nov 7, 2023	Domain AdaptationEarth Observation	—Unverified	0
Data exploitation: multi-task learning of object detection and semantic segmentation on partially annotated data	Nov 7, 2023	Knowledge DistillationMulti-Task Learning	CodeCode Available	0
Reducing Spatial Fitting Error in Distillation of Denoising Diffusion Models	Nov 7, 2023	AttributeDenoising	CodeCode Available	0
Asymmetric Masked Distillation for Pre-Training Small Foundation Models	Nov 6, 2023	Action ClassificationAction Recognition	CodeCode Available	0
Co-training and Co-distillation for Quality Improvement and Compression of Language Models	Nov 6, 2023	Data AugmentationKnowledge Distillation	—Unverified	0
Cross-Level Distillation and Feature Denoising for Cross-Domain Few-Shot Classification	Nov 4, 2023	ClassificationCross-Domain Few-Shot	CodeCode Available	1
After-Stroke Arm Paresis Detection using Kinematic Data	Nov 3, 2023	Action ClassificationKnowledge Distillation	—Unverified	0
Comparative Knowledge Distillation	Nov 3, 2023	Data AugmentationKnowledge Distillation	CodeCode Available	0
Data-Free Distillation of Language Model by Text-to-Text Transfer	Nov 3, 2023	Data-free Knowledge DistillationDiversity	—Unverified	0
Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models	Nov 2, 2023	Data AugmentationDomain Generalization	CodeCode Available	1
An Efficient Detection and Control System for Underwater Docking using Machine Learning and Realistic Simulation: A Comprehensive Approach	Nov 2, 2023	Generative Adversarial NetworkImage-to-Image Translation	—Unverified	0
Multilingual DistilWhisper: Efficient Distillation of Multi-task Speech Models via Language-Specific Experts	Nov 2, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	1
Implicit Chain of Thought Reasoning via Knowledge Distillation	Nov 2, 2023	Knowledge DistillationMath	CodeCode Available	1

Show:10 25 50

← PrevPage 31 of 85Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified