SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized. Distillation trains a compact student to reproduce the behaviour of the large teacher, so much of the teacher's performance can be retained at a fraction of the inference cost.
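
As a concrete, hedged illustration of the classic soft-target recipe (Hinton et al., 2015), the sketch below blends cross-entropy on ground-truth labels with a KL-divergence term on temperature-softened teacher logits. The temperature and mixing weight are illustrative assumptions, not the configuration of any paper listed on this page.

```python
# Minimal soft-target knowledge distillation loss (Hinton et al., 2015 style).
# temperature and alpha are illustrative defaults, not taken from any listed paper.
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature: float = 4.0, alpha: float = 0.5):
    """Blend hard-label cross-entropy with a KL term on softened logits."""
    # Hard-label term: ordinary cross-entropy against ground-truth classes.
    hard = F.cross_entropy(student_logits, labels)
    # Soft-label term: KL divergence between temperature-softened teacher and
    # student distributions; the T^2 factor keeps the gradient scale comparable.
    soft = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * (temperature ** 2)
    return alpha * hard + (1.0 - alpha) * soft

# Typical usage: the teacher is frozen (eval mode, no gradients) and only the
# student's parameters are updated.
# with torch.no_grad():
#     teacher_logits = teacher(x)
# loss = distillation_loss(student(x), teacher_logits, y)
```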

Papers

Showing 1601–1650 of 4240 papers

Title | Status | Hype
Fair Feature Importance Scores for Interpreting Tree-Based Methods and Surrogates | — | 0
LumiNet: The Bright Side of Perceptual Knowledge Distillation | Code | 1
DED: Diagnostic Evidence Distillation for acne severity grading on face images | Code | 0
Improving Knowledge Distillation with Teacher's Explanation | — | 0
I^2KD-SLU: An Intra-Inter Knowledge Distillation Framework for Zero-Shot Cross-Lingual Spoken Language Understanding | — | 0
Heterogeneous Federated Learning Using Knowledge Codistillation | — | 0
Talking Models: Distill Pre-trained Knowledge to Downstream Models via Interactive Communication | — | 0
SEA: Sparse Linear Attention with Estimated Attention Mask | Code | 1
Can a student Large Language Model perform as well as its teacher? | — | 0
Learnable Cross-modal Knowledge Distillation for Multi-modal Learning with Missing Modality | — | 0
KGEx: Explaining Knowledge Graph Embeddings via Subgraph Sampling and Knowledge Distillation | — | 0
Towards LogiGLUE: A Brief Survey and A Benchmark for Analyzing Logical Reasoning Capabilities of Language Models | — | 0
Towards Fixing Clever-Hans Predictors with Counterfactual Knowledge Distillation | — | 0
Distilling Influences to Mitigate Prediction Churn in Graph Neural Networks | Code | 0
Adaptive Decoupled Pose Knowledge Distillation | Code | 0
NAYER: Noisy Layer Data Generation for Efficient and Effective Data-free Knowledge Distillation | Code | 1
Distilling Inductive Bias: Knowledge Distillation Beyond Model Compression | — | 0
Towards Few-Call Model Stealing via Active Self-Paced Knowledge Distillation and Diffusion-Based Image Generation | — | 0
Promoting Generalized Cross-lingual Question Answering in Few-resource Scenarios via Self-knowledge Distillation | Code | 0
An Enhanced Low-Resolution Image Recognition Method for Traffic Environments | — | 0
Distill to Delete: Unlearning in Graph Networks with Knowledge Distillation | — | 0
Distilling ODE Solvers of Diffusion Models into Smaller Steps | — | 0
Inherit with Distillation and Evolve with Contrast: Exploring Class Incremental Semantic Segmentation Without Exemplar Memory | — | 0
DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion | — | 0
VideoAdviser: Video Knowledge Distillation for Multimodal Transfer Learning | — | 0
Cold & Warm Net: Addressing Cold-Start Users in Recommender Systems | — | 0
Contrastive Continual Multi-view Clustering with Filtered Structural Fusion | — | 0
Learning Using Generated Privileged Information by Text-to-Image Diffusion Models | — | 0
Event Stream-based Visual Object Tracking: A High-Resolution Benchmark Dataset and A Novel Baseline | Code | 2
Noise-Tolerant Few-Shot Unsupervised Adapter for Vision-Language Models | — | 0
DONNAv2 -- Lightweight Neural Architecture Search for Vision tasks | — | 0
ADU-Depth: Attention-based Distillation with Uncertainty Modeling for Depth Estimation | — | 0
DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation | Code | 1
Unsupervised 3D Perception with 2D Vision-Language Distillation for Autonomous Driving | — | 0
Data Upcycling Knowledge Distillation for Image Super-Resolution | Code | 0
DFRD: Data-Free Robustness Distillation for Heterogeneous Federated Learning | — | 0
Multivariate Prototype Representation for Domain-Generalized Incremental Learning | — | 0
Poster: Self-Supervised Quantization-Aware Knowledge Distillation | — | 0
VIC-KD: Variance-Invariance-Covariance Knowledge Distillation to Make Keyword Spotting More Robust Against Adversarial Attacks | — | 0
Triple-View Knowledge Distillation for Semi-Supervised Semantic Segmentation | — | 0
Defending against Data-Free Model Extraction by Distributionally Robust Defensive Training | — | 0
Elevating Skeleton-Based Action Recognition with Efficient Multi-Modality Self-Supervision | Code | 0
A Sentence Speaks a Thousand Images: Domain Generalization through Distilling CLIP with Language Guidance | Code | 1
EPTQ: Enhanced Post-Training Quantization via Hessian-guided Network-wise Optimization | Code | 2
Dense 2D-3D Indoor Prediction with Sound via Aligned Cross-Modal Distillation | Code | 0
Language-Oriented Communication with Semantic Coding and Knowledge Distillation for Text-to-Image Generation | — | 0
Weight Averaging Improves Knowledge Distillation under Domain Shift | Code | 1
Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement | — | 0
Improving CLIP Robustness with Knowledge Distillation and Self-Training | — | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | — | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | — | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | — | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | — | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | — | Unverified
6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | — | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | — | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | — | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | — | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | — | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | — | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | — | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | — | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | — | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | — | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | — | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | — | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | — | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | — | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | — | Unverified
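
All of the entries above carry Unverified status with an empty Verified column; verifying a claimed Top-1 accuracy amounts to re-running the released student checkpoint on the benchmark's evaluation split and comparing against the Claimed value. Below is a minimal sketch of such a check; the checkpoint path, model object, and data loader are placeholders (assumptions), since this page does not specify the evaluation pipeline.

```python
# Sketch of a Top-1 accuracy check against a "Claimed" number.
# Checkpoint path, model class, and val_loader are hypothetical placeholders.
import torch

@torch.no_grad()
def top1_accuracy(model: torch.nn.Module, loader, device: str = "cuda") -> float:
    """Percentage of samples whose argmax prediction matches the label."""
    model.eval()
    correct, total = 0, 0
    for images, labels in loader:
        images, labels = images.to(device), labels.to(device)
        preds = model(images).argmax(dim=-1)
        correct += (preds == labels).sum().item()
        total += labels.numel()
    return 100.0 * correct / total

# Hypothetical usage:
# student = torch.load("student_checkpoint.pt", map_location="cuda")
# print(f"Top-1: {top1_accuracy(student, val_loader):.2f}%")
```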