Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized. Distillation therefore trains a compact "student" model to mimic the behavior of a larger "teacher", retaining most of the teacher's accuracy at a fraction of the inference cost; in the benchmark tables below, "T:" and "S:" denote the teacher and student architectures for each result.
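
As a concrete illustration, the sketch below shows the classic soft-target distillation loss of Hinton et al. (2015) in PyTorch: the student is trained to match the teacher's temperature-softened output distribution alongside the usual cross-entropy on hard labels. The temperature, loss weight, and tensor shapes are illustrative assumptions, not values drawn from any paper listed below.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.9):
    """Soft-target KD loss blended with ordinary cross-entropy."""
    # Teacher logits are detached: gradients must only update the student.
    soft_teacher = F.log_softmax(teacher_logits.detach() / temperature, dim=-1)
    soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    # KL divergence between the softened distributions; the T^2 factor keeps
    # gradient magnitudes comparable across temperatures, as in the original paper.
    kd_term = F.kl_div(soft_student, soft_teacher,
                       reduction="batchmean", log_target=True) * temperature ** 2
    # Standard supervised term on the ground-truth labels.
    ce_term = F.cross_entropy(student_logits, labels)
    return alpha * kd_term + (1.0 - alpha) * ce_term

if __name__ == "__main__":
    # Toy check with random logits: batch of 8, 100 classes.
    student_logits = torch.randn(8, 100)
    teacher_logits = torch.randn(8, 100)
    labels = torch.randint(0, 100, (8,))
    print(distillation_loss(student_logits, teacher_logits, labels))
```

In practice the teacher runs in eval mode under torch.no_grad() to produce its logits, and alpha is tuned per task; many of the papers listed below replace or augment this logit-matching term with feature-level or attention-based objectives.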

Papers

Showing 1451–1500 of 4240 papers

Title | Status | Hype
FastSR-NeRF: Improving NeRF Efficiency on Consumer Devices with A Simple Super-Resolution Pipeline | | 0
Student as an Inherent Denoiser of Noisy Teacher | | 0
MobileSAMv2: Faster Segment Anything to Everything | Code | 5
Faster Diffusion: Rethinking the Role of the Encoder for Diffusion Model Inference | Code | 2
WAVER: Writing-style Agnostic Text-Video Retrieval via Distilling Vision-Language Models Through Open-Vocabulary Knowledge | Code | 0
Efficient speech detection in environmental audio using acoustic recognition and knowledge distillation | | 0
COMBHelper: A Neural Approach to Reduce Search Space for Graph Combinatorial Problems | Code | 0
Generative Model-based Feature Knowledge Distillation for Action Recognition | Code | 1
Unraveling Key Factors of Knowledge Distillation | | 0
SKDF: A Simple Knowledge Distillation Framework for Distilling Open-Vocabulary Knowledge to Open-world Object Detector | Code | 1
CLIP-guided Federated Learning on Heterogeneous and Long-Tailed Data | Code | 1
RdimKD: Generic Distillation Paradigm by Dimensionality Reduction | | 0
RankDVQA-mini: Knowledge Distillation-Driven Deep Video Quality Assessment | | 0
Fast Sampling Through The Reuse Of Attention Maps In Diffusion Models | | 0
Cooperative Learning for Cost-Adaptive Inference | | 0
KDAS: Knowledge Distillation via Attention Supervision Framework for Polyp Segmentation | Code | 1
Mutual-Learning Knowledge Distillation for Nighttime UAV Tracking | Code | 0
Traffic Signal Control Using Lightweight Transformers: An Offline-to-Online RL Approach | Code | 1
A dynamic interactive learning framework for automated 3D medical image segmentation | | 0
NovaCOMET: Open Commonsense Foundation Models with Symbolic Knowledge Distillation | | 0
Fake It Till Make It: Federated Learning with Consensus-Oriented Generation | | 0
IL-NeRF: Incremental Learning for Neural Radiance Fields with Camera Pose Alignment | | 0
Understanding the Effect of Model Compression on Social Bias in Large Language Models | Code | 0
Improving Adversarial Robust Fairness via Anti-Bias Soft Label Distillation | Code | 0
Localized Symbolic Knowledge Distillation for Visual Commonsense Models | Code | 0
Language Model Knowledge Distillation for Efficient Question Answering in Spanish | Code | 0
KOALA: Empirical Lessons Toward Memory-Efficient and Fast Diffusion Models for Text-to-Image Synthesis | | 0
Augmentation-Free Dense Contrastive Knowledge Distillation for Efficient Semantic Segmentation | Code | 1
Combining inherent knowledge of vision-language models with unsupervised domain adaptation through strong-weak guidance | Code | 0
Synchronization is All You Need: Exocentric-to-Egocentric Transfer for Temporal Action Segmentation with Unlabeled Synchronized Video Pairs | Code | 0
Contrastive Learning-Based Spectral Knowledge Distillation for Multi-Modality and Missing Modality Scenarios in Semantic Segmentation | | 0
TriDeNT: Triple Deep Network Training for Privileged Knowledge Distillation in Histopathology | | 0
OplixNet: Towards Area-Efficient Optical Split-Complex Networks with Real-to-Complex Data Assignment and Knowledge Distillation | | 0
Enhancing and Adapting in the Clinic: Source-free Unsupervised Domain Adaptation for Medical Image Enhancement | Code | 1
S2P3: Self-Supervised Polarimetric Pose Prediction | | 0
Dual-Teacher De-biasing Distillation Framework for Multi-domain Fake News Detection | Code | 1
Generalized Robot 3D Vision-Language Model with Fast Rendering and Pre-Training Vision-Language Alignment | Code | 3
Compression of end-to-end non-autoregressive image-to-speech system for low-resourced devices | | 0
IAG: Induction-Augmented Generation Framework for Answering Reasoning Questions | | 0
Initializing Models with Larger Ones | Code | 1
LayerCollapse: Adaptive compression of neural networks | | 0
The Devil is in the Data: Learning Fair Graph Neural Networks via Partial Knowledge Distillation | Code | 0
Continual Learning for Image Segmentation with Dynamic Query | Code | 1
Propagate & Distill: Towards Effective Graph Learners Using Propagation-Embracing MLPs | | 0
PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation | Code | 1
LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS | Code | 2
FedAL: Black-Box Federated Knowledge Distillation Enabled by Adversarial Learning | | 0
DiffusionTalker: Personalization and Acceleration for Speech-Driven 3D Face Diffuser | | 0
Rethinking Intermediate Layers design in Knowledge Distillation for Kidney and Liver Tumor Segmentation | Code | 0
UFIN: Universal Feature Interaction Network for Multi-Domain Click-Through Rate Prediction | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | | Unverified
6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | | Unverified
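
Most entries above report Top-1 accuracy: the percentage of test samples for which the model's highest-scoring class matches the ground-truth label (the mAP and RMSE rows use task-specific detection and regression protocols instead). Below is a minimal PyTorch sketch of that evaluation loop; the model and data loader are generic placeholders, not artifacts from any benchmark above.

```python
import torch

@torch.no_grad()
def top1_accuracy(model, loader, device="cpu"):
    """Percentage of samples whose argmax prediction matches the label."""
    model.eval()
    correct, total = 0, 0
    for inputs, labels in loader:
        # Highest-scoring class per sample is the Top-1 prediction.
        preds = model(inputs.to(device)).argmax(dim=-1)
        correct += (preds.cpu() == labels).sum().item()
        total += labels.numel()
    return 100.0 * correct / total
```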