
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have a higher knowledge capacity than small models, that capacity is often not fully utilized, so a compact "student" model can frequently recover much of a large "teacher" model's accuracy. In the classic formulation (Hinton et al., 2015), the student is trained to reproduce the teacher's temperature-softened output probabilities in addition to fitting the ground-truth labels.
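To make the mechanism concrete, here is a minimal sketch of that classic logit-matching recipe in PyTorch. It is illustrative only, not the method of any specific paper listed below: teacher, student, loader, and optimizer are hypothetical placeholders, and the temperature and mixing weight are common but arbitrary defaults.

    import torch
    import torch.nn.functional as F

    def distillation_loss(student_logits, teacher_logits, labels,
                          temperature=4.0, alpha=0.5):
        # Soft-target term: KL divergence between the temperature-softened
        # teacher and student distributions. Scaling by T**2 keeps the
        # gradient magnitude comparable across temperatures.
        soft_student = F.log_softmax(student_logits / temperature, dim=-1)
        soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
        kd = F.kl_div(soft_student, soft_teacher, reduction="batchmean")
        kd = kd * temperature ** 2
        # Hard-label term: ordinary cross-entropy on the ground truth.
        ce = F.cross_entropy(student_logits, labels)
        return alpha * kd + (1.0 - alpha) * ce

    # Hypothetical training loop; teacher, student, loader, and optimizer
    # are assumed to be defined elsewhere.
    teacher.eval()
    for images, labels in loader:
        with torch.no_grad():  # the teacher is frozen
            teacher_logits = teacher(images)
        loss = distillation_loss(student(images), teacher_logits, labels)
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()

The papers on this page largely vary this template: what is matched (logits, intermediate features, relations), between which models (a single teacher, an ensemble, or the student itself), and on which task.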

Papers

Showing 801–850 of 4240 papers

Title | Status | Hype
CLIP-CID: Efficient CLIP Distillation via Cluster-Instance Discrimination | — | 0
MedMAP: Promoting Incomplete Multi-modal Brain Tumor Segmentation with Alignment | — | 0
V2X-VLM: End-to-End V2X Cooperative Autonomous Driving Through Large Vision-Language Models | — | 0
Multi Teacher Privileged Knowledge Distillation for Multimodal Expression Recognition | Code | 0
MIDAS: Multi-level Intent, Domain, And Slot Knowledge Distillation for Multi-turn NLU | Code | 0
Towards Real-time Video Compressive Sensing on Mobile Devices | Code | 0
One Step Diffusion-based Super-Resolution with Time-Aware Distillation | Code | 1
FedQUIT: On-Device Federated Unlearning via a Quasi-Competent Virtual Teacher | — | 0
Knowledge Distillation with Refined Logits | Code | 1
Using Advanced LLMs to Enhance Smaller LLMs: An Interpretable Knowledge Distillation Approach | — | 0
Optimizing Vision Transformers with Data-Free Knowledge Transfer | — | 0
Low-Dimensional Federated Knowledge Graph Embedding via Knowledge Distillation | — | 0
LaDiMo: Layer-wise Distillation Inspired MoEfier | — | 0
ComKD-CLIP: Comprehensive Knowledge Distillation for Contrastive Language-Image Pre-traning Model | — | 0
Distillation Learning Guided by Image Reconstruction for One-Shot Medical Image Segmentation | Code | 0
Real-time Event Recognition of Long-distance Distributed Vibration Sensing with Knowledge Distillation and Hardware Acceleration | Code | 1
Dual-Modeling Decouple Distillation for Unsupervised Anomaly Detection | — | 0
EEGMobile: Enhancing Speed and Accuracy in EEG-Based Gaze Prediction with Advanced Mobile Architectures | — | 0
Leveraging Entity Information for Cross-Modality Correlation Learning: The Entity-Guided Multimodal Summarization | Code | 0
Inference Optimizations for Large Language Models: Effects, Challenges, and Practical Considerations | — | 0
Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression | Code | 0
VizECGNet: Visual ECG Image Network for Cardiovascular Diseases Classification with Multi-Modal Training and Knowledge Distillation | — | 0
Low-Cost Self-Ensembles Based on Multi-Branch Transformation and Grouped Convolution | Code | 0
An approach to optimize inference of the DIART speaker diarization pipeline | — | 0
Unsupervised Domain Adaption Harnessing Vision-Language Pre-training | Code | 1
Do You Remember... the Future? Weak-to-Strong generalization in 3D Object Detection | Code | 0
Exploiting the Semantic Knowledge of Pre-trained Text-Encoders for Continual Learning | Code | 0
DistillGrasp: Integrating Features Correlation with Knowledge Distillation for Depth Completion of Transparent Objects | — | 0
Sentence-wise Speech Summarization: Task, Datasets, and End-to-End Modeling with LM Knowledge Distillation | — | 0
StyleRF-VolVis: Style Transfer of Neural Radiance Fields for Expressive Volume Visualization | — | 0
Gemma 2: Improving Open Language Models at a Practical Size | — | 0
Lifelong Person Search | — | 0
Dynamic Object Queries for Transformer-based Incremental Object Detection | — | 0
VIPeR: Visual Incremental Place Recognition with Adaptive Mining and Continual Learning | — | 0
Learning Effective Representations for Retrieval Using Self-Distillation with Adaptive Relevance Margins | — | 0
Pruning Large Language Models with Semi-Structural Adaptive Sparse Training | Code | 1
SalNAS: Efficient Saliency-prediction Neural Architecture Search with self-knowledge distillation | Code | 0
ActivityCLIP: Enhancing Group Activity Recognition by Mining Complementary Information from Text to Supplement Image Modality | — | 0
Overcoming Uncertain Incompleteness for Robust Multimodal Sequential Diagnosis Prediction via Curriculum Data Erasing Guided Knowledge Distillation | Code | 0
Mixture of Modular Experts: Distilling Knowledge from a Multilingual Teacher into Specialized Modular Language Models | Code | 0
LLAVADI: What Matters For Multimodal Large Language Models Distillation | — | 0
Logic Distillation: Learning from Code Function by Function for Planning and Decision-making | — | 0
Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network | — | 0
Modality-Balanced Learning for Multimedia Recommendation | Code | 1
Boosting Cross-Domain Point Classification via Distilling Relational Priors from 2D Transformers | Code | 0
Towards A Generalizable Pathology Foundation Model via Unified Knowledge Distillation | Code | 2
FedUD: Exploiting Unaligned Data for Cross-Platform Federated Click-Through Rate Prediction | — | 0
Leveraging Foundation Models via Knowledge Distillation in Multi-Object Tracking: Distilling DINOv2 Features to FairMOT | Code | 0
How to Train the Teacher Model for Effective Knowledge Distillation | Code | 0
NC-NCD: Novel Class Discovery for Node Classification | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | — | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | — | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | — | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | — | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | — | Unverified
6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | — | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | — | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | — | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | — | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | — | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | — | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | — | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | — | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | — | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | — | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | — | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | — | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | — | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | — | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | — | Unverified