Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1651–1700 of 4240 papers

Title	Date	Tasks	Status
Condensed Sample-Guided Model Inversion for Knowledge Distillation	Aug 25, 2024	Knowledge Distillationmodel	—Unverified
Growing Deep Neural Network Considering with Similarity between Neurons	Aug 23, 2024	Decision MakingKnowledge Distillation	—Unverified
Foundational Model for Electron Micrograph Analysis: Instruction-Tuning Small-Scale Language-and-Vision Assistant for Enterprise Adoption	Aug 23, 2024	Instruction FollowingKnowledge Distillation	—Unverified
Rebalancing Multi-Label Class-Incremental Learning	Aug 22, 2024	class-incremental learningClass Incremental Learning	—Unverified
Aligning (Medical) LLMs for (Counterfactual) Fairness	Aug 22, 2024	counterfactualFairness	CodeCode Available
Interactive DualChecker for Mitigating Hallucinations in Distilling Large Language Models	Aug 22, 2024	In-Context LearningKnowledge Distillation	—Unverified
Vision-Based Detection of Uncooperative Targets and Components on Small Satellites	Aug 22, 2024	Knowledge Distillation	—Unverified
LAKD-Activation Mapping Distillation Based on Local Learning	Aug 21, 2024	Knowledge Distillation	—Unverified
A Unified Framework for Continual Learning and Unlearning	Aug 21, 2024	Continual LearningKnowledge Distillation	—Unverified
Domain-invariant Progressive Knowledge Distillation for UAV-based Object Detection	Aug 21, 2024	Knowledge DistillationObject	—Unverified
SAM-COD: SAM-guided Unified Framework for Weakly-Supervised Camouflaged Object Detection	Aug 20, 2024	Knowledge Distillationobject-detection	—Unverified
Adaptive Knowledge Distillation for Classification of Hand Images using Explainable Vision Transformers	Aug 20, 2024	Knowledge Distillation	—Unverified
Generating Synthetic Fair Syntax-agnostic Data by Learning and Distilling Fair Representation	Aug 20, 2024	FairnessKnowledge Distillation	—Unverified
OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras	Aug 18, 2024	Autonomous DrivingDomain Adaptation	CodeCode Available
MedMAP: Promoting Incomplete Multi-modal Brain Tumor Segmentation with Alignment	Aug 18, 2024	Brain Tumor SegmentationDomain Adaptation	—Unverified
CLIP-CID: Efficient CLIP Distillation via Cluster-Instance Discrimination	Aug 18, 2024	Knowledge DistillationTransfer Learning	—Unverified
V2X-VLM: End-to-End V2X Cooperative Autonomous Driving Through Large Vision-Language Models	Aug 17, 2024	Autonomous DrivingContrastive Learning	—Unverified
Multi Teacher Privileged Knowledge Distillation for Multimodal Expression Recognition	Aug 16, 2024	Emotion RecognitionKnowledge Distillation	CodeCode Available
MIDAS: Multi-level Intent, Domain, And Slot Knowledge Distillation for Multi-turn NLU	Aug 15, 2024	domain classificationIntent Detection	CodeCode Available
Towards Real-time Video Compressive Sensing on Mobile Devices	Aug 14, 2024	Compressive SensingKnowledge Distillation	CodeCode Available
FedQUIT: On-Device Federated Unlearning via a Quasi-Competent Virtual Teacher	Aug 14, 2024	Federated LearningKnowledge Distillation	—Unverified
Using Advanced LLMs to Enhance Smaller LLMs: An Interpretable Knowledge Distillation Approach	Aug 13, 2024	Knowledge Distillation	—Unverified
Optimizing Vision Transformers with Data-Free Knowledge Transfer	Aug 12, 2024	Knowledge Distillationobject-detection	—Unverified
Low-Dimensional Federated Knowledge Graph Embedding via Knowledge Distillation	Aug 11, 2024	Graph EmbeddingKnowledge Distillation	—Unverified
ComKD-CLIP: Comprehensive Knowledge Distillation for Contrastive Language-Image Pre-traning Model	Aug 8, 2024	Contrastive LearningKnowledge Distillation	—Unverified
LaDiMo: Layer-wise Distillation Inspired MoEfier	Aug 8, 2024	Knowledge DistillationMixture-of-Experts	—Unverified
Dual-Modeling Decouple Distillation for Unsupervised Anomaly Detection	Aug 7, 2024	Anomaly DetectionAnomaly Localization	—Unverified
Distillation Learning Guided by Image Reconstruction for One-Shot Medical Image Segmentation	Aug 7, 2024	Data AugmentationImage Reconstruction	CodeCode Available
Leveraging Entity Information for Cross-Modality Correlation Learning: The Entity-Guided Multimodal Summarization	Aug 6, 2024	Knowledge DistillationLanguage Modeling	CodeCode Available
Inference Optimizations for Large Language Models: Effects, Challenges, and Practical Considerations	Aug 6, 2024	Knowledge DistillationNavigate	—Unverified
Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression	Aug 6, 2024	image-classificationImage Classification	CodeCode Available
VizECGNet: Visual ECG Image Network for Cardiovascular Diseases Classification with Multi-Modal Training and Knowledge Distillation	Aug 6, 2024	ECG ClassificationKnowledge Distillation	—Unverified
EEGMobile: Enhancing Speed and Accuracy in EEG-Based Gaze Prediction with Advanced Mobile Architectures	Aug 6, 2024	Brain Computer InterfaceEEG	—Unverified
An approach to optimize inference of the DIART speaker diarization pipeline	Aug 5, 2024	Inference OptimizationKnowledge Distillation	—Unverified
Low-Cost Self-Ensembles Based on Multi-Branch Transformation and Grouped Convolution	Aug 5, 2024	ClassificationDiversity	CodeCode Available
Do You Remember . . . the Future? Weak-to-Strong generalization in 3D Object Detection	Aug 3, 2024	3D Object DetectionKnowledge Distillation	CodeCode Available
Exploiting the Semantic Knowledge of Pre-trained Text-Encoders for Continual Learning	Aug 2, 2024	Continual LearningKnowledge Distillation	CodeCode Available
DistillGrasp: Integrating Features Correlation with Knowledge Distillation for Depth Completion of Transparent Objects	Aug 1, 2024	Depth CompletionFeature Correlation	—Unverified
Sentence-wise Speech Summarization: Task, Datasets, and End-to-End Modeling with LM Knowledge Distillation	Aug 1, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
StyleRF-VolVis: Style Transfer of Neural Radiance Fields for Expressive Volume Visualization	Jul 31, 2024	Knowledge DistillationNeRF	—Unverified
Gemma 2: Improving Open Language Models at a Practical Size	Jul 31, 2024	Knowledge Distillation	—Unverified
Dynamic Object Queries for Transformer-based Incremental Object Detection	Jul 31, 2024	Knowledge DistillationObject	—Unverified
Learning Effective Representations for Retrieval Using Self-Distillation with Adaptive Relevance Margins	Jul 31, 2024	Knowledge DistillationLanguage Modeling	—Unverified
VIPeR: Visual Incremental Place Recognition with Adaptive Mining and Continual Learning	Jul 31, 2024	Continual LearningKnowledge Distillation	—Unverified
Lifelong Person Search	Jul 31, 2024	Knowledge DistillationPerson Search	—Unverified
SalNAS: Efficient Saliency-prediction Neural Architecture Search with self-knowledge distillation	Jul 29, 2024	DecoderKnowledge Distillation	CodeCode Available
ActivityCLIP: Enhancing Group Activity Recognition by Mining Complementary Information from Text to Supplement Image Modality	Jul 29, 2024	Activity RecognitionGroup Activity Recognition	—Unverified
Logic Distillation: Learning from Code Function by Function for Planning and Decision-making	Jul 28, 2024	Decision MakingKnowledge Distillation	—Unverified
Mixture of Modular Experts: Distilling Knowledge from a Multilingual Teacher into Specialized Modular Language Models	Jul 28, 2024	Knowledge DistillationMixture-of-Experts	CodeCode Available
Overcoming Uncertain Incompleteness for Robust Multimodal Sequential Diagnosis Prediction via Curriculum Data Erasing Guided Knowledge Distillation	Jul 28, 2024	Knowledge DistillationSequential Diagnosis	CodeCode Available

Show:10 25 50

← PrevPage 34 of 85Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified