Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized; a compact student trained to mimic the teacher's outputs can therefore often recover most of the teacher's accuracy at a fraction of the inference cost.
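The classic recipe, due to Hinton et al. (2015), trains the student on a weighted combination of the usual hard-label cross-entropy and a temperature-softened KL divergence to the teacher's logits. The PyTorch sketch below illustrates the idea; the function name and the hyperparameter defaults (`T`, `alpha`) are illustrative choices, not taken from any paper listed on this page.

```python
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Soft-target distillation loss in the style of Hinton et al. (2015)."""
    # Temperature T > 1 softens both distributions, exposing the
    # teacher's relative preferences over the wrong classes.
    soft_targets = F.softmax(teacher_logits / T, dim=-1)
    student_log_probs = F.log_softmax(student_logits / T, dim=-1)
    # The T**2 factor keeps the soft-term gradient on the same scale
    # as the hard-label term when T changes.
    kd_term = F.kl_div(student_log_probs, soft_targets, reduction="batchmean") * T**2
    ce_term = F.cross_entropy(student_logits, labels)
    return alpha * kd_term + (1.0 - alpha) * ce_term
```

In training, the teacher's logits are computed under `torch.no_grad()` (or detached) so that only the student receives gradients.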

Papers

Showing 201–250 of 4,240 papers

| Title | Status | Hype |
| --- | --- | --- |
| OriGen: Enhancing RTL Code Generation with Code-to-Code Augmentation and Self-Reflection | Code | 1 |
| Knowledge Distillation Approaches for Accurate and Efficient Recommender System | Code | 1 |
| Exploring Deeper! Segment Anything Model with Depth Perception for Camouflaged Object Detection | Code | 1 |
| Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection | Code | 1 |
| Mitigating Background Shift in Class-Incremental Semantic Segmentation | Code | 1 |
| Discriminative and Consistent Representation Distillation | Code | 1 |
| Relational Representation Distillation | Code | 1 |
| LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection | Code | 1 |
| BKDSNN: Enhancing the Performance of Learning-based Spiking Neural Networks Training with Blurred Knowledge Distillation | Code | 1 |
| DASS: Distilled Audio State Space Models Are Stronger and More Duration-Scalable Learners | Code | 1 |
| AdaDistill: Adaptive Knowledge Distillation for Deep Face Recognition | Code | 1 |
| CSAKD: Knowledge Distillation with Cross Self-Attention for Hyperspectral and Multispectral Image Fusion | Code | 1 |
| ConStyle v2: A Strong Prompter for All-in-One Image Restoration | Code | 1 |
| Three-Stream Temporal-Shift Attention Network Based on Self-Knowledge Distillation for Micro-Expression Recognition | Code | 1 |
| MAGIC: Meta-Ability Guided Interactive Chain-of-Distillation for Effective-and-Efficient Vision-and-Language Navigation | Code | 1 |
| Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs | Code | 1 |
| BiLD: Bi-directional Logits Difference Loss for Large Language Model Distillation | Code | 1 |
| Lightweight Model Pre-training via Language Guided Knowledge Distillation | Code | 1 |
| Small Scale Data-Free Knowledge Distillation | Code | 1 |
| CTC-based Non-autoregressive Textless Speech-to-Speech Translation | Code | 1 |
| DKDL-Net: A Lightweight Bearing Fault Detection Model via Decoupled Knowledge Distillation and Low-Rank Adaptation Fine-tuning | Code | 1 |
| LenslessFace: An End-to-End Optimized Lensless System for Privacy-Preserving Face Verification | Code | 1 |
| Multi-Task Multi-Scale Contrastive Knowledge Distillation for Efficient Medical Image Segmentation | Code | 1 |
| Continual Collaborative Distillation for Recommender System | Code | 1 |
| SLMRec: Distilling Large Language Models into Small for Sequential Recommendation | Code | 1 |
| LoReTrack: Efficient and Accurate Low-Resolution Transformer Tracking | Code | 1 |
| Rethinking Early-Fusion Strategies for Improved Multispectral Object Detection | Code | 1 |
| 3D Annotation-Free Learning by Distilling 2D Open-Vocabulary Segmentation Models for Autonomous Driving | Code | 1 |
| JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis Models | Code | 1 |
| Recurrent Early Exits for Federated Learning with Heterogeneous Clients | Code | 1 |
| AMFD: Distillation via Adaptive Multimodal Fusion for Multispectral Pedestrian Detection | Code | 1 |
| CLRKDNet: Speeding up Lane Detection with Knowledge Distillation | Code | 1 |
| Overcoming Data and Model Heterogeneities in Decentralized Federated Learning via Synthetic Anchors | Code | 1 |
| Advancing Pre-trained Teacher: Towards Robust Feature Discrepancy for Anomaly Detection | Code | 1 |
| Distillation Matters: Empowering Sequential Recommenders to Match the Performance of Large Language Model | Code | 1 |
| CrossMatch: Enhance Semi-Supervised Medical Image Segmentation with Perturbation Strategies and Knowledge Distillation | Code | 1 |
| Retrieval-Oriented Knowledge for Click-Through Rate Prediction | Code | 1 |
| Dynamic Temperature Knowledge Distillation | Code | 1 |
| Camera clustering for scalable stream-based active distillation | Code | 1 |
| Digging into contrastive learning for robust depth estimation with diffusion models | Code | 1 |
| CLIP-Embed-KD: Computationally Efficient Knowledge Distillation Using Embeddings as Teachers | Code | 1 |
| MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object Detection | Code | 1 |
| Rethinking Kullback-Leibler Divergence in Knowledge Distillation for Large Language Models | Code | 1 |
| Rethinking Pruning for Vision-Language Models: Strategies for Effective Sparsity and Performance Restoration | Code | 1 |
| TSCM: A Teacher-Student Model for Vision Place Recognition Using Cross-Metric Knowledge Distillation | Code | 1 |
| PDF: A Probability-Driven Framework for Open World 3D Point Cloud Semantic Segmentation | Code | 1 |
| Orchestrate Latent Expertise: Advancing Online Continual Learning with Multi-Level Supervision and Reverse Self-Distillation | Code | 1 |
| KDMCSE: Knowledge Distillation Multimodal Sentence Embeddings with Adaptive Angular margin Contrastive Learning | Code | 1 |
| ToXCL: A Unified Framework for Toxic Speech Detection and Explanation | Code | 1 |
| iDAT: inverse Distillation Adapter-Tuning | Code | 1 |
Page 5 of 85

Benchmark Results

In the tables below, "T:" denotes the teacher model and "S:" the student. No independently reproduced values have been recorded yet, so the Verified column is empty and every entry is marked Unverified.

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | | Unverified |
| 2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | | Unverified |
| 3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | | Unverified |
| 4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | | Unverified |
| 5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | | Unverified |
| 6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | | Unverified |
| 7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | | Unverified |
| 8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | | Unverified |
| 9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | | Unverified |
| 10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | | Unverified |
| 2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | | Unverified |
| 3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | | Unverified |
| 4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | | Unverified |
| 5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | | Unverified |
| 6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | | Unverified |
| 7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | | Unverified |
| 8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | | Unverified |
| 9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | | Unverified |
| 10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | | Unverified |
| 2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | TIE-KD (T: AdaBins, S: MobileNetV2) | RMSE | 2.43 | | Unverified |
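A Claimed number moves to Verified only after independent reproduction. As a rough sketch of what that check involves for the Top-1 tables above, the snippet below assumes a trained student `model` and a labeled evaluation `loader` (both hypothetical names); it is not this site's actual verification harness.

```python
import torch

@torch.no_grad()
def top1_accuracy(model, loader, device="cpu"):
    """Top-1 accuracy (%) of a distilled student on a labeled dataset."""
    model.eval().to(device)
    correct, total = 0, 0
    for images, labels in loader:
        images, labels = images.to(device), labels.to(device)
        preds = model(images).argmax(dim=-1)  # class with the highest logit
        correct += (preds == labels).sum().item()
        total += labels.numel()
    return 100.0 * correct / total
```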