SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Showing 17261750 of 4240 papers

TitleStatusHype
Learning Modality-agnostic Representation for Semantic Segmentation from Any Modalities0
MapDistill: Boosting Efficient Camera-based HD Map Construction via Camera-LiDAR Fusion Model Distillation0
Leave No Knowledge Behind During Knowledge Distillation: Towards Practical and Effective Knowledge Distillation for Code-Switching ASR Using Realistic Data0
Don't Throw Away Data: Better Sequence Knowledge Distillation0
Multi-Granularity Semantic Revision for Large Language Model Distillation0
Enhancing Weakly-Supervised Histopathology Image Segmentation with Knowledge Distillation on MIL-Based Pseudo-LabelsCode0
Background Adaptation with Residual Modeling for Exemplar-Free Class-Incremental Semantic Segmentation0
Minimizing PLM-Based Few-Shot Intent DetectorsCode0
Uplifting Range-View-based 3D Semantic Segmentation in Real-Time with Multi-Sensor Fusion0
From Easy to Hard: Learning Curricular Shape-aware Features for Robust Panoptic Scene Graph Generation0
A Survey on Symbolic Knowledge Distillation of Large Language Models0
3M-Health: Multimodal Multi-Teacher Knowledge Distillation for Mental Health DetectionCode0
SlideGCD: Slide-based Graph Collaborative Training with Knowledge Distillation for Whole Slide Image ClassificationCode0
Knowledge distillation to effectively attain both region-of-interest and global semantics from an image where multiple objects appearCode0
Adaptive Deep Iris Feature Extractor at Arbitrary Resolutions0
A Guide To Effectively Leveraging LLMs for Low-Resource Text Summarization: Data Augmentation and Semi-supervised Approaches0
LokiLM: Technical Report0
HDKD: Hybrid Data-Efficient Knowledge Distillation Network for Medical Image ClassificationCode0
Less is More: Efficient Brain-Inspired Learning for Autonomous Driving Trajectory Prediction0
Enhancing Low-Resource NMT with a Multilingual Encoder and Knowledge Distillation: A Case StudyCode0
Reprogramming Distillation for Medical Foundation ModelsCode0
DεpS: Delayed ε-Shrinking for Faster Once-For-All Training0
Federated Knowledge Transfer Fine-tuning Large Server Model with Resource-Constrained IoT Clients0
Topological Persistence Guided Knowledge Distillation for Wearable Sensor Data0
Leveraging Topological Guidance for Improved Knowledge DistillationCode0
Show:102550
← PrevPage 70 of 170Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ScaleKD (T:BEiT-L S:ViT-B/14)Top-1 accuracy %86.43Unverified
2ScaleKD (T:Swin-L S:ViT-B/16)Top-1 accuracy %85.53Unverified
3ScaleKD (T:Swin-L S:ViT-S/16)Top-1 accuracy %83.93Unverified
4ScaleKD (T:Swin-L S:Swin-T)Top-1 accuracy %83.8Unverified
5KD++(T: regnety-16GF S:ViT-B)Top-1 accuracy %83.6Unverified
6VkD (T:RegNety 160 S:DeiT-S)Top-1 accuracy %82.9Unverified
7SpectralKD (T:Swin-S S:Swin-T)Top-1 accuracy %82.7Unverified
8ScaleKD (T:Swin-L S:ResNet-50)Top-1 accuracy %82.55Unverified
9DiffKD (T:Swin-L S: Swin-T)Top-1 accuracy %82.5Unverified
10DIST (T: Swin-L S: Swin-T)Top-1 accuracy %82.3Unverified
#ModelMetricClaimedVerifiedStatus
1SRD (T:resnet-32x4, S:shufflenet-v2)Top-1 Accuracy (%)79.86Unverified
2shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)Top-1 Accuracy (%)78.76Unverified
3MV-MR (T: CLIP/ViT-B-16 S: resnet50)Top-1 Accuracy (%)78.6Unverified
4resnet8x4 (T: resnet32x4 S: resnet8x4)Top-1 Accuracy (%)78.28Unverified
5resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])Top-1 Accuracy (%)78.08Unverified
6ReviewKD++(T:resnet-32x4, S:shufflenet-v2)Top-1 Accuracy (%)77.93Unverified
7ReviewKD++(T:resnet-32x4, S:shufflenet-v1)Top-1 Accuracy (%)77.68Unverified
8resnet8x4 (T: resnet32x4 S: resnet8x4)Top-1 Accuracy (%)77.5Unverified
9resnet8x4 (T: resnet32x4 S: resnet8x4)Top-1 Accuracy (%)76.68Unverified
10resnet8x4 (T: resnet32x4 S: resnet8x4)Top-1 Accuracy (%)76.31Unverified
#ModelMetricClaimedVerifiedStatus
1LSHFM (T: ResNet101 S: ResNet50)mAP93.17Unverified
2LSHFM (T: ResNet101 S: MobileNetV2)mAP90.14Unverified
#ModelMetricClaimedVerifiedStatus
1TIE-KD (T: Adabins S: MobileNetV2)RMSE2.43Unverified