SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Showing 851900 of 4240 papers

TitleStatusHype
NC-NCD: Novel Class Discovery for Node ClassificationCode0
Separating Novel Features for Logical Anomaly Detection: A Straightforward yet Effective Approach0
How to Train the Teacher Model for Effective Knowledge DistillationCode0
CoMoTo: Unpaired Cross-Modal Lesion Distillation Improves Breast Lesion Detection in TomosynthesisCode0
DDK: Distilling Domain Knowledge for Efficient Large Language Models0
OriGen:Enhancing RTL Code Generation with Code-to-Code Augmentation and Self-ReflectionCode1
Generalizing Teacher Networks for Effective Knowledge Distillation Across Student ArchitecturesCode0
Comprehensive Study on Performance Evaluation and Optimization of Model Compression: Bridging Traditional Deep Learning and Large Language Models0
Disentangling spatio-temporal knowledge for weakly supervised object detection and segmentation in surgical videoCode0
Synthetic Image Learning: Preserving Performance and Preventing Membership Inference Attacks0
Distilling Vision-Language Foundation Models: A Data-Free Approach via Prompt Diversification0
SeqMIA: Sequential-Metric Based Membership Inference AttackCode0
Teach Harder, Learn Poorer: Rethinking Hard Sample Distillation for GNN-to-MLP Knowledge DistillationCode0
Compact Language Models via Pruning and Knowledge DistillationCode3
Continual Panoptic Perception: Towards Multi-modal Incremental Interpretation of Remote Sensing ImagesCode0
ESP-MedSAM: Efficient Self-Prompting SAM for Universal Domain-Generalized Medical Image SegmentationCode2
Knowledge Distillation Approaches for Accurate and Efficient Recommender SystemCode1
Continual Distillation Learning: Knowledge Distillation in Prompt-based Continual Learning0
DFMSD: Dual Feature Masking Stage-wise Knowledge Distillation for Object Detection0
QuIIL at T3 challenge: Towards Automation in Life-Saving Intervention Procedures from First-Person ViewCode0
Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation0
Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic SegmentationCode0
Discovery of novel antimicrobial peptides with notable antibacterial potency by a LLM-based foundation model0
Exploring Deeper! Segment Anything Model with Depth Perception for Camouflaged Object DetectionCode1
Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object DetectionCode1
Learning Modality-agnostic Representation for Semantic Segmentation from Any Modalities0
MapDistill: Boosting Efficient Camera-based HD Map Construction via Camera-LiDAR Fusion Model Distillation0
Mitigating Background Shift in Class-Incremental Semantic SegmentationCode1
Progressive Pretext Task Learning for Human Trajectory PredictionCode2
Relational Representation DistillationCode1
Discriminative and Consistent Representation DistillationCode1
Don't Throw Away Data: Better Sequence Knowledge Distillation0
Leave No Knowledge Behind During Knowledge Distillation: Towards Practical and Effective Knowledge Distillation for Code-Switching ASR Using Realistic Data0
Accessing Vision Foundation Models at ImageNet-level CostsCode2
Multi-Granularity Semantic Revision for Large Language Model Distillation0
Enhancing Weakly-Supervised Histopathology Image Segmentation with Knowledge Distillation on MIL-Based Pseudo-LabelsCode0
LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object DetectionCode1
Minimizing PLM-Based Few-Shot Intent DetectorsCode0
Background Adaptation with Residual Modeling for Exemplar-Free Class-Incremental Semantic Segmentation0
A Survey on Symbolic Knowledge Distillation of Large Language Models0
Uplifting Range-View-based 3D Semantic Segmentation in Real-Time with Multi-Sensor Fusion0
BKDSNN: Enhancing the Performance of Learning-based Spiking Neural Networks Training with Blurred Knowledge DistillationCode1
SlideGCD: Slide-based Graph Collaborative Training with Knowledge Distillation for Whole Slide Image ClassificationCode0
3M-Health: Multimodal Multi-Teacher Knowledge Distillation for Mental Health DetectionCode0
From Easy to Hard: Learning Curricular Shape-aware Features for Robust Panoptic Scene Graph Generation0
Adaptive Deep Iris Feature Extractor at Arbitrary Resolutions0
Knowledge distillation to effectively attain both region-of-interest and global semantics from an image where multiple objects appearCode0
LokiLM: Technical Report0
HDKD: Hybrid Data-Efficient Knowledge Distillation Network for Medical Image ClassificationCode0
A Guide To Effectively Leveraging LLMs for Low-Resource Text Summarization: Data Augmentation and Semi-supervised Approaches0
Show:102550
← PrevPage 18 of 85Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ScaleKD (T:BEiT-L S:ViT-B/14)Top-1 accuracy %86.43Unverified
2ScaleKD (T:Swin-L S:ViT-B/16)Top-1 accuracy %85.53Unverified
3ScaleKD (T:Swin-L S:ViT-S/16)Top-1 accuracy %83.93Unverified
4ScaleKD (T:Swin-L S:Swin-T)Top-1 accuracy %83.8Unverified
5KD++(T: regnety-16GF S:ViT-B)Top-1 accuracy %83.6Unverified
6VkD (T:RegNety 160 S:DeiT-S)Top-1 accuracy %82.9Unverified
7SpectralKD (T:Swin-S S:Swin-T)Top-1 accuracy %82.7Unverified
8ScaleKD (T:Swin-L S:ResNet-50)Top-1 accuracy %82.55Unverified
9DiffKD (T:Swin-L S: Swin-T)Top-1 accuracy %82.5Unverified
10DIST (T: Swin-L S: Swin-T)Top-1 accuracy %82.3Unverified
#ModelMetricClaimedVerifiedStatus
1SRD (T:resnet-32x4, S:shufflenet-v2)Top-1 Accuracy (%)79.86Unverified
2shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)Top-1 Accuracy (%)78.76Unverified
3MV-MR (T: CLIP/ViT-B-16 S: resnet50)Top-1 Accuracy (%)78.6Unverified
4resnet8x4 (T: resnet32x4 S: resnet8x4)Top-1 Accuracy (%)78.28Unverified
5resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])Top-1 Accuracy (%)78.08Unverified
6ReviewKD++(T:resnet-32x4, S:shufflenet-v2)Top-1 Accuracy (%)77.93Unverified
7ReviewKD++(T:resnet-32x4, S:shufflenet-v1)Top-1 Accuracy (%)77.68Unverified
8resnet8x4 (T: resnet32x4 S: resnet8x4)Top-1 Accuracy (%)77.5Unverified
9resnet8x4 (T: resnet32x4 S: resnet8x4)Top-1 Accuracy (%)76.68Unverified
10resnet8x4 (T: resnet32x4 S: resnet8x4)Top-1 Accuracy (%)76.31Unverified
#ModelMetricClaimedVerifiedStatus
1LSHFM (T: ResNet101 S: ResNet50)mAP93.17Unverified
2LSHFM (T: ResNet101 S: MobileNetV2)mAP90.14Unverified
#ModelMetricClaimedVerifiedStatus
1TIE-KD (T: Adabins S: MobileNetV2)RMSE2.43Unverified