SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized. In practice, the smaller student model is trained to reproduce the larger teacher's output distribution (its temperature-softened logits, or "soft targets"), typically in combination with the ordinary supervised loss on ground-truth labels.
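
As a concrete illustration, below is a minimal PyTorch sketch of the standard logit-distillation objective (temperature-scaled KL divergence to the teacher plus cross-entropy to the labels). The temperature T, weight alpha, batch size, and class count are illustrative placeholders, not values taken from any paper listed on this page.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Classic logit distillation: a weighted sum of
    (1) KL divergence between temperature-softened teacher and student distributions, and
    (2) ordinary cross-entropy against the ground-truth labels.
    T and alpha are illustrative hyperparameters."""
    # Soft targets: both distributions are smoothed by the temperature T.
    soft_student = F.log_softmax(student_logits / T, dim=-1)
    soft_teacher = F.softmax(teacher_logits / T, dim=-1)
    # The T**2 factor keeps the soft-target gradients at a comparable scale to the hard-label term.
    kd = F.kl_div(soft_student, soft_teacher, reduction="batchmean") * (T ** 2)
    ce = F.cross_entropy(student_logits, labels)
    return alpha * kd + (1 - alpha) * ce

# Toy usage: one random batch of 8 examples over 10 classes.
teacher_logits = torch.randn(8, 10)                       # would come from the frozen teacher
student_logits = torch.randn(8, 10, requires_grad=True)   # would come from the student being trained
labels = torch.randint(0, 10, (8,))
loss = distillation_loss(student_logits, teacher_logits, labels)
loss.backward()
```

Raising T softens both distributions and exposes more of the teacher's relative class similarities; alpha trades the soft-target term off against the hard-label term.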

Papers

Showing 1701–1750 of 4240 papers

Title | Status | Hype
Overcoming Uncertain Incompleteness for Robust Multimodal Sequential Diagnosis Prediction via Curriculum Data Erasing Guided Knowledge Distillation | Code | 0
Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network | - | 0
Boosting Cross-Domain Point Classification via Distilling Relational Priors from 2D Transformers | Code | 0
FedUD: Exploiting Unaligned Data for Cross-Platform Federated Click-Through Rate Prediction | - | 0
Leveraging Foundation Models via Knowledge Distillation in Multi-Object Tracking: Distilling DINOv2 Features to FairMOT | Code | 0
Peak-Controlled Logits Poisoning Attack in Federated Distillation | - | 0
Separating Novel Features for Logical Anomaly Detection: A Straightforward yet Effective Approach | - | 0
How to Train the Teacher Model for Effective Knowledge Distillation | Code | 0
NC-NCD: Novel Class Discovery for Node Classification | Code | 0
CoMoTo: Unpaired Cross-Modal Lesion Distillation Improves Breast Lesion Detection in Tomosynthesis | Code | 0
DDK: Distilling Domain Knowledge for Efficient Large Language Models | - | 0
Disentangling spatio-temporal knowledge for weakly supervised object detection and segmentation in surgical video | Code | 0
Generalizing Teacher Networks for Effective Knowledge Distillation Across Student Architectures | Code | 0
Synthetic Image Learning: Preserving Performance and Preventing Membership Inference Attacks | - | 0
Comprehensive Study on Performance Evaluation and Optimization of Model Compression: Bridging Traditional Deep Learning and Large Language Models | - | 0
SeqMIA: Sequential-Metric Based Membership Inference Attack | Code | 0
Distilling Vision-Language Foundation Models: A Data-Free Approach via Prompt Diversification | - | 0
Teach Harder, Learn Poorer: Rethinking Hard Sample Distillation for GNN-to-MLP Knowledge Distillation | Code | 0
Continual Panoptic Perception: Towards Multi-modal Incremental Interpretation of Remote Sensing Images | Code | 0
DFMSD: Dual Feature Masking Stage-wise Knowledge Distillation for Object Detection | - | 0
Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation | - | 0
QuIIL at T3 challenge: Towards Automation in Life-Saving Intervention Procedures from First-Person View | Code | 0
Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation | Code | 0
Continual Distillation Learning: Knowledge Distillation in Prompt-based Continual Learning | - | 0
Discovery of novel antimicrobial peptides with notable antibacterial potency by a LLM-based foundation model | - | 0
Learning Modality-agnostic Representation for Semantic Segmentation from Any Modalities | - | 0
MapDistill: Boosting Efficient Camera-based HD Map Construction via Camera-LiDAR Fusion Model Distillation | - | 0
Leave No Knowledge Behind During Knowledge Distillation: Towards Practical and Effective Knowledge Distillation for Code-Switching ASR Using Realistic Data | - | 0
Don't Throw Away Data: Better Sequence Knowledge Distillation | - | 0
Multi-Granularity Semantic Revision for Large Language Model Distillation | - | 0
Enhancing Weakly-Supervised Histopathology Image Segmentation with Knowledge Distillation on MIL-Based Pseudo-Labels | Code | 0
Background Adaptation with Residual Modeling for Exemplar-Free Class-Incremental Semantic Segmentation | - | 0
Minimizing PLM-Based Few-Shot Intent Detectors | Code | 0
Uplifting Range-View-based 3D Semantic Segmentation in Real-Time with Multi-Sensor Fusion | - | 0
From Easy to Hard: Learning Curricular Shape-aware Features for Robust Panoptic Scene Graph Generation | - | 0
A Survey on Symbolic Knowledge Distillation of Large Language Models | - | 0
3M-Health: Multimodal Multi-Teacher Knowledge Distillation for Mental Health Detection | Code | 0
SlideGCD: Slide-based Graph Collaborative Training with Knowledge Distillation for Whole Slide Image Classification | Code | 0
Knowledge distillation to effectively attain both region-of-interest and global semantics from an image where multiple objects appear | Code | 0
Adaptive Deep Iris Feature Extractor at Arbitrary Resolutions | - | 0
A Guide To Effectively Leveraging LLMs for Low-Resource Text Summarization: Data Augmentation and Semi-supervised Approaches | - | 0
LokiLM: Technical Report | - | 0
HDKD: Hybrid Data-Efficient Knowledge Distillation Network for Medical Image Classification | Code | 0
Less is More: Efficient Brain-Inspired Learning for Autonomous Driving Trajectory Prediction | - | 0
Enhancing Low-Resource NMT with a Multilingual Encoder and Knowledge Distillation: A Case Study | Code | 0
Reprogramming Distillation for Medical Foundation Models | Code | 0
DεpS: Delayed ε-Shrinking for Faster Once-For-All Training | - | 0
Federated Knowledge Transfer Fine-tuning Large Server Model with Resource-Constrained IoT Clients | - | 0
Topological Persistence Guided Knowledge Distillation for Wearable Sensor Data | - | 0
Leveraging Topological Guidance for Improved Knowledge Distillation | Code | 0
Page 35 of 85

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | - | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | - | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | - | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | - | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | - | Unverified
6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | - | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | - | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | - | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | - | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | - | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | - | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | - | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | - | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | - | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | - | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | - | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | - | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | - | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | - | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | - | Unverified