
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model (the teacher) to a smaller one (the student). While large models, such as very deep neural networks or ensembles of many models, have a higher knowledge capacity than small models, that capacity may not be fully utilized; distillation exploits this by training a compact student to reproduce the teacher's behavior, retaining much of its accuracy at a fraction of the inference cost.
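
In its classic soft-target form (Hinton et al., 2015), the student is trained to match the teacher's temperature-softened output distribution alongside the ground-truth labels. Below is a minimal PyTorch sketch of that loss; the temperature and weighting values are illustrative defaults, not settings taken from any paper listed on this page, and the papers below each extend or replace this basic recipe.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.5):
    """Soft-target knowledge distillation loss (Hinton et al., 2015 style).

    Blends cross-entropy on the ground-truth labels with a KL term that
    pulls the student's softened predictions toward the teacher's.
    `temperature` and `alpha` are illustrative defaults, not values
    prescribed by any paper on this page.
    """
    # Temperature-softened distributions; the T^2 factor rescales the
    # KL gradients so they stay comparable to the cross-entropy term.
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    log_soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    kd_term = F.kl_div(log_soft_student, soft_teacher,
                       reduction="batchmean") * temperature ** 2
    ce_term = F.cross_entropy(student_logits, labels)
    return alpha * kd_term + (1.0 - alpha) * ce_term
```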

Papers

Showing 2651–2700 of 4240 papers

Title | Status | Hype
Digital Twin-Assisted Knowledge Distillation Framework for Heterogeneous Federated Learning | - | 0
Dynamic Y-KD: A Hybrid Approach to Continual Instance Segmentation | - | 0
Robust Knowledge Distillation from RNN-T Models With Noisy Training Labels Using Full-Sum Loss | - | 0
Learning the Wrong Lessons: Inserting Trojans During Knowledge Distillation | - | 0
NIFF: Alleviating Forgetting in Generalized Few-Shot Object Detection via Neural Instance Feature Forging | - | 0
Gradient-Guided Knowledge Distillation for Object Detectors | - | 0
Adaptive Knowledge Distillation between Text and Speech Pre-trained Models | - | 0
PreFallKD: Pre-Impact Fall Detection via CNN-ViT Knowledge Distillation | Code | 0
KDSM: An uplift modeling framework based on knowledge distillation and sample matching | - | 0
Students Parrot Their Teachers: Membership Inference on Model Distillation | - | 0
IKD+: Reliable Low Complexity Deep Models For Retinopathy Classification | - | 0
X^3KD: Knowledge Distillation Across Modalities, Tasks and Stages for Multi-Camera 3D Object Detection | - | 0
Pre-trained Model Representations and their Robustness against Noise for Speech Emotion Analysis | - | 0
Unsupervised Deep Digital Staining For Microscopic Cell Images Via Knowledge Distillation | - | 0
Letz Translate: Low-Resource Machine Translation for Luxembourgish | - | 0
Distilling Multi-Level X-vector Knowledge for Small-footprint Speaker Verification | - | 0
Weakly-supervised HOI Detection via Prior-guided Bi-level Representation Learning | - | 0
Distilled Reverse Attention Network for Open-world Compositional Zero-Shot Learning | - | 0
Backdoor for Debias: Mitigating Model Bias with Backdoor Attack-based Artificial Bias | Code | 0
Towards domain generalisation in ASR with elitist sampling and ensemble knowledge distillation | - | 0
Incremental Learning of Acoustic Scenes and Sound Events | - | 0
Learning to Retain while Acquiring: Combating Distribution-Shift in Adversarial Data-Free Knowledge Distillation | - | 0
Language-Universal Adapter Learning with Knowledge Distillation for End-to-End Multilingual Speech Recognition | Code | 0
Leveraging Angular Distributions for Improved Knowledge Distillation | - | 0
A Light-weight Deep Learning Model for Remote Sensing Image Classification | - | 0
Ensemble knowledge distillation of self-supervised speech models | - | 0
A Knowledge Distillation framework for Multi-Organ Segmentation of Medaka Fish in Tomographic Image | - | 0
Teacher Intervention: Improving Convergence of Quantization Aware Training for Ultra-Low Precision Transformers | Code | 0
Personalized Decentralized Federated Learning with Knowledge Distillation | - | 0
Exploring Social Media for Early Detection of Depression in COVID-19 Patients | Code | 0
Practical Knowledge Distillation: Using DNNs to Beat DNNs | - | 0
Debiased Distillation by Transplanting the Last Layer | - | 0
Distilling Calibrated Student from an Uncalibrated Teacher | - | 0
KS-DETR: Knowledge Sharing in Attention Learning for Detection Transformer | Code | 0
CADIS: Handling Cluster-skewed Non-IID Data in Federated Learning with Clustered Aggregation and Knowledge DIStilled Regularization | Code | 0
Two-in-one Knowledge Distillation for Efficient Facial Forgery Detection | - | 0
The Role of Masking for Efficient Supervised Knowledge Distillation of Vision Transformers | - | 0
HomoDistil: Homotopic Task-Agnostic Distillation of Pre-trained Transformers | - | 0
RobustDistiller: Compressing Universal Speech Representations for Enhanced Environment Robustness | - | 0
Fairly Predicting Graft Failure in Liver Transplant for Organ Assigning | - | 0
Explicit and Implicit Knowledge Distillation via Unlabeled Data | - | 0
Few-shot 3D LiDAR Semantic Segmentation for Autonomous Driving | - | 0
Learning From Biased Soft Labels | - | 0
Cross Modal Distillation for Flood Extent Mapping | - | 0
Fuzzy Knowledge Distillation from High-Order TSK to Low-Order TSK | - | 0
LEALLA: Learning Lightweight Language-agnostic Sentence Embeddings with Knowledge Distillation | - | 0
New Insights on Relieving Task-Recency Bias for Online Class Incremental Learning | Code | 0
ST-MFNet Mini: Knowledge Distillation-Driven Frame Interpolation | Code | 0
Offline-to-Online Knowledge Distillation for Video Instance Segmentation | - | 0
A lightweight network for photovoltaic cell defect detection in electroluminescence images based on neural architecture search and knowledge distillation | - | 0
Page 54 of 85

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | - | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | - | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | - | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | - | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | - | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | - | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | - | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | - | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | - | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | - | Unverified
2 | shufflenet-v2 (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | - | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | - | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | - | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | - | Unverified
6 | ReviewKD++ (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | - | Unverified
7 | ReviewKD++ (T: resnet32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | - | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | - | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | - | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | - | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | - | Unverified