Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have a higher knowledge capacity than small models, this capacity may not be fully utilized. Distillation trains a small "student" model to reproduce the behavior of a large "teacher" model, often recovering much of the teacher's accuracy at a fraction of the inference cost.
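As a concrete illustration, below is a minimal sketch of the classic soft-target distillation loss of Hinton et al. (2015), assuming PyTorch; the temperature and mixing weight shown are illustrative defaults, not values taken from any paper or benchmark on this page.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.5):
    """Blend cross-entropy on hard labels with a KL term that pulls the
    student's softened predictions toward the teacher's."""
    # Soften both output distributions with the same temperature T.
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    log_soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    # KL divergence between the softened distributions; the T^2 factor
    # keeps gradient magnitudes comparable as T changes.
    kd_term = F.kl_div(log_soft_student, soft_teacher,
                       reduction="batchmean") * temperature ** 2
    # Standard supervised loss on the ground-truth labels.
    ce_term = F.cross_entropy(student_logits, labels)
    return alpha * kd_term + (1 - alpha) * ce_term

# Usage sketch (teacher and student are any classifiers with matching
# output dimensions; the teacher runs frozen, in eval mode):
#   teacher.eval()
#   with torch.no_grad():
#       teacher_logits = teacher(images)
#   loss = distillation_loss(student(images), teacher_logits, labels)
```

Many of the methods listed below refine this recipe (intermediate-layer features, cross-architecture transfer, data-free variants), but the teacher-supervises-student structure is the common core.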

Papers

Showing 2701–2750 of 4240 papers

Title | Status | Hype
Take a Prior from Other Tasks for Severe Blur Removal | – | 0
Learning from Noisy Crowd Labels with Logics | Code | 0
NYCU-TWO at Memotion 3: Good Foundation, Good Teacher, then you have Good Meme Analysis | – | 0
SCLIFD: Supervised Contrastive Knowledge Distillation for Incremental Fault Diagnosis under Limited Fault Data | – | 0
Feature Affinity Assisted Knowledge Distillation and Quantization of Deep Neural Networks on Label-Free Data | – | 0
SOCRATES: Text-based Human Search and Approach using a Robot Dog | – | 0
Toward Extremely Lightweight Distracted Driver Recognition With Distillation-Based Neural Architecture Search and Knowledge Transfer | Code | 0
Knowledge Distillation-based Information Sharing for Online Process Monitoring in Decentralized Manufacturing System | – | 0
SLaM: Student-Label Mixing for Distillation with Unlabeled Examples | – | 0
Enhancing Modality-Agnostic Representations via Meta-Learning for Brain Tumor Segmentation | – | 0
An Empirical Study of Uniform-Architecture Knowledge Distillation in Document Ranking | – | 0
Audio Representation Learning by Distilling Video as Privileged Information | – | 0
Knowledge Distillation in Vision Transformers: A Critical Review | – | 0
Heterogeneous Federated Knowledge Graph Embedding Learning and Unlearning | – | 0
Revisiting Intermediate Layer Distillation for Compressing Language Models: An Overfitting Perspective | Code | 0
Enhancing Once-For-All: A Study on Parallel Blocks, Skip Connections and Early Exits | – | 0
Generalized Uncertainty of Deep Neural Networks: Taxonomy and Applications | – | 0
Adaptive Search-and-Training for Robust and Efficient Network Pruning | Code | 0
Improved Knowledge Distillation for Pre-trained Language Models via Knowledge Selection | – | 0
Distill-DBDGAN: Knowledge Distillation and Adversarial Learning Framework for Defocus Blur Detection | Code | 0
Continual Segment: Towards a Single, Unified and Accessible Continual Segmentation Model of 143 Whole-body Organs in CT Scans | – | 0
Knowledge Distillation on Graphs: A Survey | – | 0
AMD: Adaptive Masked Distillation for Object Detection | – | 0
Knowledge Distillation ≈ Label Smoothing: Fact or Fallacy? | – | 0
On student-teacher deviations in distillation: does it pay to disobey? | – | 0
FractalAD: A simple industrial anomaly detection method using fractal anomaly generation and backbone knowledge distillation | Code | 0
Few-shot Face Image Translation via GAN Prior Distillation | – | 0
MVKT-ECG: Efficient Single-lead ECG Classification on Multi-Label Arrhythmia by Multi-View Knowledge Transferring | – | 0
Supervision Complexity and its Role in Knowledge Distillation | – | 0
Can We Use Probing to Better Understand Fine-tuning and Knowledge Distillation of the BERT NLU? | – | 0
Improved knowledge distillation by utilizing backward pass knowledge in neural networks | – | 0
EmbedDistill: A Geometric Knowledge Distillation for Information Retrieval | – | 0
Improving Text-based Early Prediction by Distillation from Privileged Time-Series Text | – | 0
A Simple Recipe for Competitive Low-compute Self-supervised Vision Models | – | 0
Unifying Synergies between Self-supervised Learning and Dynamic Computation | Code | 0
The Best of Both Worlds: Accurate Global and Personalized Models through Federated Learning with Data-Free Hyper-Knowledge Distillation | – | 0
ProKD: An Unsupervised Prototypical Knowledge Distillation Network for Zero-Resource Cross-Lingual Named Entity Recognition | – | 0
RNAS-CL: Robust Neural Architecture Search by Cross-Layer Knowledge Distillation | – | 0
Adaptively Integrated Knowledge Distillation and Prediction Uncertainty for Continual Learning | – | 0
Knowledge Distillation in Federated Edge Learning: A Survey | – | 0
A Cohesive Distillation Architecture for Neural Language Models | – | 0
Effective Decision Boundary Learning for Class Incremental Learning | – | 0
Synthetic data generation method for data-free knowledge distillation in regression neural networks | Code | 0
ERNIE 3.0 Tiny: Frustratingly Simple Method to Improve Task-Agnostic Distillation Generalization | Code | 0
Designing an Improved Deep Learning-based Model for COVID-19 Recognition in Chest X-ray Images: A Knowledge Distillation Approach | – | 0
RELIANT: Fair Knowledge Distillation for Graph Neural Networks | Code | 0
Knowledge-guided Causal Intervention for Weakly-supervised Object Localization | Code | 0
Open-Set Fine-Grained Retrieval via Prompting Vision-Language Evaluator | – | 0
CaPriDe Learning: Confidential and Private Decentralized Learning Based on Encryption-Friendly Distillation Loss | Code | 0
UniKD: Universal Knowledge Distillation for Mimicking Homogeneous or Heterogeneous Object Detectors | – | 0
Page 55 of 85

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | – | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | – | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | – | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | – | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | – | Unverified
6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | – | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | – | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | – | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | – | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | – | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 79.86 | – | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 78.76 | – | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 Accuracy (%) | 78.6 | – | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 78.28 | – | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 Accuracy (%) | 78.08 | – | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 Accuracy (%) | 77.93 | – | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 Accuracy (%) | 77.68 | – | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 77.5 | – | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.68 | – | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 Accuracy (%) | 76.31 | – | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | – | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | – | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | – | Unverified