SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Showing 29763000 of 4240 papers

TitleStatusHype
Continual Learning for Fake Audio Detection0
Continual Learning for Neural Machine Translation0
Unsupervised Continual Learning Via Pseudo Labels0
Continual Learning with Diffusion-based Generative Replay for Industrial Streaming Data0
Continual Learning with Dirichlet Generative-based Rehearsal0
Continual Segment: Towards a Single, Unified and Accessible Continual Segmentation Model of 143 Whole-body Organs in CT Scans0
Continual Segment: Towards a Single, Unified and Non-forgetting Continual Segmentation Model of 143 Whole-body Organs in CT Scans0
Continual Self-Supervised Learning with Masked Autoencoders in Remote Sensing0
Continuation KD: Improved Knowledge Distillation through the Lens of Continuation Optimization0
Continuous Concepts Removal in Text-to-image Diffusion Models0
Continuous sign language recognition based on cross-resolution knowledge distillation0
Contrastive Continual Multi-view Clustering with Filtered Structural Fusion0
Contrastive Learning-Based Spectral Knowledge Distillation for Multi-Modality and Missing Modality Scenarios in Semantic Segmentation0
Contrastive Representation Distillation via Multi-Scale Feature Decoupling0
Contrast R-CNN for Continual Learning in Object Detection0
Contrast-reconstruction Representation Learning for Self-supervised Skeleton-based Action Recognition0
Controlling the Quality of Distillation in Response-Based Network Compression0
Control Policy Correction Framework for Reinforcement Learning-based Energy Arbitrage Strategies0
Convolutional Neural Network Compression through Generalized Kronecker Product Decomposition0
Cooperative Denoising for Distantly Supervised Relation Extraction0
Cooperative Learning for Cost-Adaptive Inference0
Coordinating Cross-modal Distillation for Molecular Property Prediction0
ChromaDistill: Colorizing Monochrome Radiance Fields with Knowledge Distillation0
CoroNetGAN: Controlled Pruning of GANs via Hypernetworks0
Corrected with the Latest Version: Make Robust Asynchronous Federated Learning Possible0
Show:102550
← PrevPage 120 of 170Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ScaleKD (T:BEiT-L S:ViT-B/14)Top-1 accuracy %86.43Unverified
2ScaleKD (T:Swin-L S:ViT-B/16)Top-1 accuracy %85.53Unverified
3ScaleKD (T:Swin-L S:ViT-S/16)Top-1 accuracy %83.93Unverified
4ScaleKD (T:Swin-L S:Swin-T)Top-1 accuracy %83.8Unverified
5KD++(T: regnety-16GF S:ViT-B)Top-1 accuracy %83.6Unverified
6VkD (T:RegNety 160 S:DeiT-S)Top-1 accuracy %82.9Unverified
7SpectralKD (T:Swin-S S:Swin-T)Top-1 accuracy %82.7Unverified
8ScaleKD (T:Swin-L S:ResNet-50)Top-1 accuracy %82.55Unverified
9DiffKD (T:Swin-L S: Swin-T)Top-1 accuracy %82.5Unverified
10DIST (T: Swin-L S: Swin-T)Top-1 accuracy %82.3Unverified
#ModelMetricClaimedVerifiedStatus
1SRD (T:resnet-32x4, S:shufflenet-v2)Top-1 Accuracy (%)79.86Unverified
2shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)Top-1 Accuracy (%)78.76Unverified
3MV-MR (T: CLIP/ViT-B-16 S: resnet50)Top-1 Accuracy (%)78.6Unverified
4resnet8x4 (T: resnet32x4 S: resnet8x4)Top-1 Accuracy (%)78.28Unverified
5resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])Top-1 Accuracy (%)78.08Unverified
6ReviewKD++(T:resnet-32x4, S:shufflenet-v2)Top-1 Accuracy (%)77.93Unverified
7ReviewKD++(T:resnet-32x4, S:shufflenet-v1)Top-1 Accuracy (%)77.68Unverified
8resnet8x4 (T: resnet32x4 S: resnet8x4)Top-1 Accuracy (%)77.5Unverified
9resnet8x4 (T: resnet32x4 S: resnet8x4)Top-1 Accuracy (%)76.68Unverified
10resnet8x4 (T: resnet32x4 S: resnet8x4)Top-1 Accuracy (%)76.31Unverified
#ModelMetricClaimedVerifiedStatus
1LSHFM (T: ResNet101 S: ResNet50)mAP93.17Unverified
2LSHFM (T: ResNet101 S: MobileNetV2)mAP90.14Unverified
#ModelMetricClaimedVerifiedStatus
1TIE-KD (T: Adabins S: MobileNetV2)RMSE2.43Unverified