SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Showing 21012150 of 4240 papers

TitleStatusHype
Knowledge Distillation from Language-Oriented to Emergent Communication for Multi-Agent Remote Control0
A Novel Garment Transfer Method Supervised by Distilled Knowledge of Virtual Try-on Model0
Contrastive Learning in Distilled ModelsCode0
Zoom-shot: Fast and Efficient Unsupervised Zero-Shot Transfer of CLIP to Vision Encoders with Multimodal Loss0
Keep Decoding Parallel with Effective Knowledge Distillation from Language Models to End-to-end Speech Recognisers0
Knowledge Distillation on Spatial-Temporal Graph Convolutional Network for Traffic Prediction0
Robustness to distribution shifts of compressed networks for edge devices0
Stereo-Matching Knowledge Distilled Monocular Depth Estimation Filtered by Multiple Disparity Consistency0
Confidence Preservation Property in Knowledge Distillation Abstractions0
Enhancing Scalability in Recommender Systems through Lottery Ticket Hypothesis and Knowledge Distillation-based Neural Network Pruning0
Model Compression Techniques in Biometrics Applications: A SurveyCode0
Cross-Level Multi-Instance Distillation for Self-Supervised Fine-Grained Visual Categorization0
Generative Denoise Distillation: Simple Stochastic Noises Induce Efficient Knowledge Transfer for Dense PredictionCode0
A Deep Hierarchical Feature Sparse Framework for Occluded Person Re-Identification0
Lightweight Modality Adaptation to Sequential Recommendation via Correlation Supervision0
EVOKE: Emotion Enabled Virtual Avatar Mapping Using Optimized Knowledge Distillation0
Knowledge Distillation of Black-Box Large Language Models0
Direct Distillation between Different Domains0
An Empirical Investigation into the Effect of Parameter Choices in Knowledge Distillation0
Object-Centric Diffusion for Efficient Video Editing0
Exploring Self- and Cross-Triplet Correlations for Human-Object Interaction Detection0
Attention to detail: inter-resolution knowledge distillationCode0
Hierarchical Knowledge Distillation on Text Graph for Data-limited Attribute Inference0
Translate-Distill: Learning Cross-Language Dense Retrieval by Translation and Distillation0
Logits Poisoning Attack in Federated Distillation0
Multi-Channel Multi-Domain based Knowledge Distillation Algorithm for Sleep Staging with Single-Channel EEG0
SeqNAS: Neural Architecture Search for Event Sequence ClassificationCode0
CTC Blank Triggered Dynamic Layer-Skipping for Efficient CTC-based Speech Recognition0
Exploring Vacant Classes in Label-Skewed Federated LearningCode0
Distillation-based fabric anomaly detectionCode0
Bridging Modalities: Knowledge Distillation and Masked Training for Translating Multi-Modal Emotion Recognition to Uni-Modal, Speech-Only Emotion RecognitionCode0
Distilling Temporal Knowledge with Masked Feature Reconstruction for 3D Object Detection0
Self-supervised Reflective Learning through Self-distillation and Online Clustering for Speaker Representation Learning0
Exploring Hyperspectral Anomaly Detection with Human Vision: A Small Target Aware DetectorCode0
Distilling Local Texture Features for Colorectal Tissue Classification in Low Data RegimesCode0
Query-Based Knowledge Sharing for Open-Vocabulary Multi-Label Classification0
Dual Teacher Knowledge Distillation with Domain Alignment for Face Anti-spoofing0
Building Vision-Language Models on Solid Foundations with Masked Distillation0
Distilling CLIP with Dual Guidance for Learning Discriminative Human Body Shape Representation0
Uncertainty-Guided Never-Ending Learning to Drive0
IQ-VFI: Implicit Quadratic Motion Estimation for Video Frame Interpolation0
Curriculum-scheduled Knowledge Distillation from Multiple Pre-trained Teachers for Multi-domain Sequential RecommendationCode0
SecFormer: Fast and Accurate Privacy-Preserving Inference for Transformer Models via SMPCCode0
C2KD: Bridging the Modality Gap for Cross-Modal Knowledge Distillation0
KD-DETR: Knowledge Distillation for Detection Transformer with Consistent Distillation Points Sampling0
Robust Distillation via Untargeted and Targeted Intermediate Adversarial Samples0
Scene-adaptive and Region-aware Multi-modal Prompt for Open Vocabulary Object Detection0
Compressing Deep Image Super-resolution Models0
Explainability-Driven Leaf Disease Classification Using Adversarial Training and Knowledge Distillation0
ClST: A Convolutional Transformer Framework for Automatic Modulation Recognition by Knowledge Distillation0
Show:102550
← PrevPage 43 of 85Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ScaleKD (T:BEiT-L S:ViT-B/14)Top-1 accuracy %86.43Unverified
2ScaleKD (T:Swin-L S:ViT-B/16)Top-1 accuracy %85.53Unverified
3ScaleKD (T:Swin-L S:ViT-S/16)Top-1 accuracy %83.93Unverified
4ScaleKD (T:Swin-L S:Swin-T)Top-1 accuracy %83.8Unverified
5KD++(T: regnety-16GF S:ViT-B)Top-1 accuracy %83.6Unverified
6VkD (T:RegNety 160 S:DeiT-S)Top-1 accuracy %82.9Unverified
7SpectralKD (T:Swin-S S:Swin-T)Top-1 accuracy %82.7Unverified
8ScaleKD (T:Swin-L S:ResNet-50)Top-1 accuracy %82.55Unverified
9DiffKD (T:Swin-L S: Swin-T)Top-1 accuracy %82.5Unverified
10DIST (T: Swin-L S: Swin-T)Top-1 accuracy %82.3Unverified
#ModelMetricClaimedVerifiedStatus
1SRD (T:resnet-32x4, S:shufflenet-v2)Top-1 Accuracy (%)79.86Unverified
2shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)Top-1 Accuracy (%)78.76Unverified
3MV-MR (T: CLIP/ViT-B-16 S: resnet50)Top-1 Accuracy (%)78.6Unverified
4resnet8x4 (T: resnet32x4 S: resnet8x4)Top-1 Accuracy (%)78.28Unverified
5resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])Top-1 Accuracy (%)78.08Unverified
6ReviewKD++(T:resnet-32x4, S:shufflenet-v2)Top-1 Accuracy (%)77.93Unverified
7ReviewKD++(T:resnet-32x4, S:shufflenet-v1)Top-1 Accuracy (%)77.68Unverified
8resnet8x4 (T: resnet32x4 S: resnet8x4)Top-1 Accuracy (%)77.5Unverified
9resnet8x4 (T: resnet32x4 S: resnet8x4)Top-1 Accuracy (%)76.68Unverified
10resnet8x4 (T: resnet32x4 S: resnet8x4)Top-1 Accuracy (%)76.31Unverified
#ModelMetricClaimedVerifiedStatus
1LSHFM (T: ResNet101 S: ResNet50)mAP93.17Unverified
2LSHFM (T: ResNet101 S: MobileNetV2)mAP90.14Unverified
#ModelMetricClaimedVerifiedStatus
1TIE-KD (T: Adabins S: MobileNetV2)RMSE2.43Unverified