SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.
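As a rough illustration of what "transferring knowledge" can mean in practice, the sketch below implements one common formulation: the student is trained to match the teacher's temperature-softened output distribution by minimizing a KL divergence. This formulation, the function names `softmax` and `distillation_loss`, the temperature value, and the example logits are all assumptions chosen for illustration; they are not taken from this page or from any particular paper listed here.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of floats."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by T^2.

    Hypothetical helper: the T^2 factor is a common convention for keeping
    gradient magnitudes comparable across temperatures, assumed here.
    """
    p = softmax(teacher_logits, temperature)  # soft targets from the teacher
    q = softmax(student_logits, temperature)  # student predictions
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return (temperature ** 2) * kl


# Made-up logits for a 3-class problem.
teacher = [4.0, 1.0, 0.5]
student_close = [3.5, 1.2, 0.4]  # roughly agrees with the teacher
student_far = [0.5, 1.0, 4.0]    # disagrees with the teacher

loss_close = distillation_loss(teacher, student_close)
loss_far = distillation_loss(teacher, student_far)
print(loss_close < loss_far)
```

In a real training loop this term is usually combined with an ordinary supervised loss on the ground-truth labels; the weighting between the two is a tunable choice and is not specified here.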

Papers

Showing 1351–1400 of 4240 papers

Title | Status | Hype
Deep Clustering with Diffused Sampling and Hardness-aware Self-distillation | Code | 0
Communication-Efficient Federated Learning through Adaptive Weight Clustering and Server-Side Distillation | Code | 1
Self-supervised Video Object Segmentation with Distillation Learning of Deformable Attention |  | 0
Towards Complementary Knowledge Distillation for Efficient Dense Image Prediction |  | 0
Contrastive Learning in Distilled Models | Code | 0
Knowledge Distillation from Language-Oriented to Emergent Communication for Multi-Agent Remote Control |  | 0
A Novel Garment Transfer Method Supervised by Distilled Knowledge of Virtual Try-on Model |  | 0
Stereo-Matching Knowledge Distilled Monocular Depth Estimation Filtered by Multiple Disparity Consistency |  | 0
Knowledge Distillation on Spatial-Temporal Graph Convolutional Network for Traffic Prediction |  | 0
Robustness to distribution shifts of compressed networks for edge devices |  | 0
Rethinking Centered Kernel Alignment in Knowledge Distillation | Code | 1
Zoom-shot: Fast and Efficient Unsupervised Zero-Shot Transfer of CLIP to Vision Encoders with Multimodal Loss |  | 0
Keep Decoding Parallel with Effective Knowledge Distillation from Language Models to End-to-end Speech Recognisers |  | 0
Confidence Preservation Property in Knowledge Distillation Abstractions |  | 0
HiCD: Change Detection in Quality-Varied Images via Hierarchical Correlation Distillation | Code | 1
Enhancing Scalability in Recommender Systems through Lottery Ticket Hypothesis and Knowledge Distillation-based Neural Network Pruning |  | 0
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition | Code | 2
Model Compression Techniques in Biometrics Applications: A Survey | Code | 0
TelME: Teacher-leading Multimodal Fusion Network for Emotion Recognition in Conversation | Code | 1
Bayes Conditional Distribution Estimation for Knowledge Distillation Based on Conditional Mutual Information | Code | 1
Cross-Level Multi-Instance Distillation for Self-Supervised Fine-Grained Visual Categorization |  | 0
OBSeg: Accurate and Fast Instance Segmentation Framework Using Segmentation Foundation Models with Oriented Bounding Box Prompts | Code | 2
Generative Denoise Distillation: Simple Stochastic Noises Induce Efficient Knowledge Transfer for Dense Prediction | Code | 0
A Deep Hierarchical Feature Sparse Framework for Occluded Person Re-Identification |  | 0
Lightweight Modality Adaptation to Sequential Recommendation via Correlation Supervision |  | 0
Knowledge Distillation of Black-Box Large Language Models |  | 0
EVOKE: Emotion Enabled Virtual Avatar Mapping Using Optimized Knowledge Distillation |  | 0
Direct Distillation between Different Domains |  | 0
An Empirical Investigation into the Effect of Parameter Choices in Knowledge Distillation |  | 0
Graph Relation Distillation for Efficient Biomedical Instance Segmentation | Code | 1
Exploring Self- and Cross-Triplet Correlations for Human-Object Interaction Detection |  | 0
Attention to detail: inter-resolution knowledge distillation | Code | 0
Object-Centric Diffusion for Efficient Video Editing |  | 0
Hierarchical Knowledge Distillation on Text Graph for Data-limited Attribute Inference |  | 0
Translate-Distill: Learning Cross-Language Dense Retrieval by Translation and Distillation |  | 0
Logits Poisoning Attack in Federated Distillation |  | 0
Multi-Channel Multi-Domain based Knowledge Distillation Algorithm for Sleep Staging with Single-Channel EEG |  | 0
SeqNAS: Neural Architecture Search for Event Sequence Classification | Code | 0
Progressive Knowledge Distillation Of Stable Diffusion XL Using Layer Level Loss | Code | 2
Bridging Modalities: Knowledge Distillation and Masked Training for Translating Multi-Modal Emotion Recognition to Uni-Modal, Speech-Only Emotion Recognition | Code | 0
Distillation-based fabric anomaly detection | Code | 0
Exploring Vacant Classes in Label-Skewed Federated Learning | Code | 0
CTC Blank Triggered Dynamic Layer-Skipping for Efficient CTC-based Speech Recognition |  | 0
Distilling Temporal Knowledge with Masked Feature Reconstruction for 3D Object Detection |  | 0
Self-supervised Reflective Learning through Self-distillation and Online Clustering for Speaker Representation Learning |  | 0
Exploring Hyperspectral Anomaly Detection with Human Vision: A Small Target Aware Detector | Code | 0
HAAQI-Net: A Non-intrusive Neural Music Audio Quality Assessment Model for Hearing Aids | Code | 1
Query-Based Knowledge Sharing for Open-Vocabulary Multi-Label Classification |  | 0
Dual Teacher Knowledge Distillation with Domain Alignment for Face Anti-spoofing |  | 0
Distilling Local Texture Features for Colorectal Tissue Classification in Low Data Regimes | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T:BEiT-L S:ViT-B/14) | Top-1 accuracy % | 86.43 |  | Unverified
2 | ScaleKD (T:Swin-L S:ViT-B/16) | Top-1 accuracy % | 85.53 |  | Unverified
3 | ScaleKD (T:Swin-L S:ViT-S/16) | Top-1 accuracy % | 83.93 |  | Unverified
4 | ScaleKD (T:Swin-L S:Swin-T) | Top-1 accuracy % | 83.8 |  | Unverified
5 | KD++(T: regnety-16GF S:ViT-B) | Top-1 accuracy % | 83.6 |  | Unverified
6 | VkD (T:RegNety 160 S:DeiT-S) | Top-1 accuracy % | 82.9 |  | Unverified
7 | SpectralKD (T:Swin-S S:Swin-T) | Top-1 accuracy % | 82.7 |  | Unverified
8 | ScaleKD (T:Swin-L S:ResNet-50) | Top-1 accuracy % | 82.55 |  | Unverified
9 | DiffKD (T:Swin-L S: Swin-T) | Top-1 accuracy % | 82.5 |  | Unverified
10 | DIST (T: Swin-L S: Swin-T) | Top-1 accuracy % | 82.3 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T:resnet-32x4, S:shufflenet-v2) | Top-1 Accuracy (%) | 79.86 |  | Unverified
2 | shufflenet-v2(T:resnet-32x4, S:shufflenet-v2) | Top-1 Accuracy (%) | 78.76 |  | Unverified
3 | MV-MR (T: CLIP/ViT-B-16 S: resnet50) | Top-1 Accuracy (%) | 78.6 |  | Unverified
4 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 Accuracy (%) | 78.28 |  | Unverified
5 | resnet8x4 (T: resnet32x4 S: resnet8x4 [modified]) | Top-1 Accuracy (%) | 78.08 |  | Unverified
6 | ReviewKD++(T:resnet-32x4, S:shufflenet-v2) | Top-1 Accuracy (%) | 77.93 |  | Unverified
7 | ReviewKD++(T:resnet-32x4, S:shufflenet-v1) | Top-1 Accuracy (%) | 77.68 |  | Unverified
8 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 Accuracy (%) | 77.5 |  | Unverified
9 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 Accuracy (%) | 76.68 |  | Unverified
10 | resnet8x4 (T: resnet32x4 S: resnet8x4) | Top-1 Accuracy (%) | 76.31 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101 S: ResNet50) | mAP | 93.17 |  | Unverified
2 | LSHFM (T: ResNet101 S: MobileNetV2) | mAP | 90.14 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins S: MobileNetV2) | RMSE | 2.43 |  | Unverified