
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have greater knowledge capacity than small models, that capacity is often not fully utilized, so a compact student trained to mimic the large teacher's outputs can recover much of its accuracy at a fraction of the cost. A minimal sketch of the standard objective follows.
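As a concrete illustration (independent of any particular paper listed below), here is a minimal PyTorch sketch of the classic soft-label distillation objective of Hinton et al. (2015): the student minimizes a weighted sum of a temperature-softened KL term against the teacher and the usual cross-entropy against the hard labels. The toy models and the hyperparameters `T` and `alpha` are illustrative placeholders, not values from any benchmark on this page.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Hinton-style KD loss: KL divergence between temperature-softened
    teacher/student distributions, mixed with cross-entropy on hard labels."""
    # Soft targets: temperature T flattens both distributions; the T*T factor
    # keeps gradient magnitudes comparable across temperatures.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    # Hard targets: standard supervised loss on the ground-truth labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard

# Toy usage with stand-in models (both hypothetical):
teacher = nn.Sequential(nn.Linear(32, 128), nn.ReLU(), nn.Linear(128, 10))
student = nn.Sequential(nn.Linear(32, 10))

x = torch.randn(8, 32)
labels = torch.randint(0, 10, (8,))
with torch.no_grad():            # the teacher is frozen during distillation
    t_logits = teacher(x)
s_logits = student(x)
loss = distillation_loss(s_logits, t_logits, labels)
loss.backward()
```

In practice `alpha` and `T` are tuned per task; higher temperatures expose more of the teacher's "dark knowledge" about relative class similarities.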

Papers

Showing 2001–2050 of 4240 papers

Title | Status | Hype
An Empirical Study of Leveraging Knowledge Distillation for Compressing Multilingual Neural Machine Translation Models | — | 0
Biologically inspired structure learning with reverse knowledge distillation for spiking neural networks | — | 0
Deep Collective Knowledge Distillation | — | 0
LaSNN: Layer-wise ANN-to-SNN Distillation for Effective and Efficient Training in Deep Spiking Neural Networks | — | 0
Always Strengthen Your Strengths: A Drift-Aware Incremental Learning Framework for CTR Prediction | — | 0
OVTrack: Open-Vocabulary Multiple Object Tracking | Code | 1
Learning to "Segment Anything" in Thermal Infrared Images through Knowledge Distillation with a Large Scale Dataset SATIR | Code | 0
Robust Cross-Modal Knowledge Distillation for Unconstrained Videos | Code | 1
Teacher Network Calibration Improves Cross-Quality Knowledge Distillation | Code | 0
Learn What Is Possible, Then Choose What Is Best: Disentangling One-To-Many Relations in Language Through Text-based Games | Code | 0
Multi-Mode Online Knowledge Distillation for Self-Supervised Visual Representation Learning | Code | 1
Class-Incremental Learning of Plant and Disease Detection: Growing Branches with Knowledge Distillation | — | 0
Constructing Deep Spiking Neural Networks from Artificial Neural Networks with Knowledge Distillation | — | 0
SFT-KD-Recon: Learning a Student-friendly Teacher for Knowledge Distillation in Magnetic Resonance Image Reconstruction | Code | 0
Grouped Knowledge Distillation for Deep Face Recognition | — | 0
A Survey on Recent Teacher-student Learning Studies | — | 0
HyperINR: A Fast and Predictive Hypernetwork for Implicit Neural Representations via Knowledge Distillation | — | 0
Homogenizing Non-IID datasets via In-Distribution Knowledge Distillation for Decentralized Learning | — | 0
A Comprehensive Survey on Knowledge Distillation of Diffusion Models | — | 0
Model-Agnostic Decentralized Collaborative Learning for On-Device POI Recommendation | — | 0
Continual Learning for LiDAR Semantic Segmentation: Class-Incremental and Coarse-to-Fine strategies on Sparse Data | Code | 1
Masked Student Dataset of Expressions | Code | 0
Continual Detection Transformer for Incremental Object Detection | — | 0
DiGA: Distil to Generalize and then Adapt for Domain Adaptive Semantic Segmentation | Code | 1
Towards Efficient Task-Driven Model Reprogramming with Foundation Models | — | 0
Self-Distillation for Gaussian Process Regression and Classification | Code | 0
MadEye: Boosting Live Video Analytics Accuracy with Adaptive Camera Configurations | — | 0
Selective Knowledge Sharing for Privacy-Preserving Federated Distillation without A Good Teacher | Code | 1
Cross-Class Feature Augmentation for Class Incremental Learning | — | 0
Knowledge-Distilled Graph Neural Networks for Personalized Epileptic Seizure Detection | — | 0
Vision-Language Models for Vision Tasks: A Survey | Code | 4
Domain Generalization for Crop Segmentation with Standardized Ensemble Knowledge Distillation | Code | 0
A Unified Compression Framework for Efficient Speech-Driven Talking-Face Generation | — | 0
Selective Knowledge Distillation for Non-Autoregressive Neural Machine Translation | — | 0
Quick Dense Retrievers Consume KALE: Post Training Kullback Leibler Alignment of Embeddings for Asymmetrical dual encoders | — | 0
GVP: Generative Volumetric Primitives | — | 0
Knowledge Distillation for Feature Extraction in Underwater VSLAM | Code | 1
oBERTa: Improving Sparse Transfer Learning via improved initialization, distillation, and pruning regimes | — | 0
If At First You Don't Succeed: Test Time Re-ranking for Zero-shot, Cross-domain Retrieval | — | 0
Kaizen: Practical Self-supervised Continual Learning with Continual Fine-tuning | Code | 1
KD-DLGAN: Data Limited Image Generation via Knowledge Distillation | — | 0
Asymmetric Image Retrieval with Cross Model Compatible Ensembles | — | 0
SimDistill: Simulated Multi-modal Distillation for BEV 3D Object Detection | Code | 1
Dice Semimetric Losses: Optimizing the Dice Score with Soft Labels | Code | 1
Information-Theoretic GAN Compression with Variational Energy-based Model | — | 0
HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models | Code | 1
SELF-VS: Self-supervised Encoding Learning For Video Summarization | Code | 0
Projected Latent Distillation for Data-Agnostic Consolidation in Distributed Continual Learning | Code | 0
DisWOT: Student Architecture Search for Distillation WithOut Training | Code | 1
Improving Neural Topic Models with Wasserstein Knowledge Distillation | Code | 0
Page 41 of 85

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | — | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | — | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | — | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | — | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | — | Unverified
6 | VkD (T: RegNety-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | — | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | — | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | — | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | — | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | — | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | — | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | — | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | — | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | — | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | — | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | — | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | — | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | — | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | — | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | — | Unverified