
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized; a compact student model trained to mimic the large teacher can therefore often retain much of its accuracy at a fraction of the inference cost.
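
To make the mechanics concrete, below is a minimal sketch of the classic soft-target distillation loss of Hinton et al. (2015) in PyTorch. The temperature T and mixing weight alpha are illustrative hyperparameters, not values drawn from any paper listed on this page.

    import torch
    import torch.nn.functional as F

    def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
        """Soft-target knowledge distillation loss (Hinton et al., 2015)."""
        # Soften both output distributions with temperature T.
        soft_teacher = F.softmax(teacher_logits / T, dim=-1)
        log_soft_student = F.log_softmax(student_logits / T, dim=-1)

        # KL term between teacher and student, scaled by T^2 so gradient
        # magnitudes stay comparable as T changes, per the original paper.
        kd_term = F.kl_div(log_soft_student, soft_teacher,
                           reduction="batchmean") * (T * T)

        # Standard cross-entropy against the ground-truth labels.
        ce_term = F.cross_entropy(student_logits, labels)

        return alpha * kd_term + (1.0 - alpha) * ce_term

In practice the teacher runs in eval mode with gradients disabled, and only the student's parameters are updated through this loss.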

Papers

Showing 3201–3250 of 4240 papers

Title | Status | Hype
Domain Adaptation for Dense Retrieval through Self-Supervision by Pseudo-Relevance Labeling | - | 0
Domain Adaptive Hand Keypoint and Pixel Localization in the Wild | - | 0
Domain-Agnostic Clustering with Self-Distillation | - | 0
Domain Discrepancy Aware Distillation for Model Aggregation in Federated Learning | - | 0
Domain Generalization on Efficient Acoustic Scene Classification using Residual Normalization | - | 0
Domain-invariant Feature Exploration for Domain Generalization | - | 0
Domain-invariant Progressive Knowledge Distillation for UAV-based Object Detection | - | 0
Domain Knowledge Distillation from Large Language Model: An Empirical Study in the Autonomous Driving Domain | - | 0
Domain-specific knowledge distillation yields smaller and better models for conversational commerce | - | 0
Domain-Specific Translation with Open-Source Large Language Models: Resource-Oriented Analysis | - | 0
DONNAv2 -- Lightweight Neural Architecture Search for Vision tasks | - | 0
Do Not Blindly Imitate the Teacher: Using Perturbed Loss for Knowledge Distillation | - | 0
Do Not Forget to Attend to Uncertainty while Mitigating Catastrophic Forgetting | - | 0
Don't be picky, all students in the right family can learn from good teachers | - | 0
Don't Throw Away Data: Better Sequence Knowledge Distillation | - | 0
DONUT-hole: DONUT Sparsification by Harnessing Knowledge and Optimizing Learning Efficiency | - | 0
Doodle It Yourself: Class Incremental Learning by Drawing a Few Sketches | - | 0
Double Reverse Regularization Network Based on Self-Knowledge Distillation for SAR Object Classification | - | 0
Double Similarity Distillation for Semantic Image Segmentation | - | 0
Do We Really Need a Complex Agent System? Distill Embodied Agent into a Single Model | - | 0
DreamTeacher: Pretraining Image Backbones with Deep Generative Models | - | 0
DRKF: Distilled Rotated Kernel Fusion for Efficient Rotation Invariant Descriptors in Local Feature Matching | - | 0
DS3-Net: Difficulty-perceived Common-to-T1ce Semi-Supervised Multimodal MRI Synthesis Network | - | 0
DSFormer: Effective Compression of Text-Transformers by Dense-Sparse Weight Factorization | - | 0
DSPNet: Towards Slimmable Pretrained Networks based on Discriminative Self-supervised Learning | - | 0
DST: Dynamic Substitute Training for Data-free Black-box Attack | - | 0
DS-ViT: Dual-Stream Vision Transformer for Cross-Task Distillation in Alzheimer's Early Diagnosis | - | 0
DTCM: Deep Transformer Capsule Mutual Distillation for Multivariate Time Series Classification | - | 0
Dual Discriminator Adversarial Distillation for Data-free Model Compression | - | 0
Dual Embodied-Symbolic Concept Representations for Deep Learning | - | 0
Dual-Head Knowledge Distillation: Enhancing Logits Utilization with an Auxiliary Head | - | 0
Dual Knowledge Distillation for Efficient Sound Event Detection | - | 0
Dual-Modeling Decouple Distillation for Unsupervised Anomaly Detection | - | 0
Dual Scale-aware Adaptive Masked Knowledge Distillation for Object Detection | - | 0
Dual-Student Knowledge Distillation Networks for Unsupervised Anomaly Detection | - | 0
Dual-Teacher Class-Incremental Learning With Data-Free Generative Replay | - | 0
Dual-Teacher: Integrating Intra-domain and Inter-domain Teachers for Annotation-efficient Cardiac Segmentation | - | 0
Dual Teacher Knowledge Distillation with Domain Alignment for Face Anti-spoofing | - | 0
DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion | - | 0
DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding | - | 0
DuckSegmentation: A segmentation model based on the AnYue Hemp Duck Dataset | - | 0
DVFL: A Vertical Federated Learning Method for Dynamic Data | - | 0
DyLiN: Making Light Field Networks Dynamic | - | 0
Dynamic Activation with Knowledge Distillation for Energy-Efficient Spiking NN Ensembles | - | 0
Dynamically pruning segformer for efficient semantic segmentation | - | 0
DynamicKD: An Effective Knowledge Distillation via Dynamic Entropy Correction-Based Distillation for Gap Optimizing | - | 0
Dynamic Knowledge Distillation for Black-box Hypothesis Transfer Learning | - | 0
Dynamic Knowledge Distillation With Noise Elimination for RGB-D Salient Object Detection | - | 0
Dynamic Low-Resolution Distillation for Cost-Efficient End-to-End Text Spotting | - | 0
Dynamics-Adaptive Continual Reinforcement Learning via Progressive Contextualization | - | 0
Page 65 of 85

Benchmark Results

Each entry below names a distillation method together with its teacher (T:) and student (S:) architectures. The Claimed column is the number reported by the authors; the Verified column is empty for results that have not yet been independently reproduced (Status: Unverified).

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | - | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | - | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | - | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | - | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | - | Unverified
6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | - | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | - | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | - | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | - | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | - | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | - | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | - | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | - | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | - | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | - | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | - | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | - | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | - | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | - | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | - | Unverified
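
For context on the metrics above, here is a minimal sketch of how a claimed Top-1 accuracy figure would typically be re-computed during verification, assuming a PyTorch model and a validation DataLoader; the function name and arguments are illustrative, not part of any listed method.

    import torch

    @torch.no_grad()
    def top1_accuracy(model, loader, device="cpu"):
        # Fraction of samples whose highest-scoring class matches the label.
        model.eval()
        correct, total = 0, 0
        for images, labels in loader:
            images, labels = images.to(device), labels.to(device)
            preds = model(images).argmax(dim=-1)
            correct += (preds == labels).sum().item()
            total += labels.numel()
        return 100.0 * correct / total

A claim such as 82.3 Top-1 is then checked by running the released student checkpoint through this kind of loop on the benchmark's validation split.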