Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized. Distillation therefore trains a compact "student" model to mimic the behavior of a larger "teacher", retaining most of the teacher's accuracy at a fraction of the inference cost; in the benchmark tables below, "T:" and "S:" denote the teacher and student architectures for each result.
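
As a concrete illustration, the sketch below shows the classic soft-target distillation loss of Hinton et al. (2015) in PyTorch: the student is trained to match the teacher's temperature-softened output distribution alongside the usual cross-entropy on hard labels. The temperature, loss weight, and tensor shapes are illustrative assumptions, not values drawn from any paper listed below.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.9):
    """Soft-target KD loss blended with ordinary cross-entropy."""
    # Teacher logits are detached: gradients must only update the student.
    soft_teacher = F.log_softmax(teacher_logits.detach() / temperature, dim=-1)
    soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    # KL divergence between the softened distributions; the T^2 factor keeps
    # gradient magnitudes comparable across temperatures, as in the original paper.
    kd_term = F.kl_div(soft_student, soft_teacher,
                       reduction="batchmean", log_target=True) * temperature ** 2
    # Standard supervised term on the ground-truth labels.
    ce_term = F.cross_entropy(student_logits, labels)
    return alpha * kd_term + (1.0 - alpha) * ce_term

if __name__ == "__main__":
    # Toy check with random logits: batch of 8, 100 classes.
    student_logits = torch.randn(8, 100)
    teacher_logits = torch.randn(8, 100)
    labels = torch.randint(0, 100, (8,))
    print(distillation_loss(student_logits, teacher_logits, labels))
```

In practice the teacher runs in eval mode under torch.no_grad() to produce its logits, and alpha is tuned per task; many of the papers listed below replace or augment this logit-matching term with feature-level or attention-based objectives.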

Papers

Showing 1451–1500 of 4240 papers

Title | Status | Hype
FastSR-NeRF: Improving NeRF Efficiency on Consumer Devices with A Simple Super-Resolution Pipeline | | 0
Student as an Inherent Denoiser of Noisy Teacher | | 0
MobileSAMv2: Faster Segment Anything to Everything | Code | 5
Faster Diffusion: Rethinking the Role of the Encoder for Diffusion Model Inference | Code | 2
WAVER: Writing-style Agnostic Text-Video Retrieval via Distilling Vision-Language Models Through Open-Vocabulary Knowledge | Code | 0
Efficient speech detection in environmental audio using acoustic recognition and knowledge distillation | | 0
COMBHelper: A Neural Approach to Reduce Search Space for Graph Combinatorial Problems | Code | 0
Generative Model-based Feature Knowledge Distillation for Action Recognition | Code | 1
Unraveling Key Factors of Knowledge Distillation | | 0
SKDF: A Simple Knowledge Distillation Framework for Distilling Open-Vocabulary Knowledge to Open-world Object Detector | Code | 1
CLIP-guided Federated Learning on Heterogeneous and Long-Tailed Data | Code | 1
RdimKD: Generic Distillation Paradigm by Dimensionality Reduction | | 0
RankDVQA-mini: Knowledge Distillation-Driven Deep Video Quality Assessment | | 0
Fast Sampling Through The Reuse Of Attention Maps In Diffusion Models | | 0
Cooperative Learning for Cost-Adaptive Inference | | 0
KDAS: Knowledge Distillation via Attention Supervision Framework for Polyp Segmentation | Code | 1
Mutual-Learning Knowledge Distillation for Nighttime UAV Tracking | Code | 0
Traffic Signal Control Using Lightweight Transformers: An Offline-to-Online RL Approach | Code | 1
A dynamic interactive learning framework for automated 3D medical image segmentation | | 0
NovaCOMET: Open Commonsense Foundation Models with Symbolic Knowledge Distillation | | 0
Fake It Till Make It: Federated Learning with Consensus-Oriented Generation | | 0
IL-NeRF: Incremental Learning for Neural Radiance Fields with Camera Pose Alignment | | 0
Understanding the Effect of Model Compression on Social Bias in Large Language Models | Code | 0
Improving Adversarial Robust Fairness via Anti-Bias Soft Label Distillation | Code | 0
Localized Symbolic Knowledge Distillation for Visual Commonsense Models | Code | 0
Language Model Knowledge Distillation for Efficient Question Answering in Spanish | Code | 0
KOALA: Empirical Lessons Toward Memory-Efficient and Fast Diffusion Models for Text-to-Image Synthesis | | 0
Augmentation-Free Dense Contrastive Knowledge Distillation for Efficient Semantic Segmentation | Code | 1
Combining inherent knowledge of vision-language models with unsupervised domain adaptation through strong-weak guidance | Code | 0
Synchronization is All You Need: Exocentric-to-Egocentric Transfer for Temporal Action Segmentation with Unlabeled Synchronized Video Pairs | Code | 0
Contrastive Learning-Based Spectral Knowledge Distillation for Multi-Modality and Missing Modality Scenarios in Semantic Segmentation | | 0
TriDeNT: Triple Deep Network Training for Privileged Knowledge Distillation in Histopathology | | 0
OplixNet: Towards Area-Efficient Optical Split-Complex Networks with Real-to-Complex Data Assignment and Knowledge Distillation | | 0
Enhancing and Adapting in the Clinic: Source-free Unsupervised Domain Adaptation for Medical Image Enhancement | Code | 1
S2P3: Self-Supervised Polarimetric Pose Prediction | | 0
Dual-Teacher De-biasing Distillation Framework for Multi-domain Fake News Detection | Code | 1
Generalized Robot 3D Vision-Language Model with Fast Rendering and Pre-Training Vision-Language Alignment | Code | 3
Compression of end-to-end non-autoregressive image-to-speech system for low-resourced devices | | 0
IAG: Induction-Augmented Generation Framework for Answering Reasoning Questions | | 0
Initializing Models with Larger Ones | Code | 1
LayerCollapse: Adaptive compression of neural networks | | 0
The Devil is in the Data: Learning Fair Graph Neural Networks via Partial Knowledge Distillation | Code | 0
Continual Learning for Image Segmentation with Dynamic Query | Code | 1
Propagate & Distill: Towards Effective Graph Learners Using Propagation-Embracing MLPs | | 0
PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation | Code | 1
LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS | Code | 2
FedAL: Black-Box Federated Knowledge Distillation Enabled by Adversarial Learning | | 0
DiffusionTalker: Personalization and Acceleration for Speech-Driven 3D Face Diffuser | | 0
Rethinking Intermediate Layers design in Knowledge Distillation for Kidney and Liver Tumor Segmentation | Code | 0
UFIN: Universal Feature Interaction Network for Multi-Domain Click-Through Rate Prediction | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | | Unverified
6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | | Unverified
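
Most entries above report Top-1 accuracy: the percentage of test samples for which the model's highest-scoring class matches the ground-truth label (the mAP and RMSE rows use task-specific detection and regression protocols instead). Below is a minimal PyTorch sketch of that evaluation loop; the model and data loader are generic placeholders, not artifacts from any benchmark above.

```python
import torch

@torch.no_grad()
def top1_accuracy(model, loader, device="cpu"):
    """Percentage of samples whose argmax prediction matches the label."""
    model.eval()
    correct, total = 0, 0
    for inputs, labels in loader:
        # Highest-scoring class per sample is the Top-1 prediction.
        preds = model(inputs.to(device)).argmax(dim=-1)
        correct += (preds.cpu() == labels).sum().item()
        total += labels.numel()
    return 100.0 * correct / total
```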