
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model (the teacher) to a smaller one (the student). While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, that capacity may not be fully utilized, so a well-trained student can often recover much of the teacher's accuracy at a fraction of the inference cost.
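
Below is a minimal sketch of the classic response-based recipe (Hinton et al., 2015) that much of the work listed on this page builds on: the student is trained to match the teacher's temperature-softened output distribution alongside the usual hard labels. The hyperparameter defaults (T, alpha) and the function name are illustrative, not taken from any specific paper listed here.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Soft-target KL term plus hard-label cross-entropy (Hinton et al., 2015)."""
    # Soften both distributions with temperature T. The T*T factor restores
    # the gradient scale that dividing the logits by T would otherwise shrink.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard

# In a training step, only the student receives gradients:
#   with torch.no_grad():
#       teacher_logits = teacher(x)
#   loss = distillation_loss(student(x), teacher_logits, y)
```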

Papers

Showing 2951–3000 of 4240 papers

Title | Status | Hype
Compressing VAE-Based Out-of-Distribution Detectors for Embedded Deployment | — | 0
Compressing Visual-linguistic Model via Knowledge Distillation | — | 0
Compression of Acoustic Event Detection Models With Quantized Distillation | — | 0
Compression of Deep Learning Models for Text: A Survey | — | 0
Compression of end-to-end non-autoregressive image-to-speech system for low-resourced devices | — | 0
ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval | — | 0
ConceptDistil: Model-Agnostic Distillation of Concept Explanations | — | 0
Condensed Sample-Guided Model Inversion for Knowledge Distillation | — | 0
Conditional Autoregressors are Interpretable Classifiers | — | 0
Conditional Generative Data-free Knowledge Distillation | — | 0
Confidence Attention and Generalization Enhanced Distillation for Continuous Video Domain Adaptation | — | 0
Confidence Based Bidirectional Global Context Aware Training Framework for Neural Machine Translation | — | 0
Confidence Conditioned Knowledge Distillation | — | 0
Confidence Preservation Property in Knowledge Distillation Abstractions | — | 0
Configurable Holography: Towards Display and Scene Adaptation | — | 0
Conformer with dual-mode chunked attention for joint online and offline ASR | — | 0
Constructing Deep Spiking Neural Networks from Artificial Neural Networks with Knowledge Distillation | — | 0
Contextual Affinity Distillation for Image Anomaly Detection | — | 0
Contextual Distillation Model for Diversified Recommendation | — | 0
Contextualized Attention-based Knowledge Transfer for Spoken Conversational Question Answering | — | 0
Contextual Knowledge Distillation for Transformer Compression | — | 0
Continual Detection Transformer for Incremental Object Detection | — | 0
Continual Distillation Learning: Knowledge Distillation in Prompt-based Continual Learning | — | 0
Continual Face Forgery Detection via Historical Distribution Preserving | — | 0
Continual Learning for Class- and Domain-Incremental Semantic Segmentation | — | 0
Continual Learning for Fake Audio Detection | — | 0
Continual Learning for Neural Machine Translation | — | 0
Unsupervised Continual Learning Via Pseudo Labels | — | 0
Continual Learning with Diffusion-based Generative Replay for Industrial Streaming Data | — | 0
Continual Learning with Dirichlet Generative-based Rehearsal | — | 0
Continual Segment: Towards a Single, Unified and Accessible Continual Segmentation Model of 143 Whole-body Organs in CT Scans | — | 0
Continual Segment: Towards a Single, Unified and Non-forgetting Continual Segmentation Model of 143 Whole-body Organs in CT Scans | — | 0
Continual Self-Supervised Learning with Masked Autoencoders in Remote Sensing | — | 0
Continuation KD: Improved Knowledge Distillation through the Lens of Continuation Optimization | — | 0
Continuous Concepts Removal in Text-to-image Diffusion Models | — | 0
Continuous sign language recognition based on cross-resolution knowledge distillation | — | 0
Contrastive Continual Multi-view Clustering with Filtered Structural Fusion | — | 0
Contrastive Learning-Based Spectral Knowledge Distillation for Multi-Modality and Missing Modality Scenarios in Semantic Segmentation | — | 0
Contrastive Representation Distillation via Multi-Scale Feature Decoupling | — | 0
Contrast R-CNN for Continual Learning in Object Detection | — | 0
Contrast-reconstruction Representation Learning for Self-supervised Skeleton-based Action Recognition | — | 0
Controlling the Quality of Distillation in Response-Based Network Compression | — | 0
Control Policy Correction Framework for Reinforcement Learning-based Energy Arbitrage Strategies | — | 0
Convolutional Neural Network Compression through Generalized Kronecker Product Decomposition | — | 0
Cooperative Denoising for Distantly Supervised Relation Extraction | — | 0
Cooperative Learning for Cost-Adaptive Inference | — | 0
Coordinating Cross-modal Distillation for Molecular Property Prediction | — | 0
ChromaDistill: Colorizing Monochrome Radiance Fields with Knowledge Distillation | — | 0
CoroNetGAN: Controlled Pruning of GANs via Hypernetworks | — | 0
Corrected with the Latest Version: Make Robust Asynchronous Federated Learning Possible | — | 0
Page 60 of 85

Benchmark Results

In the tables below, "T:" names the teacher model and "S:" the student. "Claimed" is the metric value reported by the authors; the "Verified" column is shown as "—" where no independently verified value has been recorded, matching the "Unverified" status.

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | — | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | — | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | — | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | — | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | — | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | — | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | — | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | — | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | — | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | — | Unverified
2 | shufflenet-v2 (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | — | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | — | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | — | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | — | Unverified
6 | ReviewKD++ (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | — | Unverified
7 | ReviewKD++ (T: resnet32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | — | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | — | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | — | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | — | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: AdaBins, S: MobileNetV2) | RMSE (lower is better) | 2.43 | — | Unverified
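
The Top-1 accuracy figures above count a prediction as correct only when the model's single highest-scoring class matches the ground-truth label. As a minimal illustration of how such a number is computed (this is not the evaluation harness used by this site; `model` and `loader` are placeholder names for a distilled student and its evaluation set):

```python
import torch

@torch.no_grad()
def top1_accuracy(model, loader, device="cpu"):
    model.eval()
    correct, total = 0, 0
    for images, labels in loader:
        logits = model(images.to(device))
        # Top-1: take the argmax class per sample and compare with the label.
        preds = logits.argmax(dim=-1).cpu()
        correct += (preds == labels).sum().item()
        total += labels.numel()
    return 100.0 * correct / total  # percentage, as reported in the tables
```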