
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity may not be fully utilized, yet running the large model at inference time costs just as much either way. Distillation trains a compact student to mimic the large teacher's outputs, often retaining much of the teacher's accuracy at a fraction of the inference cost.
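
The classic recipe underlying many of the papers and benchmarks below is Hinton-style distillation: the student is trained on a blend of temperature-softened teacher outputs and the ground-truth labels. The following is a minimal, illustrative PyTorch sketch, assuming pre-trained teacher and student classifiers are already in hand; the function name and the hyperparameter values are placeholders for illustration, not taken from any paper listed here.

import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.7):
    # Illustrative sketch of Hinton-style distillation; names and
    # hyperparameters are placeholders, not from any specific paper.
    # Soft-target term: KL divergence between temperature-softened
    # teacher and student distributions. The T^2 factor rescales
    # gradients so the two terms stay balanced across temperatures.
    soft = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * temperature ** 2
    # Hard-label term: ordinary cross-entropy on ground-truth labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard

# One training step (teacher frozen, student updated):
# teacher.eval()
# with torch.no_grad():
#     teacher_logits = teacher(images)
# loss = distillation_loss(student(images), teacher_logits, labels)
# loss.backward()

Here alpha trades off mimicking the teacher against fitting the labels, and a higher temperature exposes more of the teacher's "dark knowledge" about inter-class similarity; for an ensemble teacher, the averaged ensemble logits can stand in for teacher_logits.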

Papers

Showing 451–500 of 4240 papers

Title | Status | Hype
Knowledge Distillation in RNN-Attention Models for Early Prediction of Student Performance | Code | 0
SCKD: Semi-Supervised Cross-Modality Knowledge Distillation for 4D Radar Object Detection | Code | 0
Enhancing Knowledge Distillation for LLMs with Response-Priming Prompting | Code | 0
Canine EEG Helps Human: Cross-Species and Cross-Modality Epileptic Seizure Detection via Multi-Space Alignment | — | 0
A Survey on Inference Optimization Techniques for Mixture of Experts Models | Code | 3
Learnable Prompting SAM-induced Knowledge Distillation for Semi-supervised Medical Image Segmentation | Code | 2
Hybrid Data-Free Knowledge Distillation | Code | 0
On Explaining Knowledge Distillation: Measuring and Visualising the Knowledge Transfer Process | — | 0
Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective | Code | 0
On the Compression of Language Models for Code: An Empirical Study on CodeBERT | — | 0
Entire-Space Variational Information Exploitation for Post-Click Conversion Rate Prediction | — | 0
Split Knowledge Distillation for Large Models in IoT: Architecture, Challenges, and Solutions | — | 0
In-Context Learning Distillation for Efficient Few-Shot Fine-Tuning | — | 0
Modality-Inconsistent Continual Learning of Multimodal Large Language Models | — | 0
Efficient Speech Command Recognition Leveraging Spiking Neural Network and Curriculum Learning-based Knowledge Distillation | — | 0
PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts | — | 0
Relation-Guided Adversarial Learning for Data-free Knowledge Transfer | Code | 1
BiM-VFI: Bidirectional Motion Field-Guided Frame Interpolation for Video with Non-uniform Motions | Code | 2
Neural Collapse Inspired Knowledge Distillation | — | 0
Active Large Language Model-based Knowledge Distillation for Session-based Recommendation | — | 0
Knowledge Migration Framework for Smart Contract Vulnerability Detection | — | 0
ProFe: Communication-Efficient Decentralized Federated Learning via Distillation and Prototypes | — | 0
Wearable Accelerometer Foundation Models for Health via Knowledge Distillation | — | 0
Leveraging Large Language Models for Active Merchant Non-player Characters | Code | 0
On Distilling the Displacement Knowledge for Few-Shot Class-Incremental Learning | — | 0
Redefining Normal: A Novel Object-Level Approach for Multi-Object Novelty Detection | Code | 0
Can Students Beyond The Teacher? Distilling Knowledge from Teacher's Bias | — | 0
LLM Distillation for Efficient Few-Shot Multiple Choice Question Answering | — | 0
ScaleOT: Privacy-utility-scalable Offsite-tuning with Dynamic LayerReplace and Selective Rank Compression | — | 0
Dynamic Contrastive Knowledge Distillation for Efficient Image Restoration | Code | 1
Optimising TinyML with Quantization and Distillation of Transformer and Mamba Models for Indoor Localisation on Edge Devices | — | 0
Multimodal Industrial Anomaly Detection by Crossmodal Reverse Distillation | Code | 0
All You Need in Knowledge Distillation Is a Tailored Coordinate System | — | 0
SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training | — | 0
DASK: Distribution Rehearsing via Adaptive Style Kernel Learning for Exemplar-Free Lifelong Person Re-Identification | Code | 0
A Theoretical Analysis of Soft-Label vs Hard-Label Training in Neural Networks | — | 0
Efficient Gravitational Wave Parameter Estimation via Knowledge Distillation: A ResNet1D-IAF Approach | — | 0
Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation | Code | 2
DAKD: Data Augmentation and Knowledge Distillation using Diffusion Models for SAR Oil Spill Segmentation | — | 0
Cloud Object Detector Adaptation by Integrating Different Source Knowledge | Code | 1
TT-MPD: Test Time Model Pruning and Distillation | — | 0
Unlocking the Potential of Reverse Distillation for Anomaly Detection | Code | 1
FM2DS: Few-Shot Multimodal Multihop Data Synthesis with Knowledge Distillation for Question Answering | Code | 0
U-Know-DiffPAN: An Uncertainty-aware Knowledge Distillation Diffusion Framework with Details Enhancement for PAN-Sharpening | — | 0
Domain-Specific Translation with Open-Source Large Language Models: Resource-Oriented Analysis | — | 0
Enhancing Content Representation for AR Image Quality Assessment Using Knowledge Distillation | — | 0
Neighborhood Commonality-aware Evolution Network for Continuous Generalized Category Discovery | Code | 0
CCS: Continuous Learning for Customized Incremental Wireless Sensing Services | — | 0
BEExformer: A Fast Inferencing Transformer Architecture via Binarization with Multiple Early Exits | — | 0
One-shot Federated Learning via Synthetic Distiller-Distillate Communication | Code | 1
Page 10 of 85

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | — | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | — | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | — | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | — | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | — | Unverified
6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | — | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | — | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | — | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | — | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | — | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | — | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | — | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | — | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | — | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | — | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | — | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | — | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | — | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | — | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | — | Unverified