SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity is often not fully utilized, so a compact student trained to mimic the large teacher can retain much of its accuracy at a fraction of the inference cost.
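
For concreteness, the classic recipe from Hinton et al. (2015) trains the student to match the teacher's temperature-softened output distribution while still fitting the hard labels. Below is a minimal sketch of that loss, assuming PyTorch; the function name and hyperparameter values are illustrative rather than taken from any paper listed on this page.

import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.5):
    # Illustrative Hinton-style logit distillation; not the method of any
    # specific paper below. Softening both distributions with the temperature
    # exposes the teacher's relative confidences across non-target classes
    # ("dark knowledge").
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    log_soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    # The T^2 factor keeps the soft-target gradients on the same scale as
    # the hard-label gradients.
    kd = F.kl_div(log_soft_student, soft_teacher,
                  reduction="batchmean") * temperature ** 2
    ce = F.cross_entropy(student_logits, labels)  # ordinary hard-label loss
    return alpha * kd + (1.0 - alpha) * ce

In practice the teacher's logits are computed with gradients disabled (e.g. under torch.no_grad()) so that only the student is updated.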

Papers

Showing 1551–1600 of 4240 papers

Title | Status | Hype
Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging | - | 0
Efficient Technical Term Translation: A Knowledge Distillation Approach for Parenthetical Terminology Translation | - | 0
Advancing Medical Radiograph Representation Learning: A Hybrid Pre-training Paradigm with Multilevel Semantic Granularity | - | 0
Local-to-Global Self-Supervised Representation Learning for Diabetic Retinopathy Grading | - | 0
Enhancing Romanian Offensive Language Detection through Knowledge Distillation, Multi-Task Learning, and Data Augmentation | - | 0
HYDRA-FL: Hybrid Knowledge Distillation for Robust and Accurate Federated Learning | - | 0
Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies | - | 0
Linear Projections of Teacher Embeddings for Few-Class Distillation | - | 0
InfantCryNet: A Data-driven Framework for Intelligent Analysis of Infant Cries | - | 0
Tailored Federated Learning: Leveraging Direction Regulation & Knowledge Distillation | - | 0
Mind the Gap: Promoting Missing Modality Brain Tumor Segmentation with Alignment | - | 0
MiniVLN: Efficient Vision-and-Language Navigation by Progressive Knowledge Distillation | - | 0
Towards Diverse Device Heterogeneous Federated Learning via Task Arithmetic Knowledge Integration | Code | 0
Harmonizing knowledge Transfer in Neural Network with Unified Distillation | - | 0
Multi-modal Cross-domain Self-supervised Pre-training for fMRI and EEG Fusion | - | 0
Semi-Supervised Bone Marrow Lesion Detection from Knee MRI Segmentation Using Mask Inpainting Models | - | 0
Student-Oriented Teacher Knowledge Refinement for Knowledge Distillation | - | 0
Kendall's τ Coefficient for Logits Distillation | - | 0
Shape-intensity knowledge distillation for robust medical image segmentation | Code | 0
Weak-to-Strong Backdoor Attack for Large Language Models | - | 0
SelectiveKD: A semi-supervised framework for cancer detection in DBT through Knowledge Distillation and Pseudo-labeling | - | 0
MT2KD: Towards A General-Purpose Encoder for Speech, Speaker, and Audio Events | - | 0
Adverse Weather Optical Flow: Cumulative Homogeneous-Heterogeneous Adaptation | - | 0
Twin Network Augmentation: A Novel Training Strategy for Improved Spiking Neural Networks and Efficient Weight Quantization | - | 0
Privacy Evaluation Benchmarks for NLP Models | Code | 0
TS-HTFA: Advancing Time Series Forecasting via Hierarchical Text-Free Alignment with Large Language Models | - | 0
Pre-trained Language Model and Knowledge Distillation for Lightweight Sequential Recommendation | - | 0
DSG-KD: Knowledge Distillation from Domain-Specific to General Language Models | Code | 0
DilateQuant: Accurate and Efficient Diffusion Quantization via Weight Dilation | - | 0
Prior Knowledge Distillation Network for Face Super-Resolution | - | 0
EchoAtt: Attend, Copy, then Adjust for More Efficient Large Language Models | - | 0
On Importance of Pruning and Distillation for Efficient Low Resource NLP | - | 0
Generalization in birdsong classification: impact of transfer learning methods and dataset characteristics | - | 0
Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper | - | 0
Simple Unsupervised Knowledge Distillation With Space Similarity | - | 0
Towards Low-latency Event-based Visual Recognition with Hybrid Step-wise Distillation Spiking Neural Networks | Code | 0
LLMR: Knowledge Distillation with a Large Language Model-Induced Reward | - | 0
Bayesian-Optimized One-Step Diffusion Model with Knowledge Distillation for Real-Time 3D Human Motion Prediction | - | 0
Small Language Models are Equation Reasoners | - | 0
Exploring and Enhancing the Transfer of Distribution in Knowledge Distillation for Autoregressive Language Models | - | 0
Enhancing TinyBERT for Financial Sentiment Analysis Using GPT-Augmented FinBERT Distillation | Code | 0
Enhancing SLM via ChatGPT and Dataset Augmentation | - | 0
Enhancing Knowledge Distillation of Large Language Models through Efficient Multi-Modal Distribution Alignment | Code | 0
Efficient Knowledge Distillation: Empowering Small Language Models with Teacher Model Insights | - | 0
Improving Cone-Beam CT Image Quality with Knowledge Distillation-Enhanced Diffusion Model in Imbalanced Data Settings | - | 0
StableMamba: Distillation-free Scaling of Large SSMs for Images and Videos | - | 0
Data Efficient Acoustic Scene Classification using Teacher-Informed Confusing Class Instruction | - | 0
RUIE: Retrieval-based Unified Information Extraction using Large Language Model | Code | 0
EFCM: Efficient Fine-tuning on Compressed Models for deployment of large models in medical image analysis | - | 0
Applications of Knowledge Distillation in Remote Sensing: A Survey | - | 0

Benchmark Results

In the tables below, T: denotes the teacher model and S: the student; the Verified column is blank for entries whose status is Unverified.

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | - | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | - | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | - | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | - | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | - | Unverified
6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | - | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | - | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | - | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | - | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | - | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | - | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | - | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | - | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | - | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | - | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | - | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | - | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | - | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | - | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | - | Unverified