Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity may not be fully utilized, so a compact "student" model can often be trained to reproduce the behavior of the larger "teacher" at a fraction of the inference cost.
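
A common instantiation is the soft-target loss of Hinton et al. (2015): the student is trained to match the teacher's temperature-softened output distribution alongside the ground-truth labels. Below is a minimal PyTorch sketch, assuming generic classification logits; the function name and the temperature/alpha defaults are illustrative, not taken from any paper listed on this page.

import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.5):
    # Hypothetical helper: blends the soft-target KD term with the
    # usual hard-label cross-entropy on the ground-truth labels.
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    log_soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    # KL divergence between the softened distributions; the T^2 factor
    # keeps its gradient magnitude comparable to the hard-label term.
    kd_term = F.kl_div(log_soft_student, soft_teacher,
                       reduction="batchmean") * temperature ** 2
    ce_term = F.cross_entropy(student_logits, labels)
    return alpha * kd_term + (1 - alpha) * ce_term

A higher temperature exposes more of the teacher's "dark knowledge" (the relative probabilities it assigns to the wrong classes), which is where much of the transferable signal lives.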

Papers

Showing 701–750 of 4240 papers

Title | Status | Hype
DSG-KD: Knowledge Distillation from Domain-Specific to General Language Models | Code | 0
Pre-trained Language Model and Knowledge Distillation for Lightweight Sequential Recommendation | - | 0
Prior Knowledge Distillation Network for Face Super-Resolution | - | 0
DilateQuant: Accurate and Efficient Diffusion Quantization via Weight Dilation | - | 0
EchoAtt: Attend, Copy, then Adjust for More Efficient Large Language Models | - | 0
Generalization in birdsong classification: impact of transfer learning methods and dataset characteristics | - | 0
On Importance of Pruning and Distillation for Efficient Low Resource NLP | - | 0
Neural-Symbolic Collaborative Distillation: Advancing Small Language Models for Complex Reasoning Tasks | Code | 1
Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper | - | 0
Simple Unsupervised Knowledge Distillation With Space Similarity | - | 0
Enhancing TinyBERT for Financial Sentiment Analysis Using GPT-Augmented FinBERT Distillation | Code | 0
LLMR: Knowledge Distillation with a Large Language Model-Induced Reward | - | 0
Exploring and Enhancing the Transfer of Distribution in Knowledge Distillation for Autoregressive Language Models | - | 0
Bayesian-Optimized One-Step Diffusion Model with Knowledge Distillation for Real-Time 3D Human Motion Prediction | - | 0
Small Language Models are Equation Reasoners | - | 0
Towards Low-latency Event-based Visual Recognition with Hybrid Step-wise Distillation Spiking Neural Networks | Code | 0
Enhancing SLM via ChatGPT and Dataset Augmentation | - | 0
Efficient Knowledge Distillation: Empowering Small Language Models with Teacher Model Insights | - | 0
Improving Cone-Beam CT Image Quality with Knowledge Distillation-Enhanced Diffusion Model in Imbalanced Data Settings | - | 0
Enhancing Knowledge Distillation of Large Language Models through Efficient Multi-Modal Distribution Alignment | Code | 0
StableMamba: Distillation-free Scaling of Large SSMs for Images and Videos | - | 0
EFCM: Efficient Fine-tuning on Compressed Models for deployment of large models in medical image analysis | - | 0
RUIE: Retrieval-based Unified Information Extraction using Large Language Model | Code | 0
Applications of Knowledge Distillation in Remote Sensing: A Survey | - | 0
Data Efficient Acoustic Scene Classification using Teacher-Informed Confusing Class Instruction | - | 0
Time-Series Forecasting, Knowledge Distillation, and Refinement within a Multimodal PDE Foundation Model | Code | 0
Unleashing the Potential of Mamba: Boosting a LiDAR 3D Sparse Detector by Using Cross-Model Knowledge Distillation | - | 0
Single-stage TTS with Masked Audio Token Modeling and Semantic Knowledge Distillation | - | 0
Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning | Code | 0
Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference | - | 0
Integrated Multi-Level Knowledge Distillation for Enhanced Speaker Verification | - | 0
Effective Pre-Training of Audio Transformers for Sound Event Detection | Code | 1
Joint Semantic Knowledge Distillation and Masked Acoustic Modeling for Full-band Speech Restoration with Improved Intelligibility | - | 0
AWF: Adaptive Weight Fusion for Enhanced Class Incremental Semantic Segmentation | - | 0
DiReDi: Distillation and Reverse Distillation for AIoT Applications | - | 0
Ruri: Japanese General Text Embeddings | Code | 2
Learn from Balance: Rectifying Knowledge Transfer for Long-Tailed Scenarios | - | 0
Privacy-Preserving Federated Learning with Consistency via Knowledge Distillation Using Conditional Generator | - | 0
DS-ViT: Dual-Stream Vision Transformer for Cross-Task Distillation in Alzheimer's Early Diagnosis | - | 0
EchoDFKD: Data-Free Knowledge Distillation for Cardiac Ultrasound Segmentation using Synthetic Data | Code | 1
Enhancing CTC-Based Visual Speech Recognition | - | 0
A Continual and Incremental Learning Approach for TinyML On-device Training Using Dataset Distillation and Model Size Adaption | - | 0
How Redundant Is the Transformer Stack in Speech Representation Models? | - | 0
EasyST: A Simple Framework for Spatio-Temporal Prediction | Code | 1
Knowledge Distillation via Query Selection for Detection Transformer | - | 0
Applied Federated Model Personalisation in the Industrial Domain: A Comparative Study | - | 0
Distilling Generative-Discriminative Representations for Very Low-Resolution Face Recognition | - | 0
Complex Emotion Recognition System using basic emotions via Facial Expression, EEG, and ECG Signals: a review | - | 0
LEROjD: Lidar Extended Radar-Only Object Detection | Code | 1
FedBrain-Distill: Communication-Efficient Federated Brain Tumor Classification Using Ensemble Knowledge Distillation on Non-IID Data | Code | 0
Page 15 of 85

Benchmark Results

In the tables below, "T:" denotes the teacher model and "S:" the student; "Claimed" is the figure reported by the authors, and "Verified" stays empty until the result has been independently reproduced.

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | - | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | - | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | - | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | - | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | - | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | - | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | - | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | - | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | - | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | - | Unverified
2 | shufflenet-v2 (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | - | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | - | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | - | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | - | Unverified
6 | ReviewKD++ (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | - | Unverified
7 | ReviewKD++ (T: resnet32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | - | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | - | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | - | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | - | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | - | Unverified
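
The Top-1 accuracy entries above measure the fraction of test inputs whose highest-scoring predicted class matches the ground-truth label. For reference, a minimal PyTorch sketch of that metric, with model and loader as generic placeholders rather than artifacts of any listed method:

import torch

@torch.no_grad()
def top1_accuracy(model, loader, device="cpu"):
    # Counts how often the argmax class equals the ground-truth label.
    model.eval()
    correct, total = 0, 0
    for images, labels in loader:
        images, labels = images.to(device), labels.to(device)
        preds = model(images).argmax(dim=-1)
        correct += (preds == labels).sum().item()
        total += labels.numel()
    return 100.0 * correct / total  # percentage, as reported in the tables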