SOTAVerified

Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity is often not fully utilized, so a compact student trained to mimic the large teacher can retain much of its accuracy at a fraction of the inference cost.
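
For concreteness, the classic recipe from Hinton et al. (2015) trains the student to match the teacher's temperature-softened output distribution while still fitting the hard labels. Below is a minimal sketch of that loss, assuming PyTorch; the function name and hyperparameter values are illustrative rather than taken from any paper listed on this page.

import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.5):
    # Illustrative Hinton-style logit distillation; not the method of any
    # specific paper below. Softening both distributions with the temperature
    # exposes the teacher's relative confidences across non-target classes
    # ("dark knowledge").
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    log_soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    # The T^2 factor keeps the soft-target gradients on the same scale as
    # the hard-label gradients.
    kd = F.kl_div(log_soft_student, soft_teacher,
                  reduction="batchmean") * temperature ** 2
    ce = F.cross_entropy(student_logits, labels)  # ordinary hard-label loss
    return alpha * kd + (1.0 - alpha) * ce

In practice the teacher's logits are computed with gradients disabled (e.g. under torch.no_grad()) so that only the student is updated.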

Papers

Showing 1551–1600 of 4240 papers

Title | Status | Hype
Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging | - | 0
Efficient Technical Term Translation: A Knowledge Distillation Approach for Parenthetical Terminology Translation | - | 0
Advancing Medical Radiograph Representation Learning: A Hybrid Pre-training Paradigm with Multilevel Semantic Granularity | - | 0
Local-to-Global Self-Supervised Representation Learning for Diabetic Retinopathy Grading | - | 0
Enhancing Romanian Offensive Language Detection through Knowledge Distillation, Multi-Task Learning, and Data Augmentation | - | 0
HYDRA-FL: Hybrid Knowledge Distillation for Robust and Accurate Federated Learning | - | 0
Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies | - | 0
Linear Projections of Teacher Embeddings for Few-Class Distillation | - | 0
InfantCryNet: A Data-driven Framework for Intelligent Analysis of Infant Cries | - | 0
Tailored Federated Learning: Leveraging Direction Regulation & Knowledge Distillation | - | 0
Mind the Gap: Promoting Missing Modality Brain Tumor Segmentation with Alignment | - | 0
MiniVLN: Efficient Vision-and-Language Navigation by Progressive Knowledge Distillation | - | 0
Towards Diverse Device Heterogeneous Federated Learning via Task Arithmetic Knowledge Integration | Code | 0
Harmonizing knowledge Transfer in Neural Network with Unified Distillation | - | 0
Multi-modal Cross-domain Self-supervised Pre-training for fMRI and EEG Fusion | - | 0
Semi-Supervised Bone Marrow Lesion Detection from Knee MRI Segmentation Using Mask Inpainting Models | - | 0
Student-Oriented Teacher Knowledge Refinement for Knowledge Distillation | - | 0
Kendall's τ Coefficient for Logits Distillation | - | 0
Shape-intensity knowledge distillation for robust medical image segmentation | Code | 0
Weak-to-Strong Backdoor Attack for Large Language Models | - | 0
SelectiveKD: A semi-supervised framework for cancer detection in DBT through Knowledge Distillation and Pseudo-labeling | - | 0
MT2KD: Towards A General-Purpose Encoder for Speech, Speaker, and Audio Events | - | 0
Adverse Weather Optical Flow: Cumulative Homogeneous-Heterogeneous Adaptation | - | 0
Twin Network Augmentation: A Novel Training Strategy for Improved Spiking Neural Networks and Efficient Weight Quantization | - | 0
Privacy Evaluation Benchmarks for NLP Models | Code | 0
TS-HTFA: Advancing Time Series Forecasting via Hierarchical Text-Free Alignment with Large Language Models | - | 0
Pre-trained Language Model and Knowledge Distillation for Lightweight Sequential Recommendation | - | 0
DSG-KD: Knowledge Distillation from Domain-Specific to General Language Models | Code | 0
DilateQuant: Accurate and Efficient Diffusion Quantization via Weight Dilation | - | 0
Prior Knowledge Distillation Network for Face Super-Resolution | - | 0
EchoAtt: Attend, Copy, then Adjust for More Efficient Large Language Models | - | 0
On Importance of Pruning and Distillation for Efficient Low Resource NLP | - | 0
Generalization in birdsong classification: impact of transfer learning methods and dataset characteristics | - | 0
Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper | - | 0
Simple Unsupervised Knowledge Distillation With Space Similarity | - | 0
Towards Low-latency Event-based Visual Recognition with Hybrid Step-wise Distillation Spiking Neural Networks | Code | 0
LLMR: Knowledge Distillation with a Large Language Model-Induced Reward | - | 0
Bayesian-Optimized One-Step Diffusion Model with Knowledge Distillation for Real-Time 3D Human Motion Prediction | - | 0
Small Language Models are Equation Reasoners | - | 0
Exploring and Enhancing the Transfer of Distribution in Knowledge Distillation for Autoregressive Language Models | - | 0
Enhancing TinyBERT for Financial Sentiment Analysis Using GPT-Augmented FinBERT Distillation | Code | 0
Enhancing SLM via ChatGPT and Dataset Augmentation | - | 0
Enhancing Knowledge Distillation of Large Language Models through Efficient Multi-Modal Distribution Alignment | Code | 0
Efficient Knowledge Distillation: Empowering Small Language Models with Teacher Model Insights | - | 0
Improving Cone-Beam CT Image Quality with Knowledge Distillation-Enhanced Diffusion Model in Imbalanced Data Settings | - | 0
StableMamba: Distillation-free Scaling of Large SSMs for Images and Videos | - | 0
Data Efficient Acoustic Scene Classification using Teacher-Informed Confusing Class Instruction | - | 0
RUIE: Retrieval-based Unified Information Extraction using Large Language Model | Code | 0
EFCM: Efficient Fine-tuning on Compressed Models for deployment of large models in medical image analysis | - | 0
Applications of Knowledge Distillation in Remote Sensing: A Survey | - | 0

Benchmark Results

In the tables below, T: denotes the teacher model and S: the student; the Verified column is blank for entries whose status is Unverified.

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | - | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | - | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | - | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | - | Unverified
5 | KD++ (T: regnety-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | - | Unverified
6 | VkD (T: RegNety 160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | - | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | - | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | - | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | - | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | - | Unverified
2 | shufflenet-v2 (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | - | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | - | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | - | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | - | Unverified
6 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | - | Unverified
7 | ReviewKD++ (T: resnet-32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | - | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | - | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | - | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | - | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: Adabins, S: MobileNetV2) | RMSE | 2.43 | - | Unverified