Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3701–3750 of 4240 papers

Title	Date	Tasks	Status
InfantCryNet: A Data-driven Framework for Intelligent Analysis of Infant Cries	Sep 29, 2024	Knowledge DistillationModel Compression	—Unverified
InFiConD: Interactive No-code Fine-tuning with Concept-based Knowledge Distillation	Jun 25, 2024	Knowledge Distillation	—Unverified
Information Extraction from Heterogeneous Documents without Ground Truth Labels using Synthetic Label Generation and Knowledge Distillation	Nov 22, 2024	Anomaly Detectiondocument understanding	—Unverified
Information-Theoretic GAN Compression with Variational Energy-based Model	Mar 28, 2023	Image EnhancementKnowledge Distillation	—Unverified
Inherit with Distillation and Evolve with Contrast: Exploring Class Incremental Semantic Segmentation Without Exemplar Memory	Sep 27, 2023	Class-Incremental Semantic SegmentationContrastive Learning	—Unverified
InhibiDistilbert: Knowledge Distillation for a ReLU and Addition-based Transformer	Mar 20, 2025	Knowledge DistillationModel Compression	—Unverified
Initial Classifier Weights Replay for Memoryless Class Incremental Learning	Aug 31, 2020	Allclass-incremental learning	—Unverified
Injecting Explainability and Lightweight Design into Weakly Supervised Video Anomaly Detection Systems	Dec 28, 2024	Anomaly DetectionBinary Classification	—Unverified
Injecting Spatial Information for Monaural Speech Enhancement via Knowledge Distillation	Dec 2, 2022	Knowledge DistillationSpeech Enhancement	—Unverified
Inplace knowledge distillation with teacher assistant for improved training of flexible deep neural networks	May 18, 2021	image-classificationImage Classification	—Unverified
In-situ animal behavior classification using knowledge distillation and fixed-point quantization	Sep 9, 2022	ClassificationKnowledge Distillation	—Unverified
Instance-aware Model Ensemble With Distillation For Unsupervised Domain Adaptation	Nov 15, 2022	Domain AdaptationKnowledge Distillation	—Unverified
In Teacher We Trust: Learning Compressed Models for Pedestrian Detection	Dec 1, 2016	Knowledge DistillationPedestrian Detection	—Unverified
Integrated Multi-Level Knowledge Distillation for Enhanced Speaker Verification	Sep 14, 2024	Knowledge DistillationSpeaker Verification	—Unverified
Integrating Arithmetic Learning Improves Mathematical Reasoning in Smaller Models	Feb 18, 2025	Data AugmentationGSM8K	—Unverified
Integrating ChatGPT into Secure Hospital Networks: A Case Study on Improving Radiology Report Analysis	Feb 14, 2024	Contrastive LearningKnowledge Distillation	—Unverified
Integration of Pre-trained Networks with Continuous Token Interface for End-to-End Spoken Language Understanding	Apr 15, 2021	intent-classificationIntent Classification	—Unverified
Interactive DualChecker for Mitigating Hallucinations in Distilling Large Language Models	Aug 22, 2024	In-Context LearningKnowledge Distillation	—Unverified
Interactive Knowledge Distillation	Jul 3, 2020	image-classificationImage Classification	—Unverified
Interactive Multi-fidelity Learning for Cost-effective Adaptation of Language Model with Sparse Human Supervision	Oct 31, 2023	InformativenessKnowledge Distillation	—Unverified
Inter-KD: Intermediate Knowledge Distillation for CTC-Based Automatic Speech Recognition	Nov 28, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Intermediate Distillation: Data-Efficient Distillation from Black-Box LLMs for Information Retrieval	Jun 18, 2024	Information RetrievalKnowledge Distillation	—Unverified
Interpretable discovery of new semiconductors with machine learning	Jan 12, 2021	BIG-bench Machine LearningKnowledge Distillation	—Unverified
Interpretable Foreground Object Search As Knowledge Distillation	Jul 20, 2020	Knowledge DistillationObject	—Unverified
Interpretable Traces, Unexpected Outcomes: Investigating the Disconnect in Trace-Based Knowledge Distillation	May 20, 2025	Information RetrievalKnowledge Distillation	—Unverified
Interruption-Aware Cooperative Perception for V2X Communication-Aided Autonomous Driving	Apr 24, 2023	Autonomous DrivingAutonomous Vehicles	—Unverified
Intrinsic Image Decomposition for Robust Self-supervised Monocular Depth Estimation on Reflective Surfaces	Mar 28, 2025	Depth EstimationDepth Prediction	—Unverified
Introspective Learning by Distilling Knowledge from Online Self-explanation	Sep 19, 2020	Knowledge Distillation	—Unverified
Intuitive Access to Smartphone Settings Using Relevance Model Trained by Contrastive Learning	Jul 15, 2023	Contrastive LearningKnowledge Distillation	—Unverified
Investigating and Enhancing Vision-Audio Capability in Omnimodal Large Language Models	Feb 27, 2025	Knowledge DistillationSelf-Knowledge Distillation	—Unverified
IP-MOT: Instance Prompt Learning for Cross-Domain Multi-Object Tracking	Oct 30, 2024	Knowledge DistillationLanguage Modelling	—Unverified
IQ-VFI: Implicit Quadratic Motion Estimation for Video Frame Interpolation	Jan 1, 2024	Knowledge DistillationMotion Estimation	—Unverified
Is Label Smoothing Truly Incompatible with Knowledge Distillation: An Empirical Study	Apr 1, 2021	image-classificationImage Classification	—Unverified
Is LLM the Silver Bullet to Low-Resource Languages Machine Translation?	Mar 31, 2025	ArticlesKnowledge Distillation	—Unverified
Isotonic Data Augmentation for Knowledge Distillation	Jul 3, 2021	AttributeData Augmentation	—Unverified
ISP Distillation	Jan 25, 2021	Knowledge DistillationObject Recognition	—Unverified
Iterative Dual Domain Adaptation for Neural Machine Translation	Dec 16, 2019	Domain AdaptationKnowledge Distillation	—Unverified
Iterative Graph Self-Distillation	Oct 23, 2020	Contrastive LearningGraph Learning	—Unverified
Iterative Self Knowledge Distillation -- From Pothole Classification to Fine-Grained and COVID Recognition	Feb 4, 2022	ClassificationKnowledge Distillation	—Unverified
JEP-KD: Joint-Embedding Predictive Architecture Based Knowledge Distillation for Visual Speech Recognition	Mar 4, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
基于层间知识蒸馏的神经机器翻译(Inter-layer Knowledge Distillation for Neural Machine Translation)	Aug 1, 2021	Knowledge DistillationMachine Translation	—Unverified
Joint Architecture and Knowledge Distillation in CNN for Chinese Text Recognition	Dec 17, 2019	Handwritten Chinese Text RecognitionKnowledge Distillation	—Unverified
Joint-DetNAS: Upgrade Your Detector with NAS, Pruning and Dynamic Distillation	May 27, 2021	Knowledge DistillationNeural Architecture Search	—Unverified
Joint Diffusion models in Continual Learning	Nov 12, 2024	Continual LearningKnowledge Distillation	—Unverified
Joint Feature Distribution Alignment Learning for NIR-VIS and VIS-VIS Face Recognition	Apr 25, 2022	Face RecognitionHeterogeneous Face Recognition	—Unverified
Joint Input and Output Coordination for Class-Incremental Learning	Sep 9, 2024	class-incremental learningClass Incremental Learning	—Unverified
Jointly Learning Knowledge Embedding and Neighborhood Consensus with Relational Knowledge Distillation for Entity Alignment	Jan 25, 2022	BenchmarkingEntity Alignment	—Unverified
Joint Optimization of Streaming and Non-Streaming Automatic Speech Recognition with Multi-Decoder and Knowledge Distillation	May 22, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Joint Semantic Knowledge Distillation and Masked Acoustic Modeling for Full-band Speech Restoration with Improved Intelligibility	Sep 14, 2024	Knowledge DistillationLanguage Modeling	—Unverified
Joint Speech Activity and Overlap Detection with Multi-Exit Architecture	Sep 24, 2022	Action DetectionActivity Detection	—Unverified

Show:10 25 50

← PrevPage 75 of 85Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified