
Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model (the teacher) to a smaller one (the student). While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, that capacity may not be fully utilized, so a well-trained student can often recover much of the teacher's accuracy at a fraction of the inference cost.
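
Below is a minimal sketch of the classic response-based recipe (Hinton et al., 2015) that much of the work listed on this page builds on: the student is trained to match the teacher's temperature-softened output distribution alongside the usual hard labels. The hyperparameter defaults (T, alpha) and the function name are illustrative, not taken from any specific paper listed here.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Soft-target KL term plus hard-label cross-entropy (Hinton et al., 2015)."""
    # Soften both distributions with temperature T. The T*T factor restores
    # the gradient scale that dividing the logits by T would otherwise shrink.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard

# In a training step, only the student receives gradients:
#   with torch.no_grad():
#       teacher_logits = teacher(x)
#   loss = distillation_loss(student(x), teacher_logits, y)
```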

Papers

Showing 2951–3000 of 4240 papers

Title | Status | Hype
Compressing VAE-Based Out-of-Distribution Detectors for Embedded Deployment | — | 0
Compressing Visual-linguistic Model via Knowledge Distillation | — | 0
Compression of Acoustic Event Detection Models With Quantized Distillation | — | 0
Compression of Deep Learning Models for Text: A Survey | — | 0
Compression of end-to-end non-autoregressive image-to-speech system for low-resourced devices | — | 0
ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval | — | 0
ConceptDistil: Model-Agnostic Distillation of Concept Explanations | — | 0
Condensed Sample-Guided Model Inversion for Knowledge Distillation | — | 0
Conditional Autoregressors are Interpretable Classifiers | — | 0
Conditional Generative Data-free Knowledge Distillation | — | 0
Confidence Attention and Generalization Enhanced Distillation for Continuous Video Domain Adaptation | — | 0
Confidence Based Bidirectional Global Context Aware Training Framework for Neural Machine Translation | — | 0
Confidence Conditioned Knowledge Distillation | — | 0
Confidence Preservation Property in Knowledge Distillation Abstractions | — | 0
Configurable Holography: Towards Display and Scene Adaptation | — | 0
Conformer with dual-mode chunked attention for joint online and offline ASR | — | 0
Constructing Deep Spiking Neural Networks from Artificial Neural Networks with Knowledge Distillation | — | 0
Contextual Affinity Distillation for Image Anomaly Detection | — | 0
Contextual Distillation Model for Diversified Recommendation | — | 0
Contextualized Attention-based Knowledge Transfer for Spoken Conversational Question Answering | — | 0
Contextual Knowledge Distillation for Transformer Compression | — | 0
Continual Detection Transformer for Incremental Object Detection | — | 0
Continual Distillation Learning: Knowledge Distillation in Prompt-based Continual Learning | — | 0
Continual Face Forgery Detection via Historical Distribution Preserving | — | 0
Continual Learning for Class- and Domain-Incremental Semantic Segmentation | — | 0
Continual Learning for Fake Audio Detection | — | 0
Continual Learning for Neural Machine Translation | — | 0
Unsupervised Continual Learning Via Pseudo Labels | — | 0
Continual Learning with Diffusion-based Generative Replay for Industrial Streaming Data | — | 0
Continual Learning with Dirichlet Generative-based Rehearsal | — | 0
Continual Segment: Towards a Single, Unified and Accessible Continual Segmentation Model of 143 Whole-body Organs in CT Scans | — | 0
Continual Segment: Towards a Single, Unified and Non-forgetting Continual Segmentation Model of 143 Whole-body Organs in CT Scans | — | 0
Continual Self-Supervised Learning with Masked Autoencoders in Remote Sensing | — | 0
Continuation KD: Improved Knowledge Distillation through the Lens of Continuation Optimization | — | 0
Continuous Concepts Removal in Text-to-image Diffusion Models | — | 0
Continuous sign language recognition based on cross-resolution knowledge distillation | — | 0
Contrastive Continual Multi-view Clustering with Filtered Structural Fusion | — | 0
Contrastive Learning-Based Spectral Knowledge Distillation for Multi-Modality and Missing Modality Scenarios in Semantic Segmentation | — | 0
Contrastive Representation Distillation via Multi-Scale Feature Decoupling | — | 0
Contrast R-CNN for Continual Learning in Object Detection | — | 0
Contrast-reconstruction Representation Learning for Self-supervised Skeleton-based Action Recognition | — | 0
Controlling the Quality of Distillation in Response-Based Network Compression | — | 0
Control Policy Correction Framework for Reinforcement Learning-based Energy Arbitrage Strategies | — | 0
Convolutional Neural Network Compression through Generalized Kronecker Product Decomposition | — | 0
Cooperative Denoising for Distantly Supervised Relation Extraction | — | 0
Cooperative Learning for Cost-Adaptive Inference | — | 0
Coordinating Cross-modal Distillation for Molecular Property Prediction | — | 0
ChromaDistill: Colorizing Monochrome Radiance Fields with Knowledge Distillation | — | 0
CoroNetGAN: Controlled Pruning of GANs via Hypernetworks | — | 0
Corrected with the Latest Version: Make Robust Asynchronous Federated Learning Possible | — | 0
Page 60 of 85

Benchmark Results

In the tables below, "T:" names the teacher model and "S:" the student. "Claimed" is the metric value reported by the authors; the "Verified" column is shown as "—" where no independently verified value has been recorded, matching the "Unverified" status.

# | Model | Metric | Claimed | Verified | Status
1 | ScaleKD (T: BEiT-L, S: ViT-B/14) | Top-1 accuracy (%) | 86.43 | — | Unverified
2 | ScaleKD (T: Swin-L, S: ViT-B/16) | Top-1 accuracy (%) | 85.53 | — | Unverified
3 | ScaleKD (T: Swin-L, S: ViT-S/16) | Top-1 accuracy (%) | 83.93 | — | Unverified
4 | ScaleKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 83.8 | — | Unverified
5 | KD++ (T: RegNetY-16GF, S: ViT-B) | Top-1 accuracy (%) | 83.6 | — | Unverified
6 | VkD (T: RegNetY-160, S: DeiT-S) | Top-1 accuracy (%) | 82.9 | — | Unverified
7 | SpectralKD (T: Swin-S, S: Swin-T) | Top-1 accuracy (%) | 82.7 | — | Unverified
8 | ScaleKD (T: Swin-L, S: ResNet-50) | Top-1 accuracy (%) | 82.55 | — | Unverified
9 | DiffKD (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.5 | — | Unverified
10 | DIST (T: Swin-L, S: Swin-T) | Top-1 accuracy (%) | 82.3 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SRD (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 79.86 | — | Unverified
2 | shufflenet-v2 (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 78.76 | — | Unverified
3 | MV-MR (T: CLIP/ViT-B-16, S: resnet50) | Top-1 accuracy (%) | 78.6 | — | Unverified
4 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 78.28 | — | Unverified
5 | resnet8x4 (T: resnet32x4, S: resnet8x4 [modified]) | Top-1 accuracy (%) | 78.08 | — | Unverified
6 | ReviewKD++ (T: resnet32x4, S: shufflenet-v2) | Top-1 accuracy (%) | 77.93 | — | Unverified
7 | ReviewKD++ (T: resnet32x4, S: shufflenet-v1) | Top-1 accuracy (%) | 77.68 | — | Unverified
8 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 77.5 | — | Unverified
9 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.68 | — | Unverified
10 | resnet8x4 (T: resnet32x4, S: resnet8x4) | Top-1 accuracy (%) | 76.31 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LSHFM (T: ResNet101, S: ResNet50) | mAP | 93.17 | — | Unverified
2 | LSHFM (T: ResNet101, S: MobileNetV2) | mAP | 90.14 | — | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TIE-KD (T: AdaBins, S: MobileNetV2) | RMSE (lower is better) | 2.43 | — | Unverified
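
The Top-1 accuracy figures above count a prediction as correct only when the model's single highest-scoring class matches the ground-truth label. As a minimal illustration of how such a number is computed (this is not the evaluation harness used by this site; `model` and `loader` are placeholder names for a distilled student and its evaluation set):

```python
import torch

@torch.no_grad()
def top1_accuracy(model, loader, device="cpu"):
    model.eval()
    correct, total = 0, 0
    for images, labels in loader:
        logits = model(images.to(device))
        # Top-1: take the argmax class per sample and compare with the label.
        preds = logits.argmax(dim=-1).cpu()
        correct += (preds == labels).sum().item()
        total += labels.numel()
    return 100.0 * correct / total  # percentage, as reported in the tables
```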