Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2451–2500 of 4240 papers

Title	Date	Tasks	Status
Federated Learning on Non-iid Data via Local and Global Distillation	Jun 26, 2023	Federated LearningKnowledge Distillation	—Unverified
Cross Architecture Distillation for Face Recognition	Jun 26, 2023	Face RecognitionKnowledge Distillation	—Unverified
Enhancing Mapless Trajectory Prediction through Knowledge Distillation	Jun 25, 2023	Autonomous DrivingKnowledge Distillation	—Unverified
Feature Adversarial Distillation for Point Cloud Classification	Jun 25, 2023	ClassificationFAD	—Unverified
On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes	Jun 23, 2023	Arithmetic ReasoningKnowledge Distillation	—Unverified
Temporal Action Proposal Generation With Action Frequency Adaptive Network	Jun 23, 2023	Knowledge DistillationTemporal Action Proposal Generation	CodeCode Available
Incorporating Graph Information in Transformer-based AMR Parsing	Jun 23, 2023	Abstract Meaning RepresentationAMR Parsing	CodeCode Available
Knowledge Distillation via Token-level Relationship Graph	Jun 20, 2023	Knowledge DistillationTransfer Learning	—Unverified
Recent Advances in Direct Speech-to-text Translation	Jun 20, 2023	Data AugmentationDecoder	—Unverified
Categories of Response-Based, Feature-Based, and Relation-Based Knowledge Distillation	Jun 19, 2023	Knowledge DistillationRelation	—Unverified
FSAR: Federated Skeleton-based Action Recognition with Adaptive Topology Structure and Knowledge Distillation	Jun 19, 2023	Action RecognitionFederated Learning	—Unverified
Semi-Supervised Learning for Multi-Label Cardiovascular Diseases Prediction:A Multi-Dataset Study	Jun 18, 2023	Data AugmentationDiagnostic	—Unverified
MixedTeacher : Knowledge Distillation for fast inference textural anomaly detection	Jun 16, 2023	Anomaly DetectionKnowledge Distillation	CodeCode Available
Knowledge Distillation for Efficient Audio-Visual Video Captioning	Jun 16, 2023	Audio-Visual Video CaptioningCaption Generation	—Unverified
Squeezing nnU-Nets with Knowledge Distillation for On-Board Cloud Detection	Jun 16, 2023	Cloud DetectionKnowledge Distillation	—Unverified
Bridging the Gap between Decision and Logits in Decision-based Knowledge Distillation for Pre-trained Language Models	Jun 15, 2023	Data AugmentationKnowledge Distillation	CodeCode Available
Self-Knowledge Distillation for Surgical Phase Recognition	Jun 15, 2023	DecoderKnowledge Distillation	—Unverified
Heterogeneous Continual Learning	Jun 14, 2023	Continual LearningKnowledge Distillation	—Unverified
Enhanced Multimodal Representation Learning with Cross-modal KD	Jun 13, 2023	Contrastive LearningEmotion Classification	—Unverified
EaSyGuide : ESG Issue Identification Framework leveraging Abilities of Generative Large Language Models	Jun 11, 2023	ArticlesKnowledge Distillation	CodeCode Available
Improving Frame-level Classifier for Word Timings with Non-peaky CTC in End-to-End Automatic Speech Recognition	Jun 9, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
BOOT: Data-free Distillation of Denoising Diffusion Models with Bootstrapping	Jun 8, 2023	DenoisingKnowledge Distillation	—Unverified
The economic trade-offs of large language models: A case study	Jun 8, 2023	Knowledge DistillationPrompt Engineering	—Unverified
Population-Based Evolutionary Gaming for Unsupervised Person Re-identification	Jun 8, 2023	DiversityKnowledge Distillation	—Unverified
Faithful Knowledge Distillation	Jun 7, 2023	Adversarial RobustnessKnowledge Distillation	—Unverified
Model-Based Reinforcement Learning with Multi-Task Offline Pretraining	Jun 6, 2023	Knowledge DistillationModel-based Reinforcement Learning	CodeCode Available
Zero shot framework for satellite image restoration	Jun 5, 2023	DisentanglementImage Restoration	—Unverified
Joint Pre-training and Local Re-training: Transferable Representation Learning on Multi-source Knowledge Graphs	Jun 5, 2023	Entity AlignmentKnowledge Distillation	CodeCode Available
Modular Transformers: Compressing Transformers into Modularized Layers for Flexible Efficient Inference	Jun 4, 2023	DecoderKnowledge Distillation	—Unverified
Deep Classifier Mimicry without Data Access	Jun 3, 2023	Knowledge Distillation	CodeCode Available
Evolving Knowledge Mining for Class Incremental Segmentation	Jun 3, 2023	Class-Incremental Semantic SegmentationKnowledge Distillation	CodeCode Available
Privacy Distillation: Reducing Re-identification Risk of Multimodal Diffusion Models	Jun 2, 2023	Knowledge Distillation	—Unverified
Group channel pruning and spatial attention distilling for object detection	Jun 2, 2023	Knowledge DistillationModel Compression	—Unverified
Speech Translation with Foundation Models and Optimal Transport: UPC at IWSLT23	Jun 2, 2023	Knowledge DistillationMachine Translation	—Unverified
Teacher Agent: A Knowledge Distillation-Free Framework for Rehearsal-based Video Incremental Learning	Jun 1, 2023	Incremental LearningKnowledge Distillation	CodeCode Available
Improved Cross-Lingual Transfer Learning For Automatic Speech Translation	Jun 1, 2023	automatic-speech-translationCross-Lingual Transfer	—Unverified
Graph Entropy Minimization for Semi-supervised Node Classification	May 31, 2023	ClassificationKnowledge Distillation	CodeCode Available
Accurate and Structured Pruning for Efficient Automatic Speech Recognition	May 31, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Research on Multilingual News Clustering Based on Cross-Language Word Embeddings	May 30, 2023	ClusteringKnowledge Distillation	—Unverified
A Recipe for Efficient SBIR Models: Combining Relative Triplet Loss with Batch Normalization and Knowledge Distillation	May 30, 2023	Data AugmentationImage Retrieval	—Unverified
KEYword based Sampling (KEYS) for Large Language Models	May 30, 2023	Knowledge DistillationLanguage Modeling	—Unverified
Bridging the Sim-to-Real Gap from the Information Bottleneck Perspective	May 29, 2023	Knowledge DistillationReinforcement Learning (RL)	CodeCode Available
GripRank: Bridging the Gap between Retrieval and Generation via the Generative Knowledge Improved Passage Ranking	May 29, 2023	Answer GenerationDialogue Generation	—Unverified
ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval	May 28, 2023	Image RetrievalKnowledge Distillation	—Unverified
Vision Transformers for Small Histological Datasets Learned through Knowledge Distillation	May 27, 2023	Airbubbles DetectionAnomaly Detection	CodeCode Available
Knowledge Distillation Performs Partial Variance Reduction	May 27, 2023	Knowledge Distillation	CodeCode Available
ABC-KD: Attention-Based-Compression Knowledge Distillation for Deep Learning-Based Noise Suppression	May 26, 2023	Knowledge Distillation	—Unverified
A Study on Knowledge Distillation from Weak Teacher for Scaling Up Pre-trained Language Models	May 26, 2023	Knowledge Distillation	—Unverified
Knowledge Distillation for Neural Transducer-based Target-Speaker ASR: Exploiting Parallel Mixture/Single-Talker Speech Data	May 25, 2023	Knowledge DistillationSpeech Extraction	—Unverified
Camera-Incremental Object Re-Identification with Identity Knowledge Evolution	May 25, 2023	Knowledge DistillationObject	CodeCode Available

Show:10 25 50

← PrevPage 50 of 85Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified