Knowledge Distillation

Knowledge distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep neural networks or ensembles of many models) have higher knowledge capacity than small models, this capacity might not be fully utilized.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2601–2650 of 4240 papers

Title	Date	Tasks	Status
GVP: Generative Volumetric Primitives	Mar 31, 2023	Image GenerationKnowledge Distillation	—Unverified
Selective Knowledge Distillation for Non-Autoregressive Neural Machine Translation	Mar 31, 2023	Knowledge DistillationMachine Translation	—Unverified
KD-DLGAN: Data Limited Image Generation via Knowledge Distillation	Mar 30, 2023	DiversityImage Generation	—Unverified
If At First You Don't Succeed: Test Time Re-ranking for Zero-shot, Cross-domain Retrieval	Mar 30, 2023	Image RetrievalKnowledge Distillation	—Unverified
oBERTa: Improving Sparse Transfer Learning via improved initialization, distillation, and pruning regimes	Mar 30, 2023	Knowledge DistillationModel Compression	—Unverified
Asymmetric Image Retrieval with Cross Model Compatible Ensembles	Mar 30, 2023	DiversityFace Recognition	—Unverified
SELF-VS: Self-supervised Encoding Learning For Video Summarization	Mar 28, 2023	Knowledge DistillationRepresentation Learning	CodeCode Available
Information-Theoretic GAN Compression with Variational Energy-based Model	Mar 28, 2023	Image EnhancementKnowledge Distillation	—Unverified
Projected Latent Distillation for Data-Agnostic Consolidation in Distributed Continual Learning	Mar 28, 2023	Continual LearningKnowledge Distillation	CodeCode Available
Empowering Dual-Encoder with Query Generator for Cross-Lingual Dense Retrieval	Mar 27, 2023	Knowledge DistillationRetrieval	—Unverified
Mutually-paced Knowledge Distillation for Cross-lingual Temporal Knowledge Graph Reasoning	Mar 27, 2023	Knowledge DistillationKnowledge Graphs	—Unverified
Improving Neural Topic Models with Wasserstein Knowledge Distillation	Mar 27, 2023	Knowledge DistillationTopic Models	CodeCode Available
Generalization Matters: Loss Minima Flattening via Parameter Hybridization for Efficient Online Knowledge Distillation	Mar 26, 2023	Knowledge Distillation	CodeCode Available
Multi-Frame Self-Supervised Depth Estimation with Multi-Scale Feature Fusion in Dynamic Scenes	Mar 26, 2023	Depth EstimationKnowledge Distillation	—Unverified
Multi-view knowledge distillation transformer for human action recognition	Mar 25, 2023	Action RecognitionKnowledge Distillation	—Unverified
Task-Attentive Transformer Architecture for Continual Learning of Vision-and-Language Tasks Using Knowledge Distillation	Mar 25, 2023	Continual LearningKnowledge Distillation	—Unverified
Dealing With Heterogeneous 3D MR Knee Images: A Federated Few-Shot Learning Method With Dual Knowledge Distillation	Mar 25, 2023	Federated LearningFew-Shot Learning	CodeCode Available
Exploiting Unlabelled Photos for Stronger Fine-Grained SBIR	Mar 24, 2023	Image RetrievalKnowledge Distillation	—Unverified
Mixed-Type Wafer Classification For Low Memory Devices Using Knowledge Distillation	Mar 24, 2023	Knowledge DistillationLightweight Deployment	—Unverified
DyLiN: Making Light Field Networks Dynamic	Mar 24, 2023	AttributeKnowledge Distillation	—Unverified
Edge-free but Structure-aware: Prototype-Guided Knowledge Distillation from GNNs to MLPs	Mar 24, 2023	Knowledge Distillation	—Unverified
A Simple and Generic Framework for Feature Distillation via Channel-wise Transformation	Mar 23, 2023	image-classificationImage Classification	—Unverified
Open-Vocabulary Object Detection using Pseudo Caption Labels	Mar 23, 2023	Image CaptioningKnowledge Distillation	—Unverified
From Knowledge Distillation to Self-Knowledge Distillation: A Unified Approach with Normalized Loss and Customized Soft Labels	Mar 23, 2023	Knowledge DistillationSelf-Knowledge Distillation	—Unverified
From Wide to Deep: Dimension Lifting Network for Parameter-efficient Knowledge Graph Embedding	Mar 22, 2023	Graph EmbeddingKnowledge Distillation	—Unverified
Heterogeneous-Branch Collaborative Learning for Dialogue Generation	Mar 21, 2023	AttributeDialogue Generation	—Unverified
MV-MR: multi-views and multi-representations for self-supervised learning and knowledge distillation	Mar 21, 2023	ClusteringContrastive Learning	CodeCode Available
Assessor-Guided Learning for Continual Environments	Mar 21, 2023	Continual LearningIncremental Learning	CodeCode Available
Out of Thin Air: Exploring Data-Free Adversarial Robustness Distillation	Mar 21, 2023	Adversarial RobustnessKnowledge Distillation	—Unverified
Knowledge Distillation from Multiple Foundation Models for End-to-End Speech Recognition	Mar 20, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
More From Less: Self-Supervised Knowledge Distillation for Routine Histopathology Data	Mar 19, 2023	Knowledge Distillation	—Unverified
DC-CCL: Device-Cloud Collaborative Controlled Learning for Large Vision Models	Mar 18, 2023	Knowledge Distillation	—Unverified
Confidence Attention and Generalization Enhanced Distillation for Continuous Video Domain Adaptation	Mar 18, 2023	Autonomous DrivingDomain Adaptation	—Unverified
An Empirical Study of Pre-trained Language Models in Simple Knowledge Graph Question Answering	Mar 18, 2023	Graph Question AnsweringKnowledge Distillation	CodeCode Available
Crowd Counting with Online Knowledge Learning	Mar 18, 2023	Crowd CountingEdge-computing	—Unverified
Whole-slide-imaging Cancer Metastases Detection and Localization with Limited Tumorous Data	Mar 18, 2023	Knowledge DistillationMedical Image Analysis	CodeCode Available
Distill n' Explain: explaining graph neural networks using simple surrogates	Mar 17, 2023	Knowledge Distillation	CodeCode Available
Neural Architecture Search for Effective Teacher-Student Knowledge Transfer in Language Models	Mar 16, 2023	CoLACPU	—Unverified
Towards a Smaller Student: Capacity Dynamic Distillation for Efficient Image Retrieval	Mar 16, 2023	Image RetrievalKnowledge Distillation	—Unverified
Knowledge Distillation for Adaptive MRI Prostate Segmentation Based on Limit-Trained Multi-Teacher Models	Mar 16, 2023	Knowledge DistillationMRI segmentation	—Unverified
DistillW2V2: A Small and Streaming Wav2vec 2.0 Based ASR Model	Mar 16, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Cross-resolution Face Recognition via Identity-Preserving Network and Knowledge Distillation	Mar 15, 2023	Face RecognitionKnowledge Distillation	—Unverified
Knowledge Distillation from Single to Multi Labels: an Empirical Study	Mar 15, 2023	Classificationimage-classification	CodeCode Available
MobileVOS: Real-Time Video Object Segmentation Contrastive Learning meets Knowledge Distillation	Mar 14, 2023	Contrastive LearningKnowledge Distillation	—Unverified
MetaMixer: A Regularization Strategy for Online Knowledge Distillation	Mar 14, 2023	Knowledge Distillation	—Unverified
Teacher-Student Knowledge Distillation for Radar Perception on Embedded Accelerators	Mar 14, 2023	Knowledge Distillationobject-detection	—Unverified
Feature-Rich Audio Model Inversion for Data-Free Knowledge Distillation Towards General Sound Classification	Mar 14, 2023	Data-free Knowledge DistillationKnowledge Distillation	—Unverified
Continuous sign language recognition based on cross-resolution knowledge distillation	Mar 13, 2023	Knowledge DistillationSign Language Recognition	—Unverified
Visual-Policy Learning through Multi-Camera View to Single-Camera View Knowledge Distillation for Robot Manipulation Tasks	Mar 13, 2023	Data AugmentationKnowledge Distillation	—Unverified
Knowledge Distillation for Efficient Sequences of Training Runs	Mar 11, 2023	Knowledge Distillation	—Unverified

Show:10 25 50

← PrevPage 53 of 85Next →

All datasets ImageNet CIFAR-100 COCO (Common Objects in Context)COCO 2017 val PASCAL VOC KITTI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ScaleKD (T:BEiT-L S:ViT-B/14)	Top-1 accuracy %	86.43	—	Unverified
2	ScaleKD (T:Swin-L S:ViT-B/16)	Top-1 accuracy %	85.53	—	Unverified
3	ScaleKD (T:Swin-L S:ViT-S/16)	Top-1 accuracy %	83.93	—	Unverified
4	ScaleKD (T:Swin-L S:Swin-T)	Top-1 accuracy %	83.8	—	Unverified
5	KD++(T: regnety-16GF S:ViT-B)	Top-1 accuracy %	83.6	—	Unverified
6	VkD (T:RegNety 160 S:DeiT-S)	Top-1 accuracy %	82.9	—	Unverified
7	SpectralKD (T:Swin-S S:Swin-T)	Top-1 accuracy %	82.7	—	Unverified
8	ScaleKD (T:Swin-L S:ResNet-50)	Top-1 accuracy %	82.55	—	Unverified
9	DiffKD (T:Swin-L S: Swin-T)	Top-1 accuracy %	82.5	—	Unverified
10	DIST (T: Swin-L S: Swin-T)	Top-1 accuracy %	82.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SRD (T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	79.86	—	Unverified
2	shufflenet-v2(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	78.76	—	Unverified
3	MV-MR (T: CLIP/ViT-B-16 S: resnet50)	Top-1 Accuracy (%)	78.6	—	Unverified
4	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	78.28	—	Unverified
5	resnet8x4 (T: resnet32x4 S: resnet8x4 [modified])	Top-1 Accuracy (%)	78.08	—	Unverified
6	ReviewKD++(T:resnet-32x4, S:shufflenet-v2)	Top-1 Accuracy (%)	77.93	—	Unverified
7	ReviewKD++(T:resnet-32x4, S:shufflenet-v1)	Top-1 Accuracy (%)	77.68	—	Unverified
8	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	77.5	—	Unverified
9	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.68	—	Unverified
10	resnet8x4 (T: resnet32x4 S: resnet8x4)	Top-1 Accuracy (%)	76.31	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	77.16	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	73.73	—	Unverified
3	ADLIK-Faster (T: Faster R-CNN vit-base S: Faster R-CNN deit-small)	box AP	47.6	—	Unverified
4	ADLIK-Mask (T: Mask R-CNN vit-base S: Mask R-CNN deit-small)	mask AP	42.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet50))	AP@0.5	61.8	—	Unverified
2	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(resnet18))	AP@0.5	57.96	—	Unverified
3	ReviewKD++(T: faster rcnn(resnet101), S:faster rcnn(mobilenet-v2))	AP@0.5	55.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LSHFM (T: ResNet101 S: ResNet50)	mAP	93.17	—	Unverified
2	LSHFM (T: ResNet101 S: MobileNetV2)	mAP	90.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TIE-KD (T: Adabins S: MobileNetV2)	RMSE	2.43	—	Unverified