Model Compression

Model Compression is an actively pursued area of research over the last few years with the goal of deploying state-of-the-art deep networks in low-power and resource limited devices without significant drop in accuracy. Parameter pruning, low-rank factorization and weight quantization are some of the proposed methods to compress the size of deep networks.

Source: KD-MRI: A knowledge distillation framework for image reconstruction and image restoration in MRI workflow

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1001–1025 of 1356 papers

Title	Date	Tasks	Status
Task-Agnostic and Adaptive-Size BERT Compression	Jan 1, 2021	Language ModellingModel Compression	—Unverified
A Half-Space Stochastic Projected Gradient Method for Group Sparsity Regularization	Jan 1, 2021	compressed sensingfeature selection	—Unverified
BinaryBERT: Pushing the Limit of BERT Quantization	Dec 31, 2020	BinarizationModel Compression	—Unverified
Towards Zero-Shot Knowledge Distillation for Natural Language Processing	Dec 31, 2020	Knowledge DistillationModel Compression	—Unverified
Enabling Retrain-free Deep Neural Network Pruning using Surrogate Lagrangian Relaxation	Dec 18, 2020	image-classificationImage Classification	—Unverified
Provable Benefits of Overparameterization in Model Compression: From Double Descent to Pruning Neural Networks	Dec 16, 2020	Model Compression	—Unverified
Distilling Optimal Neural Networks: Rapid Search in Diverse Spaces	Dec 16, 2020	GPUKnowledge Distillation	—Unverified
Wasserstein Contrastive Representation Distillation	Dec 15, 2020	Contrastive LearningKnowledge Distillation	—Unverified
Reinforced Multi-Teacher Selection for Knowledge Distillation	Dec 11, 2020	GPUKnowledge Distillation	—Unverified
Large-Scale Generative Data-Free Distillation	Dec 10, 2020	Knowledge DistillationModel Compression	—Unverified
Inferring ECG from PPG for Continuous Cardiac Monitoring Using Lightweight Neural Network	Dec 9, 2020	Model Compression	—Unverified
Mix and Match: A Novel FPGA-Centric Deep Neural Network Quantization Framework	Dec 8, 2020	Edge-computingModel Compression	—Unverified
Model Compression Using Optimal Transport	Dec 7, 2020	image-classificationImage Classification	—Unverified
Multi-head Knowledge Distillation for Model Compression	Dec 5, 2020	image-classificationImage Classification	—Unverified
Meta-KD: A Meta Knowledge Distillation Framework for Language Model Compression across Domains	Dec 2, 2020	Knowledge DistillationLanguage Modeling	—Unverified
Compressing Pre-trained Language Models by Matrix Decomposition	Dec 1, 2020	Model Compression	—Unverified
Self-Supervised Generative Adversarial Compression	Dec 1, 2020	image-classificationImage Classification	—Unverified
NPAS: A Compiler-aware Framework of Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration	Dec 1, 2020	Bayesian OptimizationCode Generation	—Unverified
Edge Deep Learning for Neural Implants	Dec 1, 2020	Deep LearningEEG	—Unverified
Reverse-engineering recurrent neural network solutions to a hierarchical inference task for mice	Dec 1, 2020	Knowledge DistillationModel Compression	—Unverified
Extreme Model Compression for On-device Natural Language Understanding	Nov 30, 2020	Model CompressionNatural Language Understanding	—Unverified
A Selective Survey on Versatile Knowledge Distillation Paradigm for Neural Network Models	Nov 30, 2020	Knowledge DistillationModel Compression	—Unverified
Context-aware deep model compression for edge cloud computing	Nov 29, 2020	Cloud ComputingImage Classification	—Unverified
Bringing AI To Edge: From Deep Learning's Perspective	Nov 25, 2020	Deep LearningEdge-computing	—Unverified
Auto Graph Encoder-Decoder for Neural Network Pruning	Nov 25, 2020	DecoderModel Compression	—Unverified

Show:10 25 50

← PrevPage 41 of 55Next →

All datasets ImageNet QNLI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ADLIK-MO-ResNet50+W4A4	Top-1	77.88	—	Unverified
2	ADLIK-MO-ResNet50+W3A4	Top-1	77.34	—	Unverified
3	ResNet-18 + 4bit-1dim model compression using DKM	Top-1	70.52	—	Unverified
4	MobileNet-v1 + 4bit-1dim model compression using DKM	Top-1	69.63	—	Unverified
5	ResNet-18 + 2bit-1dim model compression using DKM	Top-1	68.63	—	Unverified
6	MobileNet-v1 + 2bit-1dim model compression using DKM	Top-1	67.62	—	Unverified
7	ResNet-18 + 4bit-4dim model compression using DKM	Top-1	66.1	—	Unverified
8	ResNet-18 + 2bit-2dim model compression using DKM	Top-1	64.7	—	Unverified
9	MobileNet-v1 + 4bit-4dim model compression using DKM	Top-1	61.4	—	Unverified
10	ResNet-18 + 1bit-1dim model compression using DKM	Top-1	59.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MobileBERT + 2bit-1dim model compression using DKM	Accuracy	82.13	—	Unverified
2	MobileBERT + 1bit-1dim model compression using DKM	Accuracy	63.17	—	Unverified