SOTAVerified

Network Pruning

Network Pruning is a popular approach to reduce a heavy network to obtain a light-weight form by removing redundancy in the heavy network. In this approach, a complex over-parameterized network is first trained, then pruned based on come criterions, and finally fine-tuned to achieve comparable performance with reduced parameters.

Source: Ensemble Knowledge Distillation for Learning Improved and Efficient Networks

Papers

Showing 201225 of 534 papers

TitleStatusHype
ExPAN(N)D: Exploring Posits for Efficient Artificial Neural Network Design in FPGA-based Systems0
Improving Feature Attribution through Input-specific Network Pruning0
Importance Estimation with Random Gradient for Neural Network Pruning0
Analysing Neural Network Topologies: a Game Theoretic Approach0
Extract Local Inference Chains of Deep Neural Nets0
CAP: Context-Aware Pruning for Semantic-Segmentation0
Pruning-Aware Merging for Efficient Multitask Inference0
Discriminative Adversarial Unlearning0
Cascade Weight Shedding in Deep Neural Networks: Benefits and Pitfalls for Network Pruning0
AutoPruning for Deep Neural Network with Dynamic Channel Masking0
DIPNet: Efficiency Distillation and Iterative Pruning for Image Super-Resolution0
Accurate Neural Network Pruning Requires Rethinking Sparse Optimization0
FedMef: Towards Memory-efficient Federated Dynamic Pruning0
FedP3: Federated Personalized and Privacy-friendly Network Pruning under Model Heterogeneity0
A Main/Subsidiary Network Framework for Simplifying Binary Neural Network0
Distributed Pruning Towards Tiny Neural Networks in Federated Learning0
Automatic Sparse Connectivity Learning for Neural Networks0
Differential Privacy Meets Neural Network Pruning0
Adaptive Neural Connections for Sparsity Learning0
Finding Deep Local Optima Using Network Pruning0
Hyperflows: Pruning Reveals the Importance of Weights0
Improve Convolutional Neural Network Pruning by Maximizing Filter Variety0
Layer-adaptive Structured Pruning Guided by Latency0
Differentiable Network Pruning for Microcontrollers0
Differentiable Channel Sparsity Search via Weight Sharing within Filters0
Show:102550
← PrevPage 9 of 22Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ResNet50-2.3 GFLOPsAccuracy78.79Unverified
2ResNet50-1.5 GFLOPsAccuracy78.07Unverified
3ResNet50 2.5 GFLOPSAccuracy78Unverified
4RegX-1.6GAccuracy77.97Unverified
5ResNet50 2.0 GFLOPSAccuracy77.7Unverified
6ResNet50-3G FLOPsAccuracy77.1Unverified
7ResNet50-2G FLOPsAccuracy76.4Unverified
8ResNet50-1G FLOPsAccuracy76.38Unverified
9TAS-pruned ResNet-50Accuracy76.2Unverified
10ResNet50Accuracy75.59Unverified
#ModelMetricClaimedVerifiedStatus
1FeatherTop-1 Accuracy76.93Unverified
2SpartanTop-1 Accuracy76.17Unverified
3ST-3Top-1 Accuracy76.03Unverified
4AC/DCTop-1 Accuracy75.64Unverified
5CSTop-1 Accuracy75.5Unverified
6ProbMaskTop-1 Accuracy74.68Unverified
7STRTop-1 Accuracy74.31Unverified
8DNWTop-1 Accuracy74Unverified
9GMPTop-1 Accuracy73.91Unverified
#ModelMetricClaimedVerifiedStatus
1+U-DML*Inference Time (ms)675.56Unverified
2DenseAccuracy79Unverified
3AC/DCAccuracy78.2Unverified
4Beta-RankAccuracy74.01Unverified
5TAS-pruned ResNet-110Accuracy73.16Unverified
#ModelMetricClaimedVerifiedStatus
1TAS-pruned ResNet-110Accuracy94.33Unverified
2ShuffleNet – QuantisedInference Time (ms)23.15Unverified
3AlexNet – QuantisedInference Time (ms)5.23Unverified
4MobileNet – QuantisedInference Time (ms)4.74Unverified
#ModelMetricClaimedVerifiedStatus
1FFN-ShapleyPrunedAvg #Steps12.05Unverified