
Network Pruning

Network Pruning is a popular approach for reducing a heavy network to a lightweight form by removing its redundancy. In this approach, a complex over-parameterized network is first trained, then pruned according to some criterion, and finally fine-tuned to achieve comparable performance with far fewer parameters.

Source: Ensemble Knowledge Distillation for Learning Improved and Efficient Networks
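
The train-prune-fine-tune pipeline described above can be sketched in a few lines of code. The following is a minimal illustration using PyTorch's magnitude-pruning utilities (torch.nn.utils.prune); the toy model, synthetic data, and 50% sparsity target are illustrative assumptions rather than details drawn from any paper listed below.

# Minimal sketch of the train -> prune -> fine-tune pipeline.
# Toy model, synthetic data, and sparsity level are assumptions for illustration.
import torch
import torch.nn as nn
import torch.nn.utils.prune as prune

torch.manual_seed(0)

# Toy over-parameterized network and synthetic classification data.
model = nn.Sequential(nn.Linear(64, 256), nn.ReLU(), nn.Linear(256, 10))
x, y = torch.randn(512, 64), torch.randint(0, 10, (512,))
loss_fn = nn.CrossEntropyLoss()

def train(model, epochs):
    # Plain SGD loop, used for both initial training and fine-tuning.
    opt = torch.optim.SGD(model.parameters(), lr=0.1)
    for _ in range(epochs):
        opt.zero_grad()
        loss_fn(model(x), y).backward()
        opt.step()

# 1) Train the over-parameterized network.
train(model, epochs=20)

# 2) Prune: zero out the 50% of weights with the smallest L1 magnitude in each
#    Linear layer (one possible criterion; structured/channel criteria also exist).
for module in model.modules():
    if isinstance(module, nn.Linear):
        prune.l1_unstructured(module, name="weight", amount=0.5)

# 3) Fine-tune the pruned network, then make the sparsity permanent.
train(model, epochs=5)
for module in model.modules():
    if isinstance(module, nn.Linear):
        prune.remove(module, "weight")

sparsity = (model[0].weight == 0).float().mean().item()
print(f"Layer-1 sparsity after pruning: {sparsity:.0%}")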

Papers

Showing 501–525 of 534 papers

Title | Status | Hype
Clustering Convolutional Kernels to Compress Deep Neural Networks | Code | 0
On-Device Neural Language Model Based Word Prediction | Code | 0
Crossbar-aware neural network pruning | – | 0
Fast Convex Pruning of Deep Neural Networks | Code | 0
“Learning-Compression” Algorithms for Neural Net Pruning | – | 0
CLIP-Q: Deep Network Compression Learning by In-Parallel Pruning-Quantization | – | 0
Channel Gating Neural Networks | Code | 1
Structured Pruning for Efficient ConvNets via Incremental Regularization | Code | 0
A Systematic DNN Weight Pruning Framework using Alternating Direction Method of Multipliers | Code | 0
A Quantization-Friendly Separable Convolution for MobileNets | Code | 0
The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks | Code | 1
Learning Sparse Structured Ensembles with SG-MCMC and Network Pruning | – | 0
Building Efficient ConvNets using Redundant Feature Pruning | Code | 0
Efficient Sparse-Winograd Convolutional Neural Networks | Code | 0
Attention-Based Guided Structured Sparsity of Deep Neural Networks | Code | 0
Learning Compact Neural Networks with Regularization | – | 0
Recent Advances in Efficient Computation of Deep Convolutional Neural Networks | – | 0
A relativistic extension of Hopfield neural networks via the mechanical analogy | – | 0
Structured Deep Neural Network Pruning via Matrix Pivoting | – | 0
NISP: Pruning Networks using Neuron Importance Score Propagation | – | 0
PackNet: Adding Multiple Tasks to a Single Network by Iterative Pruning | Code | 1
NeST: A Neural Network Synthesis Tool Based on a Grow-and-Prune Paradigm | – | 0
TraNNsformer: Neural network transformation for memristive crossbar based neuromorphic system design | – | 0
Fine-Pruning: Joint Fine-Tuning and Compression of a Convolutional Network with Bayesian Optimization | – | 0
SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks | – | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ResNet50-2.3 GFLOPs | Accuracy | 78.79 | – | Unverified
2 | ResNet50-1.5 GFLOPs | Accuracy | 78.07 | – | Unverified
3 | ResNet50 2.5 GFLOPS | Accuracy | 78 | – | Unverified
4 | RegX-1.6G | Accuracy | 77.97 | – | Unverified
5 | ResNet50 2.0 GFLOPS | Accuracy | 77.7 | – | Unverified
6 | ResNet50-3G FLOPs | Accuracy | 77.1 | – | Unverified
7 | ResNet50-2G FLOPs | Accuracy | 76.4 | – | Unverified
8 | ResNet50-1G FLOPs | Accuracy | 76.38 | – | Unverified
9 | TAS-pruned ResNet-50 | Accuracy | 76.2 | – | Unverified
10 | ResNet50 | Accuracy | 75.59 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Feather | Top-1 Accuracy | 76.93 | – | Unverified
2 | Spartan | Top-1 Accuracy | 76.17 | – | Unverified
3 | ST-3 | Top-1 Accuracy | 76.03 | – | Unverified
4 | AC/DC | Top-1 Accuracy | 75.64 | – | Unverified
5 | CS | Top-1 Accuracy | 75.5 | – | Unverified
6 | ProbMask | Top-1 Accuracy | 74.68 | – | Unverified
7 | STR | Top-1 Accuracy | 74.31 | – | Unverified
8 | DNW | Top-1 Accuracy | 74 | – | Unverified
9 | GMP | Top-1 Accuracy | 73.91 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | +U-DML* | Inference Time (ms) | 675.56 | – | Unverified
2 | Dense | Accuracy | 79 | – | Unverified
3 | AC/DC | Accuracy | 78.2 | – | Unverified
4 | Beta-Rank | Accuracy | 74.01 | – | Unverified
5 | TAS-pruned ResNet-110 | Accuracy | 73.16 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TAS-pruned ResNet-110 | Accuracy | 94.33 | – | Unverified
2 | ShuffleNet – Quantised | Inference Time (ms) | 23.15 | – | Unverified
3 | AlexNet – Quantised | Inference Time (ms) | 5.23 | – | Unverified
4 | MobileNet – Quantised | Inference Time (ms) | 4.74 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | FFN-ShapleyPruned | Avg #Steps | 12.05 | – | Unverified