SOTAVerified

Network Pruning

Network Pruning is a popular approach to reduce a heavy network to obtain a light-weight form by removing redundancy in the heavy network. In this approach, a complex over-parameterized network is first trained, then pruned based on come criterions, and finally fine-tuned to achieve comparable performance with reduced parameters.

Source: Ensemble Knowledge Distillation for Learning Improved and Efficient Networks

Papers

Showing 351400 of 534 papers

TitleStatusHype
To prune or not to prune : A chaos-causality approach to principled pruning of dense neural networks0
Automatic Block-wise Pruning with Auxiliary Gating Structures for Deep Convolutional Neural Networks0
NTP-INT: Network Traffic Prediction-Driven In-band Network Telemetry for High-load Switches0
Automated Model Compression by Jointly Applied Pruning and Quantization0
Data-Independent Neural Pruning via Coresets0
Towards Communication-Learning Trade-off for Federated Learning at the Network Edge0
Towards Compact and Robust Deep Neural Networks0
One-shot Network Pruning at Initialization with Discriminative Image Patches0
One-Shot Pruning of Recurrent Neural Networks by Jacobian Spectrum Evaluation0
On Iterative Neural Network Pruning, Reinitialization, and the Similarity of Masks0
Automated Channel Pruning with Learned Importance0
On the Decision Boundaries of Deep Neural Networks: A Tropical Geometry Perspective0
On the Decision Boundaries of Neural Networks: A Tropical Geometry Perspective0
On the Decision Boundaries of Neural Networks. A Tropical Geometry Perspective0
On-the-fly Network Pruning for Object Detection0
On the Landscape of Sparse Linear Networks0
On the use of local structural properties for improving the efficiency of hierarchical community detection methods0
Bypass Back-propagation: Optimization-based Structural Pruning for Large Language Models via Policy Gradient0
Optimization over Trained (and Sparse) Neural Networks: A Surrogate within a Surrogate0
Optimizing Learning Rate Schedules for Iterative Pruning of Deep Neural Networks0
Optimizing Memory Placement using Evolutionary Graph Reinforcement Learning0
Towards Efficient Deep Spiking Neural Networks Construction with Spiking Activity based Pruning0
Auto Graph Encoder-Decoder for Neural Network Pruning0
AutoAdversary: A Pixel Pruning Method for Sparse Adversarial Attack0
Towards Fairness-aware Adversarial Network Pruning0
Parameter Sharing with Network Pruning for Scalable Multi-Agent Deep Reinforcement Learning0
A rescaling-invariant Lipschitz bound based on path-metrics for modern ReLU network parameterizations0
A Unified Approach to Coreset Learning0
Performance Aware Convolutional Neural Network Channel Pruning for Embedded GPUs0
Personalized Federated Learning for Generative AI-Assisted Semantic Communications0
Picking the Underused Heads: A Network Pruning Perspective of Attention Head Selection for Fusing Dialogue Coreference Information0
Combining Multi-Objective Bayesian Optimization with Reinforcement Learning for TinyML0
Post-training deep neural network pruning via layer-wise calibration0
Why Does DARTS Miss the Target, and How Do We Aim to Fix It?0
Single-path Bit Sharing for Automatic Loss-aware Model Compression0
Privacy-preserving Learning via Deep Net Pruning0
Privacy Preserving Stochastic Channel-Based Federated Learning with Neural Network Pruning0
Probabilistic Modeling: Proving the Lottery Ticket Hypothesis in Spiking Neural Network0
Progressive Local Filter Pruning for Image Retrieval Acceleration0
Towards Lightweight Graph Neural Network Search with Curriculum Graph Sparsification0
Protective Self-Adaptive Pruning to Better Compress DNNs0
How much pre-training is enough to discover a good subnetwork?0
Prune Responsibly0
Pruning a neural network using Bayesian inference0
Attend Who is Weak: Pruning-assisted Medical Image Localization under Sophisticated and Implicit Imbalances0
Towards Lightweight Neural Animation : Exploration of Neural Network Pruning in Mixture of Experts-based Animation Models0
Towards thinner convolutional neural networks through Gradually Global Pruning0
Pruning CNN's with linear filter ensembles0
Pruning Compact ConvNets for Efficient Inference0
A Targeted Acceleration and Compression Framework for Low bit Neural Networks0
Show:102550
← PrevPage 8 of 11Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ResNet50-2.3 GFLOPsAccuracy78.79Unverified
2ResNet50-1.5 GFLOPsAccuracy78.07Unverified
3ResNet50 2.5 GFLOPSAccuracy78Unverified
4RegX-1.6GAccuracy77.97Unverified
5ResNet50 2.0 GFLOPSAccuracy77.7Unverified
6ResNet50-3G FLOPsAccuracy77.1Unverified
7ResNet50-2G FLOPsAccuracy76.4Unverified
8ResNet50-1G FLOPsAccuracy76.38Unverified
9TAS-pruned ResNet-50Accuracy76.2Unverified
10ResNet50Accuracy75.59Unverified
#ModelMetricClaimedVerifiedStatus
1FeatherTop-1 Accuracy76.93Unverified
2SpartanTop-1 Accuracy76.17Unverified
3ST-3Top-1 Accuracy76.03Unverified
4AC/DCTop-1 Accuracy75.64Unverified
5CSTop-1 Accuracy75.5Unverified
6ProbMaskTop-1 Accuracy74.68Unverified
7STRTop-1 Accuracy74.31Unverified
8DNWTop-1 Accuracy74Unverified
9GMPTop-1 Accuracy73.91Unverified
#ModelMetricClaimedVerifiedStatus
1+U-DML*Inference Time (ms)675.56Unverified
2DenseAccuracy79Unverified
3AC/DCAccuracy78.2Unverified
4Beta-RankAccuracy74.01Unverified
5TAS-pruned ResNet-110Accuracy73.16Unverified
#ModelMetricClaimedVerifiedStatus
1TAS-pruned ResNet-110Accuracy94.33Unverified
2ShuffleNet – QuantisedInference Time (ms)23.15Unverified
3AlexNet – QuantisedInference Time (ms)5.23Unverified
4MobileNet – QuantisedInference Time (ms)4.74Unverified
#ModelMetricClaimedVerifiedStatus
1FFN-ShapleyPrunedAvg #Steps12.05Unverified