SOTAVerified

Network Pruning

Network Pruning is a popular approach to reduce a heavy network to obtain a light-weight form by removing redundancy in the heavy network. In this approach, a complex over-parameterized network is first trained, then pruned based on come criterions, and finally fine-tuned to achieve comparable performance with reduced parameters.

Source: Ensemble Knowledge Distillation for Learning Improved and Efficient Networks

Papers

Showing 401450 of 534 papers

TitleStatusHype
Achieving Adversarial Robustness via Sparsity0
A Study of Designing Compact Audio-Visual Wake Word Spotting System Based on Iterative Fine-Tuning in Neural Network Pruning0
Network Pruning via Annealing and Direct Sparsity Control0
TraNNsformer: Neural network transformation for memristive crossbar based neuromorphic system design0
Pruning has a disparate impact on model accuracy0
Troubleshooting Blind Image Quality Models in the Wild0
Pruning coupled with learning, ensembles of minimal neural networks, and future of XAI0
A relativistic extension of Hopfield neural networks via the mechanical analogy0
TSENOR: Highly-Efficient Algorithm for Finding Transposable N:M Sparse Masks0
Ultrafast Photorealistic Style Transfer via Neural Architecture Search0
Architecture-aware Network Pruning for Vision Quality Applications0
Understanding Diversity Based Neural Network Pruning in Teacher Student Setup0
PRUNIX: Non-Ideality Aware Convolutional Neural Network Pruning for Memristive Accelerators0
Putting 3D Spatially Sparse Networks on a Diet0
AP: Selective Activation for De-sparsifying Pruned Neural Networks0
NPAS: A Compiler-aware Framework of Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration0
A Probabilistic Approach to Neural Network Pruning0
Rapid Neural Pruning for Novel Datasets with Set-based Task-Adaptive Meta-Pruning0
Rapid Structural Pruning of Neural Networks with Set-based Task-Adaptive Meta-Pruning0
RARTS: An Efficient First-Order Relaxed Architecture Search Method0
Recall Distortion in Neural Network Pruning and the Undecayed Pruning Algorithm0
Recent Advances in Efficient Computation of Deep Convolutional Neural Networks0
Reducing the Model Order of Deep Neural Networks Using Information Theory0
Renormalized Sparse Neural Network Pruning0
A Novel Architecture Slimming Method for Network Pruning and Knowledge Distillation0
Representation and decomposition of functions in DAG-DNNs and structural network pruning0
Resource-Efficient Neural Networks for Embedded Systems0
Resource Efficient Neural Networks Using Hessian Based Pruning0
Rethinking Again the Value of Network Pruning -- A Dynamical Isometry Perspective0
"Understanding Robustness Lottery": A Geometric Visual Comparative Analysis of Neural Network Pruning Approaches0
Unveiling Invariances via Neural Network Pruning0
A "Network Pruning Network" Approach to Deep Model Compression0
RGP: Neural Network Pruning through Its Regular Graph Structure0
An End-to-End Network Pruning Pipeline with Sparsity Enforcement0
Accurate Neural Network Pruning Requires Rethinking Sparse Optimization0
Robustness-Reinforced Knowledge Distillation with Correlation Distance and Network Pruning0
Validating the Lottery Ticket Hypothesis with Inertial Manifold Theory0
Variational Convolutional Neural Network Pruning0
Samples on Thin Ice: Re-Evaluating Adversarial Pruning of Neural Networks0
A Multi-objective Complex Network Pruning Framework Based on Divide-and-conquer and Global Performance Impairment Ranking0
Scalable iterative pruning of large language and vision models using block coordinate descent0
Verification of Neural Networks: Enhancing Scalability through Pruning0
SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks0
Joint Device-Edge Inference over Wireless Links with Pruning0
Deep Network Pruning: A Comparative Study on CNNs in Face Recognition0
Deep Neural Network Pruning for Nuclei Instance Segmentation in Hematoxylin & Eosin-Stained Histological Images0
An Efficient Row-Based Sparse Fine-Tuning0
Defeating Misclassification Attacks Against Transfer Learning0
Dependency-Aware Semi-Structured Sparsity of GLU Variants in Large Language Models0
DBP: Discrimination Based Block-Level Pruning for Deep Model Acceleration0
Show:102550
← PrevPage 9 of 11Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ResNet50-2.3 GFLOPsAccuracy78.79Unverified
2ResNet50-1.5 GFLOPsAccuracy78.07Unverified
3ResNet50 2.5 GFLOPSAccuracy78Unverified
4RegX-1.6GAccuracy77.97Unverified
5ResNet50 2.0 GFLOPSAccuracy77.7Unverified
6ResNet50-3G FLOPsAccuracy77.1Unverified
7ResNet50-2G FLOPsAccuracy76.4Unverified
8ResNet50-1G FLOPsAccuracy76.38Unverified
9TAS-pruned ResNet-50Accuracy76.2Unverified
10ResNet50Accuracy75.59Unverified
#ModelMetricClaimedVerifiedStatus
1FeatherTop-1 Accuracy76.93Unverified
2SpartanTop-1 Accuracy76.17Unverified
3ST-3Top-1 Accuracy76.03Unverified
4AC/DCTop-1 Accuracy75.64Unverified
5CSTop-1 Accuracy75.5Unverified
6ProbMaskTop-1 Accuracy74.68Unverified
7STRTop-1 Accuracy74.31Unverified
8DNWTop-1 Accuracy74Unverified
9GMPTop-1 Accuracy73.91Unverified
#ModelMetricClaimedVerifiedStatus
1+U-DML*Inference Time (ms)675.56Unverified
2DenseAccuracy79Unverified
3AC/DCAccuracy78.2Unverified
4Beta-RankAccuracy74.01Unverified
5TAS-pruned ResNet-110Accuracy73.16Unverified
#ModelMetricClaimedVerifiedStatus
1TAS-pruned ResNet-110Accuracy94.33Unverified
2ShuffleNet – QuantisedInference Time (ms)23.15Unverified
3AlexNet – QuantisedInference Time (ms)5.23Unverified
4MobileNet – QuantisedInference Time (ms)4.74Unverified
#ModelMetricClaimedVerifiedStatus
1FFN-ShapleyPrunedAvg #Steps12.05Unverified