SOTAVerified

Network Pruning

Network Pruning is a popular approach for reducing a heavy network to a lightweight form by removing its redundancy. In this approach, a complex over-parameterized network is first trained, then pruned according to some criteria, and finally fine-tuned to achieve comparable performance with fewer parameters.

Source: Ensemble Knowledge Distillation for Learning Improved and Efficient Networks
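The prune and fine-tune steps described above can be sketched in a few lines. The description only says the network is pruned "according to some criteria", so this sketch assumes one common choice, weight magnitude. The function names `magnitude_prune` and `masked_update`, the toy weight vector, and the constant gradients are all hypothetical and chosen purely for illustration.

```python
def magnitude_prune(weights, sparsity):
    """Zero out the smallest-magnitude fraction `sparsity` of the weights.

    Returns (pruned_weights, mask), where mask[i] is 1 for a kept weight
    and 0 for a pruned one. Magnitude is an assumed pruning criterion.
    """
    if not 0.0 <= sparsity <= 1.0:
        raise ValueError("sparsity must be in [0, 1]")
    n_prune = int(len(weights) * sparsity)
    # Indices sorted by ascending magnitude; the first n_prune are removed.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned_idx = set(order[:n_prune])
    mask = [0 if i in pruned_idx else 1 for i in range(len(weights))]
    pruned = [w if m else 0.0 for w, m in zip(weights, mask)]
    return pruned, mask


def masked_update(weights, grads, mask, lr=0.1):
    """One fine-tuning step of gradient descent that keeps pruned weights at zero."""
    return [(w - lr * g) if m else 0.0 for w, g, m in zip(weights, grads, mask)]


# Toy "trained" weights standing in for an over-parameterized layer.
weights = [0.8, -0.05, 0.3, 0.01, -0.6, 0.12]

# Prune half of the weights, then take one masked fine-tuning step
# using placeholder gradients.
pruned, mask = magnitude_prune(weights, sparsity=0.5)
updated = masked_update(pruned, grads=[1.0] * len(pruned), mask=mask, lr=0.1)

print(mask)
print(pruned)
print(updated)
```

Applying the mask during fine-tuning is what keeps the removed parameters at zero; without it, gradient updates would regrow the pruned weights and the parameter reduction would be lost.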

Papers

Showing 251-300 of 534 papers

Title | Status | Hype
Low-Rank Prune-And-Factorize for Language Model Compression | | 0
MARS: Multi-macro Architecture SRAM CIM-Based Accelerator with Co-designed Compressed Neural Networks | | 0
MaskConvNet: Training Efficient ConvNets from Scratch via Budget-constrained Filter Pruning | | 0
Maximum Redundancy Pruning: A Principle-Driven Layerwise Sparsity Allocation for LLMs | | 0
Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks | | 0
Mini-batch Coresets for Memory-efficient Training of Large Language Models | | 0
Meta-Learning with Network Pruning | | 0
Meta-Learning with Network Pruning for Overfitting Reduction | | 0
Model Compression Methods for YOLOv5: A Review | | 0
Model Pruning Enables Localized and Efficient Federated Learning for Yield Forecasting and Data Sharing | | 0
Multi-Agent Actor-Critic with Harmonic Annealing Pruning for Dynamic Spectrum Access Systems | | 0
Multistage Pruning of CNN Based ECG Classifiers for Edge Devices | | 0
Multi-Task Network Pruning and Embedded Optimization for Real-time Deployment in ADAS | | 0
Mutual Information Preserving Neural Network Pruning | | 0
N2NSkip: Learning Highly Sparse Networks using Neuron-to-Neuron Skip Connections | | 0
NeST: A Neural Network Synthesis Tool Based on a Grow-and-Prune Paradigm | | 0
Network Automatic Pruning: Start NAP and Take a Nap | | 0
Network Pruning by Greedy Subnetwork Selection | | 0
Network Pruning for Low-Rank Binary Index | | 0
Network Pruning for Low-Rank Binary Indexing | | 0
Network Pruning Optimization by Simulated Annealing Algorithm | | 0
Network Pruning Spaces | | 0
Neural Architecture Codesign for Fast Bragg Peak Analysis | | 0
Neural Network Compression via Effective Filter Analysis and Hierarchical Pruning | | 0
Neural Network Optimization for Reinforcement Learning Tasks Using Sparse Computations | | 0
Neural Network Pruning as Spectrum Preserving Process | | 0
Neural Network Pruning by Cooperative Coevolution | | 0
Neural Network Pruning for Real-time Polyp Segmentation | | 0
Neural Network Pruning Through Constrained Reinforcement Learning | | 0
NISP: Pruning Networks using Neuron Importance Score Propagation | | 0
NTP-INT: Network Traffic Prediction-Driven In-band Network Telemetry for High-load Switches | | 0
Data-Independent Neural Pruning via Coresets | | 0
One-shot Network Pruning at Initialization with Discriminative Image Patches | | 0
One-Shot Pruning of Recurrent Neural Networks by Jacobian Spectrum Evaluation | | 0
On Iterative Neural Network Pruning, Reinitialization, and the Similarity of Masks | | 0
On the Decision Boundaries of Deep Neural Networks: A Tropical Geometry Perspective | | 0
On the Decision Boundaries of Neural Networks: A Tropical Geometry Perspective | | 0
On the Decision Boundaries of Neural Networks. A Tropical Geometry Perspective | | 0
On-the-fly Network Pruning for Object Detection | | 0
On the Landscape of Sparse Linear Networks | | 0
On the use of local structural properties for improving the efficiency of hierarchical community detection methods | | 0
Bypass Back-propagation: Optimization-based Structural Pruning for Large Language Models via Policy Gradient | | 0
Optimization over Trained (and Sparse) Neural Networks: A Surrogate within a Surrogate | | 0
Optimizing Learning Rate Schedules for Iterative Pruning of Deep Neural Networks | | 0
Optimizing Memory Placement using Evolutionary Graph Reinforcement Learning | | 0
Parameter Sharing with Network Pruning for Scalable Multi-Agent Deep Reinforcement Learning | | 0
A rescaling-invariant Lipschitz bound based on path-metrics for modern ReLU network parameterizations | | 0
Performance Aware Convolutional Neural Network Channel Pruning for Embedded GPUs | | 0
Personalized Federated Learning for Generative AI-Assisted Semantic Communications | | 0
Picking the Underused Heads: A Network Pruning Perspective of Attention Head Selection for Fusing Dialogue Coreference Information | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ResNet50-2.3 GFLOPs | Accuracy | 78.79 | | Unverified
2 | ResNet50-1.5 GFLOPs | Accuracy | 78.07 | | Unverified
3 | ResNet50 2.5 GFLOPS | Accuracy | 78 | | Unverified
4 | RegX-1.6G | Accuracy | 77.97 | | Unverified
5 | ResNet50 2.0 GFLOPS | Accuracy | 77.7 | | Unverified
6 | ResNet50-3G FLOPs | Accuracy | 77.1 | | Unverified
7 | ResNet50-2G FLOPs | Accuracy | 76.4 | | Unverified
8 | ResNet50-1G FLOPs | Accuracy | 76.38 | | Unverified
9 | TAS-pruned ResNet-50 | Accuracy | 76.2 | | Unverified
10 | ResNet50 | Accuracy | 75.59 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Feather | Top-1 Accuracy | 76.93 | | Unverified
2 | Spartan | Top-1 Accuracy | 76.17 | | Unverified
3 | ST-3 | Top-1 Accuracy | 76.03 | | Unverified
4 | AC/DC | Top-1 Accuracy | 75.64 | | Unverified
5 | CS | Top-1 Accuracy | 75.5 | | Unverified
6 | ProbMask | Top-1 Accuracy | 74.68 | | Unverified
7 | STR | Top-1 Accuracy | 74.31 | | Unverified
8 | DNW | Top-1 Accuracy | 74 | | Unverified
9 | GMP | Top-1 Accuracy | 73.91 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | +U-DML* | Inference Time (ms) | 675.56 | | Unverified
2 | Dense | Accuracy | 79 | | Unverified
3 | AC/DC | Accuracy | 78.2 | | Unverified
4 | Beta-Rank | Accuracy | 74.01 | | Unverified
5 | TAS-pruned ResNet-110 | Accuracy | 73.16 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TAS-pruned ResNet-110 | Accuracy | 94.33 | | Unverified
2 | ShuffleNet – Quantised | Inference Time (ms) | 23.15 | | Unverified
3 | AlexNet – Quantised | Inference Time (ms) | 5.23 | | Unverified
4 | MobileNet – Quantised | Inference Time (ms) | 4.74 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | FFN-ShapleyPruned | Avg #Steps | 12.05 | | Unverified