SOTAVerified

Network Pruning

Network Pruning is a popular approach to reduce a heavy network to obtain a light-weight form by removing redundancy in the heavy network. In this approach, a complex over-parameterized network is first trained, then pruned based on come criterions, and finally fine-tuned to achieve comparable performance with reduced parameters.

Source: Ensemble Knowledge Distillation for Learning Improved and Efficient Networks

Papers

Showing 151200 of 534 papers

TitleStatusHype
Towards Efficient Deep Spiking Neural Networks Construction with Spiking Activity based Pruning0
Effective Subset Selection Through The Lens of Neural Network Pruning0
Deep Network Pruning: A Comparative Study on CNNs in Face Recognition0
A rescaling-invariant Lipschitz bound based on path-metrics for modern ReLU network parameterizations0
Dependency-Aware Semi-Structured Sparsity of GLU Variants in Large Language Models0
FedP3: Federated Personalized and Privacy-friendly Network Pruning under Model Heterogeneity0
Adversarial Robustness of Distilled and Pruned Deep Learning-based Wireless Classifiers0
Aggressive or Imperceptible, or Both: Network Pruning Assisted Hybrid Byzantines in Federated LearningCode0
FedMef: Towards Memory-efficient Federated Dynamic Pruning0
LNPT: Label-free Network Pruning and Training0
Adversarial Fine-tuning of Compressed Neural Networks for Joint Improvement of Robustness and EfficiencyCode0
FALCON: FLOP-Aware Combinatorial Optimization for Neural Network PruningCode0
Pruning neural network models for gene regulatory dynamics using data and domain knowledgeCode0
Structurally Prune Anything: Any Architecture, Any Framework, Any Time0
SequentialAttention++ for Block Sparsification: Differentiable Pruning Meets Combinatorial Optimization0
What to Do When Your Discrete Optimization Is the Size of a Neural Network?Code0
Discriminative Adversarial Unlearning0
Less is KEN: a Universal and Simple Non-Parametric Pruning Algorithm for Large Language ModelsCode0
EPSD: Early Pruning with Self-Distillation for Efficient Model Compression0
Enhancing Scalability in Recommender Systems through Lottery Ticket Hypothesis and Knowledge Distillation-based Neural Network Pruning0
GD doesn't make the cut: Three ways that non-differentiability affects neural network training0
Device-Wise Federated Network PruningCode0
Block Pruning for Enhanced Efficiency in Convolutional Neural Networks0
Picking the Underused Heads: A Network Pruning Perspective of Attention Head Selection for Fusing Dialogue Coreference Information0
Neural Architecture Codesign for Fast Bragg Peak Analysis0
Accelerating Convolutional Neural Network Pruning via Spatial Aura Entropy0
An End-to-End Network Pruning Pipeline with Sparsity Enforcement0
Towards Higher Ranks via Adversarial Weight PruningCode0
Robustness-Reinforced Knowledge Distillation with Correlation Distance and Network Pruning0
Neural Network Pruning by Gradient DescentCode0
Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks0
Do Localization Methods Actually Localize Memorized Data in LLMs? A Tale of Two BenchmarksCode0
Data Augmentations in Deep Weight Spaces0
Brain-Inspired Efficient Pruning: Exploiting Criticality in Spiking Neural Networks0
Efficient Model-Based Deep Learning via Network Pruning and Fine-TuningCode0
Importance Estimation with Random Gradient for Neural Network Pruning0
SparseByteNN: A Novel Mobile Inference Acceleration Framework Based on Fine-Grained Group SparsityCode0
Linear Mode Connectivity in Sparse Neural Networks0
GraFT: Gradual Fusion Transformer for Multimodal Re-Identification0
Samples on Thin Ice: Re-Evaluating Adversarial Pruning of Neural Networks0
Filter Pruning For CNN With Enhanced Linear Representation RedundancyCode0
SWAP: Sparse Entropic Wasserstein Regression for Robust Network PruningCode0
Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of A Multilingual ASR Model0
Unveiling Invariances via Neural Network Pruning0
Ensemble Mask Networks0
EDAC: Efficient Deployment of Audio Classification Models For COVID-19 DetectionCode0
Adaptive Consensus: A network pruning approach for decentralized optimization0
To prune or not to prune : A chaos-causality approach to principled pruning of dense neural networks0
Pruning a neural network using Bayesian inference0
Accurate Neural Network Pruning Requires Rethinking Sparse Optimization0
Show:102550
← PrevPage 4 of 11Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ResNet50-2.3 GFLOPsAccuracy78.79Unverified
2ResNet50-1.5 GFLOPsAccuracy78.07Unverified
3ResNet50 2.5 GFLOPSAccuracy78Unverified
4RegX-1.6GAccuracy77.97Unverified
5ResNet50 2.0 GFLOPSAccuracy77.7Unverified
6ResNet50-3G FLOPsAccuracy77.1Unverified
7ResNet50-2G FLOPsAccuracy76.4Unverified
8ResNet50-1G FLOPsAccuracy76.38Unverified
9TAS-pruned ResNet-50Accuracy76.2Unverified
10ResNet50Accuracy75.59Unverified
#ModelMetricClaimedVerifiedStatus
1FeatherTop-1 Accuracy76.93Unverified
2SpartanTop-1 Accuracy76.17Unverified
3ST-3Top-1 Accuracy76.03Unverified
4AC/DCTop-1 Accuracy75.64Unverified
5CSTop-1 Accuracy75.5Unverified
6ProbMaskTop-1 Accuracy74.68Unverified
7STRTop-1 Accuracy74.31Unverified
8DNWTop-1 Accuracy74Unverified
9GMPTop-1 Accuracy73.91Unverified
#ModelMetricClaimedVerifiedStatus
1+U-DML*Inference Time (ms)675.56Unverified
2DenseAccuracy79Unverified
3AC/DCAccuracy78.2Unverified
4Beta-RankAccuracy74.01Unverified
5TAS-pruned ResNet-110Accuracy73.16Unverified
#ModelMetricClaimedVerifiedStatus
1TAS-pruned ResNet-110Accuracy94.33Unverified
2ShuffleNet – QuantisedInference Time (ms)23.15Unverified
3AlexNet – QuantisedInference Time (ms)5.23Unverified
4MobileNet – QuantisedInference Time (ms)4.74Unverified
#ModelMetricClaimedVerifiedStatus
1FFN-ShapleyPrunedAvg #Steps12.05Unverified