
Network Pruning

Network Pruning is a popular approach for reducing a heavy network to a lightweight form by removing redundancy. In this approach, a complex over-parameterized network is first trained, then pruned according to certain criteria, and finally fine-tuned to achieve comparable performance with fewer parameters.

Source: Ensemble Knowledge Distillation for Learning Improved and Efficient Networks
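
A minimal sketch of the train, prune, and fine-tune pipeline described above, using PyTorch's built-in magnitude pruning utilities (torch.nn.utils.prune). The toy model, the 50% global sparsity level, and the train_loader are illustrative assumptions, not details from the source; the pruning criterion here is plain L1 magnitude, whereas the papers listed below mostly differ in exactly this choice of criterion.

```python
import torch
import torch.nn as nn
import torch.nn.utils.prune as prune

# Illustrative over-parameterized network (assumption, not from the source).
model = nn.Sequential(nn.Linear(784, 256), nn.ReLU(), nn.Linear(256, 10))

def train(model, loader, epochs, lr=1e-3):
    """Standard supervised training loop; `loader` yields (inputs, labels)."""
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(epochs):
        for x, y in loader:
            opt.zero_grad()
            loss_fn(model(x), y).backward()
            opt.step()

# 1) Train the over-parameterized network.
# train(model, train_loader, epochs=10)   # train_loader is assumed to exist

# 2) Prune: zero out the 50% of weights with smallest L1 magnitude, globally
#    across all Linear layers (the sparsity level is an assumption).
to_prune = [(m, "weight") for m in model.modules() if isinstance(m, nn.Linear)]
prune.global_unstructured(to_prune, pruning_method=prune.L1Unstructured, amount=0.5)

# 3) Fine-tune the surviving weights; the pruning masks stay fixed during
#    this phase because PyTorch reparameterizes weight = weight_orig * mask.
# train(model, train_loader, epochs=2)

# Fold the masks into the weight tensors to make the pruning permanent.
for module, name in to_prune:
    prune.remove(module, name)

print("sparsity of first layer:", (model[0].weight == 0).float().mean().item())
```

This follows the paradigm described above rather than any single paper's method; structured variants prune whole filters or attention heads instead of individual weights, but the three-phase pipeline is the same.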

Papers

Showing 51–100 of 534 papers

Title | Status | Hype
Effective Subset Selection Through The Lens of Neural Network Pruning | - | 0
Finding Lottery Tickets in Vision Models via Data-driven Spectral Foresight Pruning | Code | 1
Deep Network Pruning: A Comparative Study on CNNs in Face Recognition | - | 0
A rescaling-invariant Lipschitz bound based on path-metrics for modern ReLU network parameterizations | - | 0
Dependency-Aware Semi-Structured Sparsity of GLU Variants in Large Language Models | - | 0
FedP3: Federated Personalized and Privacy-friendly Network Pruning under Model Heterogeneity | - | 0
Adversarial Robustness of Distilled and Pruned Deep Learning-based Wireless Classifiers | - | 0
Aggressive or Imperceptible, or Both: Network Pruning Assisted Hybrid Byzantines in Federated Learning | Code | 0
FedMef: Towards Memory-efficient Federated Dynamic Pruning | - | 0
Auto-Train-Once: Controller Network Guided Automatic Network Pruning from Scratch | Code | 1
LNPT: Label-free Network Pruning and Training | - | 0
Adversarial Fine-tuning of Compressed Neural Networks for Joint Improvement of Robustness and Efficiency | Code | 0
FALCON: FLOP-Aware Combinatorial Optimization for Neural Network Pruning | Code | 0
Pruning neural network models for gene regulatory dynamics using data and domain knowledge | Code | 0
Structurally Prune Anything: Any Architecture, Any Framework, Any Time | - | 0
SequentialAttention++ for Block Sparsification: Differentiable Pruning Meets Combinatorial Optimization | - | 0
What to Do When Your Discrete Optimization Is the Size of a Neural Network? | Code | 0
Discriminative Adversarial Unlearning | - | 0
Less is KEN: a Universal and Simple Non-Parametric Pruning Algorithm for Large Language Models | Code | 0
EPSD: Early Pruning with Self-Distillation for Efficient Model Compression | - | 0
Enhancing Scalability in Recommender Systems through Lottery Ticket Hypothesis and Knowledge Distillation-based Neural Network Pruning | - | 0
GD doesn't make the cut: Three ways that non-differentiability affects neural network training | - | 0
Device-Wise Federated Network Pruning | Code | 0
Block Pruning for Enhanced Efficiency in Convolutional Neural Networks | - | 0
Fluctuation-based Adaptive Structured Pruning for Large Language Models | Code | 1
Picking the Underused Heads: A Network Pruning Perspective of Attention Head Selection for Fusing Dialogue Coreference Information | - | 0
Neural Architecture Codesign for Fast Bragg Peak Analysis | - | 0
Accelerating Convolutional Neural Network Pruning via Spatial Aura Entropy | - | 0
An End-to-End Network Pruning Pipeline with Sparsity Enforcement | - | 0
Towards Higher Ranks via Adversarial Weight Pruning | Code | 0
LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS | Code | 2
Filter-Pruning of Lightweight Face Detectors Using a Geometric Median Criterion | Code | 1
Robustness-Reinforced Knowledge Distillation with Correlation Distance and Network Pruning | - | 0
Neural Network Pruning by Gradient Descent | Code | 0
Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks | - | 0
Data Augmentations in Deep Weight Spaces | - | 0
Do Localization Methods Actually Localize Memorized Data in LLMs? A Tale of Two Benchmarks | Code | 0
Beyond Size: How Gradients Shape Pruning Decisions in Large Language Models | Code | 1
Brain-Inspired Efficient Pruning: Exploiting Criticality in Spiking Neural Networks | - | 0
Efficient Model-Based Deep Learning via Network Pruning and Fine-Tuning | Code | 0
Importance Estimation with Random Gradient for Neural Network Pruning | - | 0
SparseByteNN: A Novel Mobile Inference Acceleration Framework Based on Fine-Grained Group Sparsity | Code | 0
Linear Mode Connectivity in Sparse Neural Networks | - | 0
GraFT: Gradual Fusion Transformer for Multimodal Re-Identification | - | 0
Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLMs | Code | 1
Samples on Thin Ice: Re-Evaluating Adversarial Pruning of Neural Networks | - | 0
Filter Pruning For CNN With Enhanced Linear Representation Redundancy | Code | 0
Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity | Code | 1
SWAP: Sparse Entropic Wasserstein Regression for Robust Network Pruning | Code | 0
Feather: An Elegant Solution to Effective DNN Sparsification | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ResNet50-2.3 GFLOPs | Accuracy | 78.79 | - | Unverified
2 | ResNet50-1.5 GFLOPs | Accuracy | 78.07 | - | Unverified
3 | ResNet50 2.5 GFLOPS | Accuracy | 78 | - | Unverified
4 | RegX-1.6G | Accuracy | 77.97 | - | Unverified
5 | ResNet50 2.0 GFLOPS | Accuracy | 77.7 | - | Unverified
6 | ResNet50-3G FLOPs | Accuracy | 77.1 | - | Unverified
7 | ResNet50-2G FLOPs | Accuracy | 76.4 | - | Unverified
8 | ResNet50-1G FLOPs | Accuracy | 76.38 | - | Unverified
9 | TAS-pruned ResNet-50 | Accuracy | 76.2 | - | Unverified
10 | ResNet50 | Accuracy | 75.59 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Feather | Top-1 Accuracy | 76.93 | - | Unverified
2 | Spartan | Top-1 Accuracy | 76.17 | - | Unverified
3 | ST-3 | Top-1 Accuracy | 76.03 | - | Unverified
4 | AC/DC | Top-1 Accuracy | 75.64 | - | Unverified
5 | CS | Top-1 Accuracy | 75.5 | - | Unverified
6 | ProbMask | Top-1 Accuracy | 74.68 | - | Unverified
7 | STR | Top-1 Accuracy | 74.31 | - | Unverified
8 | DNW | Top-1 Accuracy | 74 | - | Unverified
9 | GMP | Top-1 Accuracy | 73.91 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | +U-DML* | Inference Time (ms) | 675.56 | - | Unverified
2 | Dense | Accuracy | 79 | - | Unverified
3 | AC/DC | Accuracy | 78.2 | - | Unverified
4 | Beta-Rank | Accuracy | 74.01 | - | Unverified
5 | TAS-pruned ResNet-110 | Accuracy | 73.16 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TAS-pruned ResNet-110 | Accuracy | 94.33 | - | Unverified
2 | ShuffleNet – Quantised | Inference Time (ms) | 23.15 | - | Unverified
3 | AlexNet – Quantised | Inference Time (ms) | 5.23 | - | Unverified
4 | MobileNet – Quantised | Inference Time (ms) | 4.74 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | FFN-ShapleyPruned | Avg #Steps | 12.05 | - | Unverified