
Network Pruning

Network Pruning is a popular approach for reducing a heavy network to a lightweight form by removing its redundancy. In this approach, a complex over-parameterized network is first trained, then pruned according to certain criteria, and finally fine-tuned to achieve comparable performance with a reduced parameter count.

Source: Ensemble Knowledge Distillation for Learning Improved and Efficient Networks
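As a concrete illustration of this train-prune-fine-tune pipeline, the sketch below uses layer-wise L1 magnitude pruning, one common criterion, via PyTorch's torch.nn.utils.prune. The toy model, synthetic data, 50% sparsity level, and step counts are placeholder assumptions for illustration, not details from the cited paper.

```python
import torch
import torch.nn as nn
import torch.nn.utils.prune as prune

torch.manual_seed(0)

# Toy over-parameterized network and synthetic data (placeholders).
model = nn.Sequential(nn.Linear(20, 64), nn.ReLU(), nn.Linear(64, 2))
x, y = torch.randn(256, 20), torch.randint(0, 2, (256,))
loss_fn = nn.CrossEntropyLoss()

def fit(steps, lr=0.1):
    """Plain SGD loop, used for both initial training and fine-tuning."""
    opt = torch.optim.SGD(model.parameters(), lr=lr)
    for _ in range(steps):
        opt.zero_grad()
        loss_fn(model(x), y).backward()
        opt.step()

fit(200)  # 1. train the dense, over-parameterized network

# 2. prune: mask out the 50% of weights with smallest L1 magnitude, per layer
for module in model.modules():
    if isinstance(module, nn.Linear):
        prune.l1_unstructured(module, name="weight", amount=0.5)

fit(100, lr=0.01)  # 3. fine-tune; the mask keeps pruned weights at zero

# Fold the masks into the weight tensors to make the sparsity permanent.
for module in model.modules():
    if isinstance(module, nn.Linear):
        prune.remove(module, "weight")

sparsity = (model[0].weight == 0).float().mean().item()
print(f"layer-0 sparsity: {sparsity:.0%}")
```

Because l1_unstructured reparameterizes each weight as weight_orig * mask, gradients of pruned entries are zeroed during fine-tuning, so the pruned weights cannot be revived; prune.remove then folds the mask back into a plain weight tensor.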

Papers

Showing 76–100 of 534 papers

Title | Status | Hype
Picking the Underused Heads: A Network Pruning Perspective of Attention Head Selection for Fusing Dialogue Coreference Information |  | 0
Neural Architecture Codesign for Fast Bragg Peak Analysis |  | 0
Accelerating Convolutional Neural Network Pruning via Spatial Aura Entropy |  | 0
An End-to-End Network Pruning Pipeline with Sparsity Enforcement |  | 0
Towards Higher Ranks via Adversarial Weight Pruning | Code | 0
LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS | Code | 2
Filter-Pruning of Lightweight Face Detectors Using a Geometric Median Criterion | Code | 1
Robustness-Reinforced Knowledge Distillation with Correlation Distance and Network Pruning |  | 0
Neural Network Pruning by Gradient Descent | Code | 0
Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks |  | 0
Data Augmentations in Deep Weight Spaces |  | 0
Do Localization Methods Actually Localize Memorized Data in LLMs? A Tale of Two Benchmarks | Code | 0
Beyond Size: How Gradients Shape Pruning Decisions in Large Language Models | Code | 1
Brain-Inspired Efficient Pruning: Exploiting Criticality in Spiking Neural Networks |  | 0
Efficient Model-Based Deep Learning via Network Pruning and Fine-Tuning | Code | 0
Importance Estimation with Random Gradient for Neural Network Pruning |  | 0
SparseByteNN: A Novel Mobile Inference Acceleration Framework Based on Fine-Grained Group Sparsity | Code | 0
Linear Mode Connectivity in Sparse Neural Networks |  | 0
GraFT: Gradual Fusion Transformer for Multimodal Re-Identification |  | 0
Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLMs | Code | 1
Samples on Thin Ice: Re-Evaluating Adversarial Pruning of Neural Networks |  | 0
Filter Pruning For CNN With Enhanced Linear Representation Redundancy | Code | 0
Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity | Code | 1
SWAP: Sparse Entropic Wasserstein Regression for Robust Network Pruning | Code | 0
Feather: An Elegant Solution to Effective DNN Sparsification | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ResNet50-2.3 GFLOPs | Accuracy | 78.79 |  | Unverified
2 | ResNet50-1.5 GFLOPs | Accuracy | 78.07 |  | Unverified
3 | ResNet50 2.5 GFLOPS | Accuracy | 78 |  | Unverified
4 | RegX-1.6G | Accuracy | 77.97 |  | Unverified
5 | ResNet50 2.0 GFLOPS | Accuracy | 77.7 |  | Unverified
6 | ResNet50-3G FLOPs | Accuracy | 77.1 |  | Unverified
7 | ResNet50-2G FLOPs | Accuracy | 76.4 |  | Unverified
8 | ResNet50-1G FLOPs | Accuracy | 76.38 |  | Unverified
9 | TAS-pruned ResNet-50 | Accuracy | 76.2 |  | Unverified
10 | ResNet50 | Accuracy | 75.59 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Feather | Top-1 Accuracy | 76.93 |  | Unverified
2 | Spartan | Top-1 Accuracy | 76.17 |  | Unverified
3 | ST-3 | Top-1 Accuracy | 76.03 |  | Unverified
4 | AC/DC | Top-1 Accuracy | 75.64 |  | Unverified
5 | CS | Top-1 Accuracy | 75.5 |  | Unverified
6 | ProbMask | Top-1 Accuracy | 74.68 |  | Unverified
7 | STR | Top-1 Accuracy | 74.31 |  | Unverified
8 | DNW | Top-1 Accuracy | 74 |  | Unverified
9 | GMP | Top-1 Accuracy | 73.91 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | +U-DML* | Inference Time (ms) | 675.56 |  | Unverified
2 | Dense | Accuracy | 79 |  | Unverified
3 | AC/DC | Accuracy | 78.2 |  | Unverified
4 | Beta-Rank | Accuracy | 74.01 |  | Unverified
5 | TAS-pruned ResNet-110 | Accuracy | 73.16 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TAS-pruned ResNet-110 | Accuracy | 94.33 |  | Unverified
2 | ShuffleNet – Quantised | Inference Time (ms) | 23.15 |  | Unverified
3 | AlexNet – Quantised | Inference Time (ms) | 5.23 |  | Unverified
4 | MobileNet – Quantised | Inference Time (ms) | 4.74 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | FFN-ShapleyPruned | Avg #Steps | 12.05 |  | Unverified