
Network Pruning

Network Pruning is a popular approach to reducing a heavy network to a lightweight form by removing its redundancy. In this approach, a complex over-parameterized network is first trained, then pruned according to some criterion, and finally fine-tuned to achieve performance comparable to the original network with far fewer parameters.

Source: Ensemble Knowledge Distillation for Learning Improved and Efficient Networks
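To make the train, prune, fine-tune pipeline concrete, here is a minimal sketch using PyTorch's torch.nn.utils.prune module with L1 weight magnitude as the pruning criterion. The toy model, the 50% sparsity target, and the elided training loops are illustrative assumptions, not the method of any specific paper listed below.

```python
import torch.nn as nn
import torch.nn.utils.prune as prune

# Toy over-parameterized model (an assumption for illustration).
model = nn.Sequential(nn.Linear(784, 256), nn.ReLU(), nn.Linear(256, 10))

# 1. Train the over-parameterized network (training loop elided).

# 2. Prune: zero out the 50% of weights with the smallest L1 magnitude
#    in each Linear layer. Magnitude is one simple criterion among many.
for module in model.modules():
    if isinstance(module, nn.Linear):
        prune.l1_unstructured(module, name="weight", amount=0.5)

# 3. Fine-tune: the pruning mask is re-applied on every forward pass,
#    so pruned connections stay at zero while the surviving weights
#    keep training (fine-tuning loop elided).

# Fold each mask into its weight tensor to make the sparsity permanent.
for module in model.modules():
    if isinstance(module, nn.Linear):
        prune.remove(module, "weight")

# Report the achieved sparsity over all Linear weights.
zeros = sum(int((m.weight == 0).sum()) for m in model.modules()
            if isinstance(m, nn.Linear))
total = sum(m.weight.numel() for m in model.modules()
            if isinstance(m, nn.Linear))
print(f"overall sparsity: {zeros / total:.1%}")
```

Note that this unstructured variant only zeroes individual weights; structured pruning (e.g., removing whole filters or channels, as several papers below survey) is what actually shrinks FLOPs and latency on dense hardware.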

Papers

Showing 1–25 of 534 papers

Title | Status | Hype
Exploring GLU Expansion Ratios: A Study of Structured Pruning in LLaMA-3.2 Models | Code | 5
Structured Pruning for Deep Convolutional Neural Networks: A survey | Code | 4
DepGraph: Towards Any Structural Pruning | Code | 4
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models | Code | 2
LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS | Code | 2
A Survey on Deep Neural Network Pruning-Taxonomy, Comparison, Analysis, and Recommendations | Code | 2
A Simple and Effective Pruning Approach for Large Language Models | Code | 2
Pruning Filters for Efficient ConvNets | Code | 2
ReplaceMe: Network Simplification via Layer Pruning and Linear Transformations | Code | 1
Is C4 Dataset Optimal for Pruning? An Investigation of Calibration Data for LLM Pruning | Code | 1
OATS: Outlier-Aware Pruning Through Sparse and Low Rank Decomposition | Code | 1
Adversarial Pruning: A Survey and Benchmark of Pruning Methods for Adversarial Robustness | Code | 1
Investigating Sparsity in Recurrent Neural Networks | Code | 1
Finding Lottery Tickets in Vision Models via Data-driven Spectral Foresight Pruning | Code | 1
Auto-Train-Once: Controller Network Guided Automatic Network Pruning from Scratch | Code | 1
Fluctuation-based Adaptive Structured Pruning for Large Language Models | Code | 1
Filter-Pruning of Lightweight Face Detectors Using a Geometric Median Criterion | Code | 1
Beyond Size: How Gradients Shape Pruning Decisions in Large Language Models | Code | 1
Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLMs | Code | 1
Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity | Code | 1
Feather: An Elegant Solution to Effective DNN Sparsification | Code | 1
Efficient and Flexible Neural Network Training through Layer-wise Feedback Propagation | Code | 1
Pruning vs Quantization: Which is Better? | Code | 1
How Sparse Can We Prune A Deep Network: A Fundamental Limit Viewpoint | Code | 1
Lottery Tickets in Evolutionary Optimization: On Sparse Backpropagation-Free Trainability | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ResNet50-2.3 GFLOPs | Accuracy | 78.79 | – | Unverified
2 | ResNet50-1.5 GFLOPs | Accuracy | 78.07 | – | Unverified
3 | ResNet50 2.5 GFLOPS | Accuracy | 78 | – | Unverified
4 | RegX-1.6G | Accuracy | 77.97 | – | Unverified
5 | ResNet50 2.0 GFLOPS | Accuracy | 77.7 | – | Unverified
6 | ResNet50-3G FLOPs | Accuracy | 77.1 | – | Unverified
7 | ResNet50-2G FLOPs | Accuracy | 76.4 | – | Unverified
8 | ResNet50-1G FLOPs | Accuracy | 76.38 | – | Unverified
9 | TAS-pruned ResNet-50 | Accuracy | 76.2 | – | Unverified
10 | ResNet50 | Accuracy | 75.59 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Feather | Top-1 Accuracy | 76.93 | – | Unverified
2 | Spartan | Top-1 Accuracy | 76.17 | – | Unverified
3 | ST-3 | Top-1 Accuracy | 76.03 | – | Unverified
4 | AC/DC | Top-1 Accuracy | 75.64 | – | Unverified
5 | CS | Top-1 Accuracy | 75.5 | – | Unverified
6 | ProbMask | Top-1 Accuracy | 74.68 | – | Unverified
7 | STR | Top-1 Accuracy | 74.31 | – | Unverified
8 | DNW | Top-1 Accuracy | 74 | – | Unverified
9 | GMP | Top-1 Accuracy | 73.91 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | +U-DML* | Inference Time (ms) | 675.56 | – | Unverified
2 | Dense | Accuracy | 79 | – | Unverified
3 | AC/DC | Accuracy | 78.2 | – | Unverified
4 | Beta-Rank | Accuracy | 74.01 | – | Unverified
5 | TAS-pruned ResNet-110 | Accuracy | 73.16 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TAS-pruned ResNet-110 | Accuracy | 94.33 | – | Unverified
2 | ShuffleNet – Quantised | Inference Time (ms) | 23.15 | – | Unverified
3 | AlexNet – Quantised | Inference Time (ms) | 5.23 | – | Unverified
4 | MobileNet – Quantised | Inference Time (ms) | 4.74 | – | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | FFN-ShapleyPruned | Avg #Steps | 12.05 | – | Unverified