SOTAVerified

Neural Network Compression

Papers

Showing 4150 of 193 papers

TitleStatusHype
Modular Transformers: Compressing Transformers into Modularized Layers for Flexible Efficient Inference0
Variation Spaces for Multi-Output Neural Networks: Insights on Multi-Task Learning and Network CompressionCode0
Evaluation Metrics for DNNs Compression0
How Informative is the Approximation Error from Tensor Decomposition for Neural Network Compression?0
Guaranteed Quantization Error Computation for Neural Network Model Compression0
SwiftTron: An Efficient Hardware Accelerator for Quantized TransformersCode1
WHC: Weighted Hybrid Criterion for Filter Pruning on Convolutional Neural NetworksCode0
DepGraph: Towards Any Structural PruningCode4
Magnitude and Similarity based Variable Rate Filter Pruning for Efficient Convolution Neural NetworksCode0
PD-Quant: Post-Training Quantization based on Prediction Difference MetricCode1
Show:102550
← PrevPage 5 of 20Next →

No leaderboard results yet.