SOTAVerified

Neural Network Compression

Papers

Showing 150 of 193 papers

TitleStatusHype
DepGraph: Towards Any Structural PruningCode4
Torch2Chip: An End-to-end Customizable Deep Neural Network Compression and Deployment Toolkit for Prototype Hardware Accelerator DesignCode2
Data-Free Learning of Student NetworksCode2
Neural Network Compression Framework for fast model inferenceCode2
A Survey on Deep Neural Network Pruning-Taxonomy, Comparison, Analysis, and RecommendationsCode2
Head Network Distillation: Splitting Distilled Deep Neural Networks for Resource-Constrained Edge Computing SystemsCode1
SwiftTron: An Efficient Hardware Accelerator for Quantized TransformersCode1
Prune Your Model Before Distill ItCode1
ZeroQ: A Novel Zero Shot Quantization FrameworkCode1
Towards Meta-Pruning via Optimal TransportCode1
The continuous categorical: a novel simplex-valued exponential familyCode1
Distilled Split Deep Neural Networks for Edge-Assisted Real-Time SystemsCode1
PD-Quant: Post-Training Quantization based on Prediction Difference MetricCode1
CHIP: CHannel Independence-based Pruning for Compact Neural NetworksCode1
WoodFisher: Efficient Second-Order Approximation for Neural Network CompressionCode1
Robustness and Transferability of Universal Attacks on Compressed ModelsCode1
Quantisation and Pruning for Neural Network Compression and RegularisationCode1
Spectral Tensor Train Parameterization of Deep Learning LayersCode1
T-Basis: a Compact Representation for Neural NetworksCode1
Neural network compression via learnable wavelet transformsCode1
Wavelet Feature Maps Compression for Image-to-Image CNNsCode1
Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and BetterCode1
NeRV: Neural Representations for VideosCode1
FAT: Learning Low-Bitwidth Parametric Representation via Frequency-Aware TransformationCode1
Few-Bit Backward: Quantized Gradients of Activation Functions for Memory Footprint ReductionCode1
REST: Robust and Efficient Neural Networks for Sleep Monitoring in the WildCode1
SPIN: An Empirical Evaluation on Sharing Parameters of Isotropic NetworksCode1
Learning Filter Basis for Convolutional Neural Network CompressionCode1
An Overview of Neural Network Compression0
Adaptive Error-Bounded Hierarchical Matrices for Efficient Neural Network Compression0
A Comprehensive Survey on Model Quantization for Deep Neural Networks in Image Classification0
A Novel Structure-Agnostic Multi-Objective Approach for Weight-Sharing Compression in Deep Neural Networks0
DP-Net: Dynamic Programming Guided Deep Neural Network Compression0
Efficient and Robust Knowledge Distillation from A Stronger Teacher Based on Correlation Matching0
A novel channel pruning method for deep neural network compression0
Cascaded Projection: End-to-End Network Compression and Acceleration0
A Comparative Study of Neural Network Compression0
Build a Compact Binary Neural Network through Bit-level Sensitivity and Data Pruning0
Boosting the Convergence of Reinforcement Learning-based Auto-pruning Using Historical Data0
A Bayesian Optimization Framework for Neural Network Compression0
Distilling Pixel-Wise Feature Similarities for Semantic Segmentation0
DKM: Differentiable K-Means Clustering Layer for Neural Network Compression0
Data-Free Knowledge Distillation with Soft Targeted Transfer Set Synthesis0
Balanced and Deterministic Weight-sharing Helps Network Performance0
Data-Driven Low-Rank Neural Network Compression0
Data-Dependent Coresets for Compressing Neural Networks with Applications to Generalization Bounds0
Automatic Parameter Tying in Neural Networks0
An Efficient Real-Time Object Detection Framework on Resource-Constricted Hardware Devices via Software and Hardware Co-design0
Deep Neural Network Compression for Aircraft Collision Avoidance Systems0
AcceRL: Policy Acceleration Framework for Deep Reinforcement Learning0
Show:102550
← PrevPage 1 of 4Next →

No leaderboard results yet.