| DepGraph: Towards Any Structural Pruning | Jan 30, 2023 | Network Pruning, Neural Network Compression | Code Available | 4 |
| Torch2Chip: An End-to-end Customizable Deep Neural Network Compression and Deployment Toolkit for Prototype Hardware Accelerator Design | May 2, 2024 | Model Compression, Neural Network Compression | Code Available | 2 |
| Data-Free Learning of Student Networks | Apr 2, 2019 | Neural Network Compression | Code Available | 2 |
| Neural Network Compression Framework for fast model inference | Feb 20, 2020 | Binarization, CPU | Code Available | 2 |
| A Survey on Deep Neural Network Pruning-Taxonomy, Comparison, Analysis, and Recommendations | Aug 13, 2023 | Adversarial Robustness, Network Pruning | Code Available | 2 |
| Head Network Distillation: Splitting Distilled Deep Neural Networks for Resource-Constrained Edge Computing Systems | Nov 20, 2020 | Edge-computing, image-classification | Code Available | 1 |
| SwiftTron: An Efficient Hardware Accelerator for Quantized Transformers | Apr 8, 2023 | Neural Network Compression, Quantization | Code Available | 1 |
| Prune Your Model Before Distill It | Sep 30, 2021 | Knowledge Distillation, model | Code Available | 1 |
| ZeroQ: A Novel Zero Shot Quantization Framework | Jan 1, 2020 | Data Free Quantization, Model Compression | Code Available | 1 |
| Towards Meta-Pruning via Optimal Transport | Feb 12, 2024 | Neural Network Compression | Code Available | 1 |
| The continuous categorical: a novel simplex-valued exponential family | Feb 20, 2020 | Neural Network Compression, Transfer Learning | Code Available | 1 |
| Distilled Split Deep Neural Networks for Edge-Assisted Real-Time Systems | Oct 1, 2019 | Edge-computing, Image Classification | Code Available | 1 |
| PD-Quant: Post-Training Quantization based on Prediction Difference Metric | Dec 14, 2022 | Neural Network Compression, Quantization | Code Available | 1 |
| CHIP: CHannel Independence-based Pruning for Compact Neural Networks | Oct 26, 2021 | Neural Network Compression | Code Available | 1 |
| WoodFisher: Efficient Second-Order Approximation for Neural Network Compression | Apr 29, 2020 | image-classification, Image Classification | Code Available | 1 |
| Robustness and Transferability of Universal Attacks on Compressed Models | Dec 10, 2020 | Neural Network Compression, Quantization | Code Available | 1 |
| Quantisation and Pruning for Neural Network Compression and Regularisation | Jan 14, 2020 | Network Pruning, Neural Network Compression | Code Available | 1 |
| Spectral Tensor Train Parameterization of Deep Learning Layers | Mar 7, 2021 | Deep Learning, image-classification | Code Available | 1 |
| T-Basis: a Compact Representation for Neural Networks | Jul 13, 2020 | Neural Network Compression, Tensor Networks | Code Available | 1 |
| Neural network compression via learnable wavelet transforms | Apr 20, 2020 | Data Compression, Neural Network Compression | Code Available | 1 |
| Wavelet Feature Maps Compression for Image-to-Image CNNs | May 24, 2022 | Depth Estimation, Neural Network Compression | Code Available | 1 |
| Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better | Jun 16, 2021 | Deep Learning, Information Retrieval | Code Available | 1 |
| NeRV: Neural Representations for Videos | Oct 26, 2021 | Denoising, Neural Network Compression | Code Available | 1 |
| FAT: Learning Low-Bitwidth Parametric Representation via Frequency-Aware Transformation | Feb 15, 2021 | Model Compression, Neural Network Compression | Code Available | 1 |
| Few-Bit Backward: Quantized Gradients of Activation Functions for Memory Footprint Reduction | Feb 1, 2022 | Neural Network Compression, Quantization | Code Available | 1 |
| REST: Robust and Efficient Neural Networks for Sleep Monitoring in the Wild | Jan 29, 2020 | EEG, Electroencephalogram (EEG) | Code Available | 1 |
| SPIN: An Empirical Evaluation on Sharing Parameters of Isotropic Networks | Jul 21, 2022 | Neural Network Compression | Code Available | 1 |
| Learning Filter Basis for Convolutional Neural Network Compression | Aug 23, 2019 | General Classification, image-classification | Code Available | 1 |
| An Overview of Neural Network Compression | Jun 5, 2020 | Knowledge Distillation, Model Compression | Unverified | 0 |
| Adaptive Error-Bounded Hierarchical Matrices for Efficient Neural Network Compression | Sep 11, 2024 | Efficient Neural Network, Neural Network Compression | Unverified | 0 |
| A Comprehensive Survey on Model Quantization for Deep Neural Networks in Image Classification | May 14, 2022 | image-classification, Image Classification | Unverified | 0 |
| A Novel Structure-Agnostic Multi-Objective Approach for Weight-Sharing Compression in Deep Neural Networks | Jan 6, 2025 | Neural Network Compression, Quantization | Unverified | 0 |
| DP-Net: Dynamic Programming Guided Deep Neural Network Compression | Mar 21, 2020 | Clustering, Neural Network Compression | Unverified | 0 |
| Efficient and Robust Knowledge Distillation from A Stronger Teacher Based on Correlation Matching | Oct 9, 2024 | Knowledge Distillation, Neural Network Compression | Unverified | 0 |
| A novel channel pruning method for deep neural network compression | May 29, 2018 | channel selection, Combinatorial Optimization | Unverified | 0 |
| Cascaded Projection: End-to-End Network Compression and Acceleration | Mar 12, 2019 | Neural Network Compression | Unverified | 0 |
| A Comparative Study of Neural Network Compression | Oct 24, 2019 | L2 Regularization, Neural Network Compression | Unverified | 0 |
| Build a Compact Binary Neural Network through Bit-level Sensitivity and Data Pruning | Feb 3, 2018 | Neural Network Compression, Sensitivity | Unverified | 0 |
| Boosting the Convergence of Reinforcement Learning-based Auto-pruning Using Historical Data | Jul 16, 2021 | Neural Network Compression, reinforcement-learning | Unverified | 0 |
| A Bayesian Optimization Framework for Neural Network Compression | Oct 1, 2019 | Bayesian Optimization, Knowledge Distillation | Unverified | 0 |
| Distilling Pixel-Wise Feature Similarities for Semantic Segmentation | Oct 31, 2019 | Knowledge Distillation, Neural Network Compression | Unverified | 0 |
| DKM: Differentiable K-Means Clustering Layer for Neural Network Compression | Aug 28, 2021 | Clustering, Model Compression | Unverified | 0 |
| Data-Free Knowledge Distillation with Soft Targeted Transfer Set Synthesis | Apr 10, 2021 | Data-free Knowledge Distillation, Knowledge Distillation | Unverified | 0 |
| Balanced and Deterministic Weight-sharing Helps Network Performance | Dec 13, 2023 | Neural Network Compression | Unverified | 0 |
| Data-Driven Low-Rank Neural Network Compression | Jul 13, 2021 | Neural Network Compression | Unverified | 0 |
| Data-Dependent Coresets for Compressing Neural Networks with Applications to Generalization Bounds | Apr 15, 2018 | Generalization Bounds, Neural Network Compression | Unverified | 0 |
| Automatic Parameter Tying in Neural Networks | Jan 1, 2018 | L2 Regularization, Neural Network Compression | Unverified | 0 |
| An Efficient Real-Time Object Detection Framework on Resource-Constricted Hardware Devices via Software and Hardware Co-design | Aug 2, 2024 | Model Compression, Neural Network Compression | Unverified | 0 |
| Deep Neural Network Compression for Aircraft Collision Avoidance Systems | Oct 9, 2018 | Collision Avoidance, Decision Making | Unverified | 0 |
| AcceRL: Policy Acceleration Framework for Deep Reinforcement Learning | Nov 28, 2022 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |