| DepGraph: Towards Any Structural Pruning | Jan 30, 2023 | Network Pruning, Neural Network Compression | Code Available | 4 | 5 |
| Data-Free Learning of Student Networks | Apr 2, 2019 | Neural Network Compression | Code Available | 2 | 5 |
| Neural Network Compression Framework for fast model inference | Feb 20, 2020 | Binarization, CPU | Code Available | 2 | 5 |
| Torch2Chip: An End-to-end Customizable Deep Neural Network Compression and Deployment Toolkit for Prototype Hardware Accelerator Design | May 2, 2024 | Model Compression, Neural Network Compression | Code Available | 2 | 5 |
| A Survey on Deep Neural Network Pruning-Taxonomy, Comparison, Analysis, and Recommendations | Aug 13, 2023 | Adversarial Robustness, Network Pruning | Code Available | 2 | 5 |
| Few-Bit Backward: Quantized Gradients of Activation Functions for Memory Footprint Reduction | Feb 1, 2022 | Neural Network Compression, Quantization | Code Available | 1 | 5 |
| CHIP: CHannel Independence-based Pruning for Compact Neural Networks | Oct 26, 2021 | Neural Network Compression | Code Available | 1 | 5 |
| Head Network Distillation: Splitting Distilled Deep Neural Networks for Resource-Constrained Edge Computing Systems | Nov 20, 2020 | Edge-computing, image-classification | Code Available | 1 | 5 |
| Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better | Jun 16, 2021 | Deep Learning, Information Retrieval | Code Available | 1 | 5 |
| Distilled Split Deep Neural Networks for Edge-Assisted Real-Time Systems | Oct 1, 2019 | Edge-computing, Image Classification | Code Available | 1 | 5 |