| Scalable Neural Network Compression and Pruning Using Hard Clustering and L1 Regularization | Jun 14, 2018 | ClusteringNeural Network Compression | —Unverified | 0 |
| SCANN: Synthesis of Compact and Accurate Neural Networks | Apr 19, 2019 | Dimensionality ReductionNeural Network Compression | —Unverified | 0 |
| Semi-tensor Product-based TensorDecomposition for Neural Network Compression | Sep 30, 2021 | Low-rank compressionNeural Network Compression | —Unverified | 0 |
| Soft-to-Hard Vector Quantization for End-to-End Learning Compressible Representations | Apr 3, 2017 | Image CompressionNeural Network Compression | —Unverified | 0 |
| Sparse matrix products for neural network compression | Jan 1, 2021 | Neural Network Compression | —Unverified | 0 |
| SPC-NeRF: Spatial Predictive Compression for Voxel Based Radiance Field | Feb 26, 2024 | Image CompressionNeRF | —Unverified | 0 |
| Stabilizing Quantization-Aware Training by Implicit-Regularization on Hessian Matrix | Mar 14, 2025 | Neural Network CompressionQuantization | —Unverified | 0 |
| Survey on Computer Vision Techniques for Internet-of-Things Devices | Aug 2, 2023 | Neural Network CompressionSurvey | —Unverified | 0 |
| Taxonomy and Evaluation of Structured Compression of Convolutional Neural Networks | Dec 20, 2019 | Neural Network Compression | —Unverified | 0 |
| The Impact of Quantization and Pruning on Deep Reinforcement Learning Models | Jul 5, 2024 | Deep Reinforcement LearningModel Compression | —Unverified | 0 |
| ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression | Jul 20, 2017 | Neural Network Compression | —Unverified | 0 |
| Tiled Bit Networks: Sub-Bit Neural Network Compression Through Reuse of Learnable Binary Vectors | Jul 16, 2024 | GPUNeural Network Compression | —Unverified | 0 |
| TOCO: A Framework for Compressing Neural Network Models Based on Tolerance Analysis | Dec 18, 2019 | Active LearningNeural Network Compression | —Unverified | 0 |
| DeepCABAC: Context-adaptive binary arithmetic coding for deep neural network compression | May 15, 2019 | Neural Network CompressionQuantization | —Unverified | 0 |
| Toward Compact Parameter Representations for Architecture-Agnostic Neural Network Compression | Nov 19, 2021 | Neural Network CompressionQuantization | —Unverified | 0 |
| Towards Explaining Deep Neural Network Compression Through a Probabilistic Latent Space | Feb 29, 2024 | Neural Network Compression | —Unverified | 0 |
| Transform Quantization for CNN (Convolutional Neural Network) Compression | Sep 2, 2020 | Dimensionality ReductionNeural Network Compression | —Unverified | 0 |
| Tropical Geometrical Zonotope Reduction as Applied to Neural Network Compression. | Sep 29, 2021 | Neural Network Compression | —Unverified | 0 |
| TropNNC: Structured Neural Network Compression Using Tropical Geometry | Sep 5, 2024 | Neural Network Compression | —Unverified | 0 |
| UMEC: Unified model and embedding compression for efficient recommendation systems | Jan 1, 2021 | Efficient Neural Networkfeature selection | —Unverified | 0 |
| Understanding the Effect of the Long Tail on Neural Network Compression | Jun 9, 2023 | image-classificationImage Classification | —Unverified | 0 |
| Unified Framework for Neural Network Compression via Decomposition and Optimal Rank Selection | Sep 5, 2024 | Neural Network CompressionTensor Decomposition | —Unverified | 0 |
| Universal Deep Neural Network Compression | Feb 7, 2018 | Neural Network CompressionQuantization | —Unverified | 0 |
| VQN: Variable Quantization Noise for Neural Network Compression | Nov 16, 2021 | Neural Network CompressionQuantization | —Unverified | 0 |
| Weight Normalization based Quantization for Deep Neural Network Compression | Jul 1, 2019 | Model CompressionNeural Network Compression | —Unverified | 0 |
| What is Left After Distillation? How Knowledge Transfer Impacts Fairness and Bias | Oct 10, 2024 | Age/UnbiasedFairness | —Unverified | 0 |
| Generalized Ternary Connect: End-to-End Learning and Compression of Multiplication-Free Deep Neural Networks | Nov 12, 2018 | Edge-computingGeneral Classification | —Unverified | 0 |
| GranQ: Granular Zero-Shot Quantization with Channel-Wise Activation Scaling in QAT | Mar 24, 2025 | Neural Network CompressionQuantization | —Unverified | 0 |
| Grokking as Compression: A Nonlinear Complexity Perspective | Oct 9, 2023 | AttributeMemorization | —Unverified | 0 |
| Guaranteed Quantization Error Computation for Neural Network Model Compression | Apr 26, 2023 | Model CompressionNeural Network Compression | —Unverified | 0 |
| Hardware-Guided Symbiotic Training for Compact, Accurate, yet Execution-Efficient LSTM | Jan 30, 2019 | Language ModelingLanguage Modelling | —Unverified | 0 |
| HEMP: High-order Entropy Minimization for neural network comPression | Jul 12, 2021 | Neural Network CompressionQuantization | —Unverified | 0 |
| How Informative is the Approximation Error from Tensor Decomposition for Neural Network Compression? | May 9, 2023 | Neural Network CompressionTensor Decomposition | —Unverified | 0 |
| Hybrid Tensor Decomposition in Neural Network Compression | Jun 29, 2020 | Neural Network CompressionTensor Decomposition | —Unverified | 0 |
| Is Quantum Optimization Ready? An Effort Towards Neural Network Compression using Adiabatic Quantum Computing | May 22, 2025 | Model CompressionNeural Network Compression | —Unverified | 0 |
| Learning Filter Pruning Criteria for Deep Convolutional Neural Networks Acceleration | Jun 1, 2020 | image-classificationImage Classification | —Unverified | 0 |
| Lightweight Attribute Localizing Models for Pedestrian Attribute Recognition | Jun 16, 2023 | AttributeNeural Network Compression | —Unverified | 0 |
| Linearity-based neural network compression | Jun 26, 2025 | Efficient Neural NetworkNeural Network Compression | —Unverified | 0 |
| Compressing 3DCNNs Based on Tensor Train Decomposition | Dec 8, 2019 | Hand Gesture RecognitionHand-Gesture Recognition | —Unverified | 0 |
| Low-Rank Matrix Approximation for Neural Network Compression | Apr 25, 2025 | Model CompressionNeural Network Compression | —Unverified | 0 |
| Minimally Invasive Surgery for Sparse Neural Networks in Contrastive Manner | Jun 19, 2021 | Knowledge DistillationModel Compression | —Unverified | 0 |
| MINT: Deep Network Compression via Mutual Information-based Neuron Trimming | Mar 18, 2020 | Neural Network Compression | —Unverified | 0 |
| Partial Binarization of Neural Networks for Budget-Aware Efficient Learning | Nov 12, 2022 | BinarizationNeural Network Compression | —Unverified | 0 |
| MLPrune: Multi-Layer Pruning for Automated Neural Network Compression | Sep 27, 2018 | Model CompressionNeural Network Compression | —Unverified | 0 |
| Exact Backpropagation in Binary Weighted Networks with Group Weight Transformations | Jul 3, 2021 | BinarizationClassification with Binary Weight Network | CodeCode Available | 0 |
| WHC: Weighted Hybrid Criterion for Filter Pruning on Convolutional Neural Networks | Feb 16, 2023 | ClassificationNetwork Pruning | CodeCode Available | 0 |
| COP: Customized Deep Model Compression via Regularized Correlation-Based Filter-Level Pruning | Jun 25, 2019 | Model CompressionNeural Network Compression | CodeCode Available | 0 |
| MUSCO: Multi-Stage Compression of neural networks | Mar 24, 2019 | Neural Network Compression | CodeCode Available | 0 |
| A Programmable Approach to Neural Network Compression | Nov 6, 2019 | Bayesian OptimizationImage Classification | CodeCode Available | 0 |
| Parallel Blockwise Knowledge Distillation for Deep Neural Network Compression | Dec 5, 2020 | Knowledge DistillationNeural Network Compression | CodeCode Available | 0 |