| Linearity-based neural network compression | Jun 26, 2025 | Efficient Neural NetworkNeural Network Compression | —Unverified | 0 |
| MUC-G4: Minimal Unsat Core-Guided Incremental Verification for Deep Neural Network Compression | Jun 3, 2025 | Neural Network CompressionQuantization | —Unverified | 0 |
| Is Quantum Optimization Ready? An Effort Towards Neural Network Compression using Adiabatic Quantum Computing | May 22, 2025 | Model CompressionNeural Network Compression | —Unverified | 0 |
| Certified Neural Approximations of Nonlinear Dynamics | May 21, 2025 | Neural Network Compression | CodeCode Available | 0 |
| Low-Rank Matrix Approximation for Neural Network Compression | Apr 25, 2025 | Model CompressionNeural Network Compression | —Unverified | 0 |
| GranQ: Granular Zero-Shot Quantization with Channel-Wise Activation Scaling in QAT | Mar 24, 2025 | Neural Network CompressionQuantization | —Unverified | 0 |
| Stabilizing Quantization-Aware Training by Implicit-Regularization on Hessian Matrix | Mar 14, 2025 | Neural Network CompressionQuantization | —Unverified | 0 |
| Compression of Site-Specific Deep Neural Networks for Massive MIMO Precoding | Feb 12, 2025 | Neural Architecture SearchNeural Network Compression | —Unverified | 0 |
| A Novel Structure-Agnostic Multi-Objective Approach for Weight-Sharing Compression in Deep Neural Networks | Jan 6, 2025 | Neural Network CompressionQuantization | —Unverified | 0 |
| What is Left After Distillation? How Knowledge Transfer Impacts Fairness and Bias | Oct 10, 2024 | Age/UnbiasedFairness | —Unverified | 0 |
| Efficient and Robust Knowledge Distillation from A Stronger Teacher Based on Correlation Matching | Oct 9, 2024 | Knowledge DistillationNeural Network Compression | —Unverified | 0 |
| Language Models as Zero-shot Lossless Gradient Compressors: Towards General Neural Parameter Prior Models | Sep 26, 2024 | Neural Network CompressionQuantization | CodeCode Available | 0 |
| Adaptive Error-Bounded Hierarchical Matrices for Efficient Neural Network Compression | Sep 11, 2024 | Efficient Neural NetworkNeural Network Compression | —Unverified | 0 |
| TropNNC: Structured Neural Network Compression Using Tropical Geometry | Sep 5, 2024 | Neural Network Compression | —Unverified | 0 |
| Unified Framework for Neural Network Compression via Decomposition and Optimal Rank Selection | Sep 5, 2024 | Neural Network CompressionTensor Decomposition | —Unverified | 0 |
| Convolutional Neural Network Compression Based on Low-Rank Decomposition | Aug 29, 2024 | Model CompressionNeural Network Compression | —Unverified | 0 |
| Condensed Sample-Guided Model Inversion for Knowledge Distillation | Aug 25, 2024 | Knowledge Distillationmodel | —Unverified | 0 |
| An Efficient Real-Time Object Detection Framework on Resource-Constricted Hardware Devices via Software and Hardware Co-design | Aug 2, 2024 | Model CompressionNeural Network Compression | —Unverified | 0 |
| Tiled Bit Networks: Sub-Bit Neural Network Compression Through Reuse of Learnable Binary Vectors | Jul 16, 2024 | GPUNeural Network Compression | —Unverified | 0 |
| The Impact of Quantization and Pruning on Deep Reinforcement Learning Models | Jul 5, 2024 | Deep Reinforcement LearningModel Compression | —Unverified | 0 |
| Neural Network Compression for Reinforcement Learning Tasks | May 13, 2024 | Neural Network Compressionreinforcement-learning | —Unverified | 0 |
| Torch2Chip: An End-to-end Customizable Deep Neural Network Compression and Deployment Toolkit for Prototype Hardware Accelerator Design | May 2, 2024 | Model CompressionNeural Network Compression | CodeCode Available | 2 |
| Towards Explaining Deep Neural Network Compression Through a Probabilistic Latent Space | Feb 29, 2024 | Neural Network Compression | —Unverified | 0 |
| SPC-NeRF: Spatial Predictive Compression for Voxel Based Radiance Field | Feb 26, 2024 | Image CompressionNeRF | —Unverified | 0 |
| Towards Meta-Pruning via Optimal Transport | Feb 12, 2024 | Neural Network Compression | CodeCode Available | 1 |
| EPSD: Early Pruning with Self-Distillation for Efficient Model Compression | Jan 31, 2024 | Knowledge DistillationModel Compression | —Unverified | 0 |
| Convolutional Neural Network Compression via Dynamic Parameter Rank Pruning | Jan 15, 2024 | Model CompressionNeural Network Compression | —Unverified | 0 |
| Balanced and Deterministic Weight-sharing Helps Network Performance | Dec 13, 2023 | Neural Network Compression | —Unverified | 0 |
| ABKD: Graph Neural Network Compression with Attention-Based Knowledge Distillation | Oct 24, 2023 | Drug DiscoveryFake News Detection | —Unverified | 0 |
| Grokking as Compression: A Nonlinear Complexity Perspective | Oct 9, 2023 | AttributeMemorization | —Unverified | 0 |
| Causal-DFQ: Causality Guided Data-free Network Quantization | Sep 24, 2023 | Data Free QuantizationNeural Network Compression | CodeCode Available | 0 |
| A Survey on Deep Neural Network Pruning-Taxonomy, Comparison, Analysis, and Recommendations | Aug 13, 2023 | Adversarial RobustnessNetwork Pruning | CodeCode Available | 2 |
| Quantization Aware Factorization for Deep Neural Network Compression | Aug 8, 2023 | Neural Network CompressionQuantization | —Unverified | 0 |
| Survey on Computer Vision Techniques for Internet-of-Things Devices | Aug 2, 2023 | Neural Network CompressionSurvey | —Unverified | 0 |
| Model Compression Methods for YOLOv5: A Review | Jul 21, 2023 | Knowledge Distillationmodel | —Unverified | 0 |
| Lightweight Attribute Localizing Models for Pedestrian Attribute Recognition | Jun 16, 2023 | AttributeNeural Network Compression | —Unverified | 0 |
| Neural Network Compression using Binarization and Few Full-Precision Weights | Jun 15, 2023 | BinarizationCPU | —Unverified | 0 |
| Implicit Compressibility of Overparametrized Neural Networks Trained with Heavy-Tailed SGD | Jun 13, 2023 | Neural Network Compression | CodeCode Available | 0 |
| Understanding the Effect of the Long Tail on Neural Network Compression | Jun 9, 2023 | image-classificationImage Classification | —Unverified | 0 |
| End-to-End Neural Network Compression via _1_2 Regularized Latency Surrogates | Jun 9, 2023 | Neural Architecture SearchNeural Network Compression | —Unverified | 0 |
| Modular Transformers: Compressing Transformers into Modularized Layers for Flexible Efficient Inference | Jun 4, 2023 | DecoderKnowledge Distillation | —Unverified | 0 |
| Variation Spaces for Multi-Output Neural Networks: Insights on Multi-Task Learning and Network Compression | May 25, 2023 | Multi-Task LearningNeural Network Compression | CodeCode Available | 0 |
| Evaluation Metrics for DNNs Compression | May 18, 2023 | Neural Network CompressionObject | —Unverified | 0 |
| How Informative is the Approximation Error from Tensor Decomposition for Neural Network Compression? | May 9, 2023 | Neural Network CompressionTensor Decomposition | —Unverified | 0 |
| Guaranteed Quantization Error Computation for Neural Network Model Compression | Apr 26, 2023 | Model CompressionNeural Network Compression | —Unverified | 0 |
| SwiftTron: An Efficient Hardware Accelerator for Quantized Transformers | Apr 8, 2023 | Neural Network CompressionQuantization | CodeCode Available | 1 |
| WHC: Weighted Hybrid Criterion for Filter Pruning on Convolutional Neural Networks | Feb 16, 2023 | ClassificationNetwork Pruning | CodeCode Available | 0 |
| DepGraph: Towards Any Structural Pruning | Jan 30, 2023 | Network PruningNeural Network Compression | CodeCode Available | 4 |
| Magnitude and Similarity based Variable Rate Filter Pruning for Efficient Convolution Neural Networks | Dec 27, 2022 | Network PruningNeural Network Compression | CodeCode Available | 0 |
| PD-Quant: Post-Training Quantization based on Prediction Difference Metric | Dec 14, 2022 | Neural Network CompressionQuantization | CodeCode Available | 1 |