The State of Sparsity in Deep Neural Networks Feb 25, 2019 Model Compression Sparse Learning
Code Code Available 1Learned Step Size Quantization Feb 21, 2019 Model Compression Quantization
Code Code Available 1ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Method of Multipliers Dec 31, 2018 Model Compression Quantization
Code Code Available 1Discrimination-aware Channel Pruning for Deep Neural Networks Oct 28, 2018 channel selection Model Compression
Code Code Available 1Dynamic Channel Pruning: Feature Boosting and Suppression Oct 12, 2018 Model Compression Network Pruning
Code Code Available 1Verifiable Reinforcement Learning via Policy Extraction May 22, 2018 Deep Reinforcement Learning Imitation Learning
Code Code Available 1To prune, or not to prune: exploring the efficacy of pruning for model compression Oct 5, 2017 Model Compression
Code Code Available 1Ternary Weight Networks May 16, 2016 Model Compression object-detection
Code Code Available 1SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size Feb 24, 2016 Image Classification Model Compression
Code Code Available 1LINR-PCGC: Lossless Implicit Neural Representations for Point Cloud Geometry Compression Jul 21, 2025 Decoder Model Compression
— Unverified 0DipSVD: Dual-importance Protected SVD for Efficient LLM Compression Jun 25, 2025 Model Compression Quantization
— Unverified 0RLRC: Reinforcement Learning-based Recovery for Compressed Vision-Language-Action Models Jun 21, 2025 Model Compression Quantization
— Unverified 0Model compression using knowledge distillation with integrated gradients Jun 17, 2025 Data Augmentation Knowledge Distillation
— Unverified 0Simple is what you need for efficient and accurate medical image segmentation Jun 16, 2025 feature selection Image Segmentation
Code Code Available 0EAQuant: Enhancing Post-Training Quantization for MoE Models via Expert-Aware Optimization Jun 16, 2025 Mixture-of-Experts Model Compression
Code Code Available 0Attribution-guided Pruning for Compression, Circuit Discovery, and Targeted Correction in LLMs Jun 16, 2025 Model Compression
Code Code Available 0Advances in Small-Footprint Keyword Spotting: A Comprehensive Review of Efficient Models and Algorithms Jun 12, 2025 Automatic Speech Recognition Keyword Spotting
Code Code Available 0Post-Training Quantization for Video Matting Jun 12, 2025 Image Matting Model Compression
— Unverified 0AWP: Activation-Aware Weight Pruning and Quantization with Projected Gradient Descent Jun 11, 2025 Model Compression Quantization
— Unverified 0Structured Pruning and Quantization for Learned Image Compression Jun 2, 2025 image-classification Image Classification
Code Code Available 0INSIGHT: A Survey of In-Network Systems for Intelligent, High-Efficiency AI and Topology Optimization May 30, 2025 Federated Learning Intrusion Detection
— Unverified 0Smooth Model Compression without Fine-Tuning May 30, 2025 model Model Compression
— Unverified 0FLAT-LLM: Fine-grained Low-rank Activation Space Transformation for Large Language Model Compression May 29, 2025 Language Modeling Language Modelling
Code Code Available 0Effective and Efficient One-pass Compression of Speech Foundation Models Using Sparsity-aware Self-pinching Gates May 28, 2025 Model Compression
— Unverified 0ResSVD: Residual Compensated SVD for Large Language Model Compression May 26, 2025 Language Modeling Language Modelling
— Unverified 0Tensorization is a powerful but underexplored tool for compression and interpretability of neural networks May 26, 2025 Deep Learning Model Compression
— Unverified 0Small Language Models: Architectures, Techniques, Evaluation, Problems and Future Adaptation May 26, 2025 Model Compression Quantization
— Unverified 0Pangu Light: Weight Re-Initialization for Pruning and Accelerating LLMs May 26, 2025 Model Compression
— Unverified 0Efficient Speech Translation through Model Compression and Knowledge Distillation May 26, 2025 Knowledge Distillation Model Compression
Code Code Available 0Knowledge Grafting of Large Language Models May 24, 2025 Continual Learning Knowledge Distillation
Code Code Available 0Making deep neural networks work for medical audio: representation, compression and domain adaptation May 24, 2025 Domain Adaptation Model Compression
— Unverified 0Efficient and Workload-Aware LLM Serving via Runtime Layer Swapping and KV Cache Resizing May 24, 2025 Model Compression Quantization
— Unverified 0LatentLLM: Attention-Aware Joint Tensor Compression May 23, 2025 Model Compression Tensor Decomposition
— Unverified 0Is Quantum Optimization Ready? An Effort Towards Neural Network Compression using Adiabatic Quantum Computing May 22, 2025 Model Compression Neural Network Compression
— Unverified 0Edge-First Language Model Inference: Models, Metrics, and Tradeoffs May 22, 2025 Benchmarking Language Modeling
— Unverified 0On Multilingual Encoder Language Model Compression for Low-Resource Languages May 22, 2025 Knowledge Distillation Language Modeling
— Unverified 0Saten: Sparse Augmented Tensor Networks for Post-Training Compression of Large Language Models May 20, 2025 Model Compression Tensor Networks
— Unverified 0RanDeS: Randomized Delta Superposition for Multi-Model Compression May 16, 2025 model Model Compression
Code Code Available 0Low-Complexity Inference in Continual Learning via Compressed Knowledge Transfer May 13, 2025 class-incremental learning Class Incremental Learning
— Unverified 0KDH-MLTC: Knowledge Distillation for Healthcare Multi-Label Text Classification May 12, 2025 Classification Hyperparameter Optimization
— Unverified 0Semantic Retention and Extreme Compression in LLMs: Can We Have Both? May 12, 2025 Language Modeling Language Modelling
— Unverified 0Sponge Attacks on Sensing AI: Energy-Latency Vulnerabilities and Defense via Model Pruning May 9, 2025 Model Compression
— Unverified 0Edge-Optimized Deep Learning & Pattern Recognition Techniques for Non-Intrusive Load Monitoring of Energy Time Series May 7, 2025 Model Compression Non-Intrusive Load Monitoring
— Unverified 0Onboard Optimization and Learning: A Survey May 7, 2025 Decision Making Model Compression
— Unverified 0Optimizing LLMs for Resource-Constrained Environments: A Survey of Model Compression Techniques May 5, 2025 Knowledge Distillation Mixture-of-Experts
— Unverified 0Radio: Rate-Distortion Optimization for Large Language Model Compression May 5, 2025 Language Modeling Language Modelling
— Unverified 0Smart Environmental Monitoring of Marine Pollution using Edge AI Apr 30, 2025 Edge-computing Model Compression
— Unverified 0Towards Faster and More Compact Foundation Models for Molecular Property Prediction Apr 28, 2025 Model Compression Molecular Property Prediction
Code Code Available 0Low-Rank Matrix Approximation for Neural Network Compression Apr 25, 2025 Model Compression Neural Network Compression
— Unverified 0On-Device Qwen2.5: Efficient LLM Inference with Model Compression and Hardware Acceleration Apr 24, 2025 CPU Model Compression
— Unverified 0