Pangu Light: Weight Re-Initialization for Pruning and Accelerating LLMs May 26, 2025 Model Compression
— Unverified 0Parameter Compression of Recurrent Neural Networks and Degradation of Short-term Memory Dec 2, 2016 Memorization Model Compression
— Unverified 0Partitioning-Guided K-Means: Extreme Empty Cluster Resolution for Extreme Model Compression Jun 24, 2023 Model Compression Quantization
— Unverified 0PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning Jan 1, 2020 Code Generation Model Compression
— Unverified 0PCEE-BERT: Accelerating BERT Inference via Patient and Confident Early Exiting Jan 16, 2022 Model Compression
— Unverified 0PC-LoRA: Low-Rank Adaptation for Progressive Model Compression with Knowledge Distillation Jun 13, 2024 Knowledge Distillation Model Compression
— Unverified 0PCNN: Pattern-based Fine-Grained Regular Pruning towards Optimizing CNN Accelerators Feb 11, 2020 Model Compression
— Unverified 0PCONV: The Missing but Desirable Sparsity in DNN Weight Pruning for Real-time Execution on Mobile Devices Sep 6, 2019 Model Compression
— Unverified 0Pea-KD: Parameter-efficient and Accurate Knowledge Distillation on BERT Sep 30, 2020 Knowledge Distillation Model Compression
— Unverified 0Pea-KD: Parameter-efficient and accurate Knowledge Distillation Sep 28, 2020 Knowledge Distillation Model Compression
— Unverified 0Penrose Tiled Low-Rank Compression and Section-Wise Q&A Fine-Tuning: A General Framework for Domain-Specific Large Language Model Adaptation Mar 28, 2025 Language Modeling Language Modelling
— Unverified 0Performance Aware Convolutional Neural Network Channel Pruning for Embedded GPUs Feb 20, 2020 Model Compression Network Pruning
— Unverified 0PERMDNN: Efficient Compressed DNN Architecture with Permuted Diagonal Matrices Apr 23, 2020 Model Compression
— Unverified 0Perturbation of Deep Autoencoder Weights for Model Compression and Classification of Tabular Data May 17, 2022 BIG-bench Machine Learning Classification
— Unverified 0PFGDF: Pruning Filter via Gaussian Distribution Feature for Deep Neural Networks Acceleration Jun 23, 2020 Model Compression
— Unverified 0Pivoting Factorization: A Compact Meta Low-Rank Representation of Sparsity for Efficient Inference in Large Language Models Jan 31, 2025 GPU Model Compression
— Unverified 0Position-Aware Depth Decay Decoding (D^3): Boosting Large Language Model Inference Efficiency Mar 11, 2025 GSM8K Language Modeling
— Unverified 0Post-Training Quantization for Video Matting Jun 12, 2025 Image Matting Model Compression
— Unverified 0Shedding the Bits: Pushing the Boundaries of Quantization with Minifloats on FPGAs Nov 21, 2023 Model Compression Quantization
— Unverified 0Post-Training Weighted Quantization of Neural Networks for Language Models Jan 1, 2021 Model Compression Quantization
— Unverified 0PQK: Model Compression via Pruning, Quantization, and Knowledge Distillation Jun 25, 2021 Keyword Spotting Knowledge Distillation
— Unverified 0Practical quantum federated learning and its experimental demonstration Jan 22, 2025 Federated Learning Model Compression
— Unverified 0Precise Box Score: Extract More Information from Datasets to Improve the Performance of Face Detection Apr 28, 2018 Face Detection Model Compression
— Unverified 0Preventing Catastrophic Forgetting and Distribution Mismatch in Knowledge Distillation via Synthetic Data Aug 11, 2021 Knowledge Distillation Model Compression
— Unverified 0Preview-based Category Contrastive Learning for Knowledge Distillation Oct 18, 2024 Contrastive Learning Knowledge Distillation
— Unverified 0InDistill: Information flow-preserving knowledge distillation for model compression May 20, 2022 Knowledge Distillation Model Compression
Code Code Available 0CASP: Compression of Large Multimodal Models Based on Attention Sparsity Mar 7, 2025 Model Compression Quantization
Code Code Available 0Compressing Vision Transformers for Low-Resource Visual Learning Sep 5, 2023 Autonomous Navigation image-classification
Code Code Available 0Slicing Mutual Information Generalization Bounds for Neural Networks Jun 6, 2024 Generalization Bounds Model Compression
Code Code Available 0SlimNets: An Exploration of Deep Model Compression and Acceleration Aug 1, 2018 Knowledge Distillation Model Compression
Code Code Available 0Information-Theoretic Understanding of Population Risk Improvement with Model Compression Jan 27, 2019 Clustering Model Compression
Code Code Available 0Focused Quantization for Sparse CNNs Mar 7, 2019 Model Compression Neural Network Compression
Code Code Available 0Canonical convolutional neural networks Jun 3, 2022 Form Model Compression
Code Code Available 0ImPart: Importance-Aware Delta-Sparsification for Improved Model Compression and Merging in LLMs Apr 17, 2025 Model Compression Quantization
Code Code Available 0Model Compression with Adversarial Robustness: A Unified Optimization Framework Feb 10, 2019 Adversarial Robustness Model Compression
Code Code Available 0Visual Domain Adaptation for Monocular Depth Estimation on Resource-Constrained Hardware Aug 5, 2021 Depth Estimation Domain Adaptation
Code Code Available 0PruMUX: Augmenting Data Multiplexing with Model Compression May 24, 2023 Knowledge Distillation model
Code Code Available 0A Corrected Expected Improvement Acquisition Function Under Noisy Observations Oct 8, 2023 Bayesian Optimization Model Compression
Code Code Available 0Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression Aug 6, 2024 image-classification Image Classification
Code Code Available 0Towards Faster and More Compact Foundation Models for Molecular Property Prediction Apr 28, 2025 Model Compression Molecular Property Prediction
Code Code Available 0Tensorization of neural networks for improved privacy and interpretability Jan 10, 2025 Model Compression
Code Code Available 0Network Pruning via Performance Maximization Jun 19, 2021 Model Compression Network Pruning
Code Code Available 0Is Modularity Transferable? A Case Study through the Lens of Knowledge Distillation Mar 27, 2024 Domain Adaptation Knowledge Distillation
Code Code Available 0Tensorized Embedding Layers for Efficient Model Compression Jan 30, 2019 Language Modelling Machine Translation
Code Code Available 0APSQ: Additive Partial Sum Quantization with Algorithm-Hardware Co-Design Apr 10, 2025 Model Compression Quantization
Code Code Available 0Neural Architecture Codesign for Fast Physics Applications Jan 9, 2025 High-Level Synthesis Model Compression
Code Code Available 0Iterative Filter Pruning for Concatenation-based CNN Architectures May 4, 2024 Model Compression
Code Code Available 0TX-Ray: Quantifying and Explaining Model-Knowledge Transfer in (Un-)Supervised NLP Dec 2, 2019 Explainable Artificial Intelligence (XAI) Model Compression
Code Code Available 0JavaScript Convolutional Neural Networks for Keyword Spotting in the Browser: An Experimental Analysis Oct 30, 2018 Keyword Spotting Model Compression
Code Code Available 0Image Classification with CondenseNeXt for ARM-Based Computing Platforms Jun 26, 2021 Autonomous Driving Classification
Code Code Available 0