Toward Extremely Low Bit and Lossless Accuracy in DNNs with Progressive ADMM May 2, 2019 Model Compression Quantization
— Unverified 00 Model Compression via Hyper-Structure Network Jan 1, 2021 model Model Compression
— Unverified 00 Model Compression via Symmetries of the Parameter Space Sep 29, 2021 model Model Compression
— Unverified 00 Toward Real-World Voice Disorder Classification Dec 5, 2021 Classification Model Compression
— Unverified 00 Model Compression with Generative Adversarial Networks May 1, 2019 CPU Diversity
— Unverified 00 Model Compression with Multi-Task Knowledge Distillation for Web-scale Question Answering System Apr 21, 2019 Knowledge Distillation Model Compression
— Unverified 00 Model Compression with Two-stage Multi-teacher Knowledge Distillation for Web Question Answering System Oct 18, 2019 General Knowledge Knowledge Distillation
— Unverified 00 An Effective Information Theoretic Framework for Channel Pruning Aug 14, 2024 Model Compression
— Unverified 00 Model Distillation with Knowledge Transfer from Face Classification to Alignment and Verification Sep 9, 2017 Classification Face Recognition
— Unverified 00 On Cross-Layer Alignment for Model Fusion of Heterogeneous Neural Networks Oct 29, 2021 Knowledge Distillation Model Compression
— Unverified 00 Towards Accurate Post-Training Quantization for Vision Transformer Mar 25, 2023 Model Compression Quantization
— Unverified 00 A Light-weight Deep Human Activity Recognition Algorithm Using Multi-knowledge Distillation Jul 6, 2021 Activity Recognition Classification
— Unverified 00 Towards a tailored mixed-precision sub-8-bit quantization scheme for Gated Recurrent Units using Genetic Algorithms Feb 19, 2024 Model Compression Quantization
— Unverified 00 Modular Transformers: Compressing Transformers into Modularized Layers for Flexible Efficient Inference Jun 4, 2023 Decoder Knowledge Distillation
— Unverified 00 Modulating Regularization Frequency for Efficient Compression-Aware Model Training May 5, 2021 Model Compression
— Unverified 00 MoQa: Rethinking MoE Quantization with Multi-stage Data-model Distribution Awareness Mar 27, 2025 Language Modeling Language Modelling
— Unverified 00 MPruner: Optimizing Neural Network Size with CKA-Based Mutual Information Pruning Aug 24, 2024 Model Compression
— Unverified 00 MSP: An FPGA-Specific Mixed-Scheme, Multi-Precision Deep Neural Network Quantization Framework Sep 16, 2020 Deep Learning Edge-computing
— Unverified 00 MT-BioNER: Multi-task Learning for Biomedical Named Entity Recognition using Deep Bidirectional Transformers Jan 24, 2020 domain classification General Classification
— Unverified 00 Towards Better Parameter-Efficient Fine-Tuning for Large Language Models: A Position Paper Nov 22, 2023 Model Compression parameter-efficient fine-tuning
— Unverified 00 Multi-Dimensional Pruning: A Unified Framework for Model Compression Jun 1, 2020 Model Compression
— Unverified 00 Towards Building a Real Time Mobile Device Bird Counting System Through Synthetic Data Training and Model Compression Dec 15, 2019 Crowd Counting Model Compression
— Unverified 00 Multi-head Knowledge Distillation for Model Compression Dec 5, 2020 image-classification Image Classification
— Unverified 00 An Automatic and Efficient BERT Pruning for Edge AI Systems Jun 21, 2022 CPU Model Compression
— Unverified 00 Towards domain generalisation in ASR with elitist sampling and ensemble knowledge distillation Mar 1, 2023 Domain Adaptation Knowledge Distillation
— Unverified 00 Multi-Precision Quantized Neural Networks via Encoding Decomposition of -1 and +1 May 31, 2019 image-classification Image Classification
— Unverified 00 MultiPruner: Balanced Structure Removal in Foundation Models Jan 17, 2025 Model Compression
— Unverified 00 Multi-stage Progressive Compression of Conformer Transducer for On-device Speech Recognition Oct 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Multi-task Learning Approach for Modulation and Wireless Signal Classification for 5G and Beyond: Edge Deployment via Model Compression Feb 26, 2022 Management Model Compression
— Unverified 00 Multi-Task Semantic Communications via Large Models Mar 28, 2025 Model Compression Retrieval-augmented Generation
— Unverified 00 Multi-Task Zipping via Layer-wise Neuron Sharing May 24, 2018 Model Compression
— Unverified 00 MWQ: Multiscale Wavelet Quantized Neural Networks Mar 9, 2021 Model Compression Quantization
— Unverified 00 N2N Learning: Network to Network Compression via Policy Gradient Reinforcement Learning Sep 18, 2017 Model Compression reinforcement-learning
— Unverified 00 Analysis of Quantization on MLP-based Vision Models Sep 14, 2022 Model Compression Quantization
— Unverified 00 N-Ary Quantization for CNN Model Compression and Inference Acceleration May 1, 2019 Clustering Model Compression
— Unverified 00 NAS-BERT: Task-Agnostic and Adaptive-Size BERT Compression with Neural Architecture Search May 30, 2021 Language Modelling Model Compression
— Unverified 00 Natively Interpretable Machine Learning and Artificial Intelligence: Preliminary Results and Future Directions Jan 2, 2019 Anomaly Detection BIG-bench Machine Learning
— Unverified 00 NeR-VCP: A Video Content Protection Method Based on Implicit Neural Representation Aug 20, 2024 Model Compression NER
— Unverified 00 Reconstructing Pruned Filters using Cheap Spatial Transformations Oct 25, 2021 Feature Compression Knowledge Distillation
— Unverified 00 Network Implosion: Effective Model Compression for ResNets via Static Layer Pruning and Retraining Jun 10, 2019 Model Compression
— Unverified 00 Network Pruning for Low-Rank Binary Index Sep 25, 2019 Model Compression Network Pruning
— Unverified 00 Network Pruning for Low-Rank Binary Indexing May 14, 2019 Model Compression Network Pruning
— Unverified 00 Weight Normalization based Quantization for Deep Neural Network Compression Jul 1, 2019 Model Compression Neural Network Compression
— Unverified 00 Neural 3D Scene Compression via Model Compression May 7, 2021 Image Compression model
— Unverified 00 Neural Architecture Codesign for Fast Bragg Peak Analysis Dec 10, 2023 AutoML Model Compression
— Unverified 00 ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation Mar 8, 2025 Autonomous Driving feature selection
— Unverified 00 Neural Network Compression for Noisy Storage Devices Feb 15, 2021 Model Compression Neural Network Compression
— Unverified 00 Neural Network Compression using Binarization and Few Full-Precision Weights Jun 15, 2023 Binarization CPU
— Unverified 00 Neural Network Compression Via Sparse Optimization Nov 10, 2020 Model Compression Neural Network Compression
— Unverified 00 Neural Network Pruning by Cooperative Coevolution Apr 12, 2022 Evolutionary Algorithms Model Compression
— Unverified 00