Zeroth-Order Fine-Tuning of LLMs with Extreme Sparsity Jun 5, 2024 GPU Quantization
ZipML: Training Linear Models with End-to-End Low Precision, and a Little Bit of Deep Learning Aug 1, 2017 Quantization
ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification Oct 11, 2024 MME Quantization
ZOBNN: Zero-Overhead Dependable Design of Binary Neural Networks with Deliberately Quantized Parameters Jul 6, 2024 Attribute Quantization
1.58-bit FLUX Dec 24, 2024 Computational Efficiency Image Generation
MobiVSR: A Visual Speech Recognition Solution for Mobile Devices May 10, 2019 Lip Reading Quantization
Model Agnostic Hybrid Sharding For Heterogeneous Distributed Inference Jul 29, 2024 Quantization
Model-Based Detector for SSDs in the Presence of Inter-cell Interference Jan 31, 2019 Decoder Quantization
Model Compression May 20, 2021 BIG-bench Machine Learning model
Model Compression and Efficient Inference for Large Language Models: A Survey Feb 15, 2024 Knowledge Distillation Model Compression
Model compression as constrained optimization, with application to neural nets. Part II: quantization Jul 13, 2017 Binarization Model Compression
Model compression as constrained optimization, with application to neural nets. Part I: general framework Jul 5, 2017 Model Compression Object Recognition
Model compression as constrained optimization, with application to neural nets. Part V: combining compressions Jul 9, 2021 Additive models Low-rank compression
Model Compression for DNN-based Speaker Verification Using Weight Quantization Oct 31, 2022 Model Compression Quantization
Model Compression Methods for YOLOv5: A Review Jul 21, 2023 Knowledge Distillation model
Model Hemorrhage and the Robustness Limits of Large Language Models Mar 31, 2025 Quantization
Modeling Image Quantization Tradeoffs for Optimal Compression Dec 14, 2021 Quantization
Modeling Realistic Degradations in Non-blind Deconvolution Jun 4, 2018 Deblurring Image Deblurring
Model Predictive Control for Neuromimetic Quantized Systems Dec 19, 2022 model Model Predictive Control
Model Selection CNN-based VVC QualityEnhancement May 7, 2021 Decoder model
Modular Transformers: Compressing Transformers into Modularized Layers for Flexible Efficient Inference Jun 4, 2023 Decoder Knowledge Distillation
Modulation For Modulo: A Sampling-Efficient High-Dynamic Range ADC Nov 22, 2023 Quantization
Modulo Sampling: Performance Guarantees in The Presence of Quantization Jan 2, 2025 Quantization
MoGenTS: Motion Generation based on Spatial-Temporal Joint Modeling Sep 26, 2024 Motion Generation Quantization
Mokey: Enabling Narrow Fixed-Point Inference for Out-of-the-Box Floating-Point Transformer Models Mar 23, 2022 Quantization
Moment Quantization for Video Temporal Grounding Apr 3, 2025 Quantization Video Understanding
Moniqua: Modulo Quantized Communication in Decentralized SGD Feb 26, 2020 Quantization
Monte Carlo Deep Neural Network Arithmetic Sep 25, 2019 image-classification Image Classification
MoQa: Rethinking MoE Quantization with Multi-stage Data-model Distribution Awareness Mar 27, 2025 Language Modeling Language Modelling
More for Keys, Less for Values: Adaptive KV Cache Quantization Feb 20, 2025 Quantization
More Tokens, Lower Precision: Towards the Optimal Token-Precision Trade-off in KV Cache Compression Dec 17, 2024 Quantization
MorphIC: A 65-nm 738k-Synapse/mm^2 Quad-Core Binary-Weight Digital Neuromorphic Processor with Stochastic Spike-Driven Online Learning Apr 17, 2019 2k Quantization
MoTE: Mixture of Ternary Experts for Memory-efficient Large Multimodal Models Jun 17, 2025 Mixture-of-Experts Quantization
MotionDreamer: One-to-Many Motion Synthesis with Localized Generative Masked Transformer Apr 11, 2025 Motion Synthesis Quantization
MPDCompress - Matrix Permutation Decomposition Algorithm for Deep Neural Network Compression May 30, 2018 Neural Network Compression Quantization
MPTQ-ViT: Mixed-Precision Post-Training Quantization for Vision Transformer Jan 26, 2024 Quantization
MQGrad: Reinforcement Learning of Gradient Quantization in Parameter Server Apr 22, 2018 BIG-bench Machine Learning Quantization
MQuant: Unleashing the Inference Potential of Multimodal Large Language Models via Full Static Quantization Feb 1, 2025 Quantization
Mr.BiQ: Post-Training Non-Uniform Quantization Based on Minimizing the Reconstruction Error Jan 1, 2022 Binarization Quantization
MRQ:Support Multiple Quantization Schemes through Model Re-Quantization Aug 1, 2023 model Quantization
MSE Minimization in RIS-Aided MU-MIMO with Discrete Phase Shifts and Fronthaul Quantization Jun 18, 2024 Quantization
MSP: An FPGA-Specific Mixed-Scheme, Multi-Precision Deep Neural Network Quantization Framework Sep 16, 2020 Deep Learning Edge-computing
MUC-G4: Minimal Unsat Core-Guided Incremental Verification for Deep Neural Network Compression Jun 3, 2025 Neural Network Compression Quantization
MulCode: A Multiplicative Multi-way Model for Compressing Neural Language Model Nov 1, 2019 Language Modeling Language Modelling
MuLoCo: Muon is a practical inner optimizer for DiLoCo May 29, 2025 Decoder Quantization
Multi-Agent Consensus Subject to Communication and Privacy Constraints Feb 21, 2021 Quantization
Multi-bit Distributed Detection of Sparse Stochastic Signals over Error-Prone Reporting Channels Nov 6, 2024 Quantization
MultiCast: Zero-Shot Multivariate Time Series Forecasting Using LLMs May 23, 2024 Multivariate Time Series Forecasting Quantization
Multi-Feature Discrete Collaborative Filtering for Fast Cold-start Recommendation Mar 24, 2020 Collaborative Filtering Quantization
Multi-Layer Hierarchical Federated Learning with Quantization May 13, 2025 Federated Learning Quantization