High-Accuracy Low-Precision Training | Mar 9, 2018 | CPU, Quantization
Rethinking floating point for deep learning | Nov 1, 2018 | Deep Learning, Math
The Neural Network Pushdown Automaton: Model, Stack and Learning Simulations | Nov 15, 2017 | Quantization
Approximate spectral clustering density-based similarity for noisy datasets | Feb 22, 2023 | Clustering, Graph Clustering
CASP: Compression of Large Multimodal Models Based on Attention Sparsity | Mar 7, 2025 | Model Compression, Quantization
The Power of Negative Zero: Datatype Customization for Quantized Large Language Models | Jan 6, 2025 | Computational Efficiency, Quantization
Eliminating Quantization Errors in Classification-Based Sound Source Localization | Nov 21, 2023 | Classification, Quantization
Weighted quantization using MMD: From mean field to mean shift via gradient flows | Feb 14, 2025 | Clustering, Quantization
EAST: Encoding-Aware Sparse Training for Deep Memory Compression of ConvNets | Dec 20, 2019 | Quantization
EAQuant: Enhancing Post-Training Quantization for MoE Models via Expert-Aware Optimization | Jun 16, 2025 | Mixture-of-Experts, Model Compression
Probabilistic Weight Fixing: Large-scale training of neural network weight uncertainties for quantization | Sep 24, 2023 | Position, Quantization
Hierarchical Quantized Representations for Script Generation | Aug 28, 2018 | Decoder, Language Modeling
Revealing and Protecting Labels in Distributed Training | Oct 31, 2021 | Automatic Speech Recognition (ASR)
CoopNet: Cooperative Convolutional Neural Network for Low-Power MCUs | Nov 19, 2019 | Binarization, Quantization
The Quantization Model of Neural Scaling | Mar 23, 2023 | Language Modeling
DuFFin: A Dual-Level Fingerprinting Framework for LLMs IP Protection | May 22, 2025 | Quantization, Safety Alignment
Hierarchical Encoding of Sequential Data With Compact and Sub-Linear Storage Cost | Oct 1, 2019 | Quantization, Simultaneous Localization and Mapping
A Comprehensive Evaluation of Quantization Strategies for Large Language Models | Feb 26, 2024 | Language Modeling
Progressive DNN Compression: A Key to Achieve Ultra-High Weight Pruning and Quantization Rates using ADMM | Mar 23, 2019 | Model Compression, Quantization
Adversarial Fine-tuning of Compressed Neural Networks for Joint Improvement of Robustness and Efficiency | Mar 14, 2024 | Adversarial Robustness, Model Compression
Hessian Aware Quantization of Spiking Neural Networks | Apr 29, 2021 | Quantization
Convolutional Neural Networks to Enhance Coded Speech | Jun 25, 2018 | Quantization
Revisiting Multi-Codebook Quantization | May 21, 2021 | Quantization, Retrieval
Progressive Stochastic Binarization of Deep Networks | Apr 3, 2019 | Binarization, Network Pruning
Convert, compress, correct: Three steps toward communication-efficient DNN training | Mar 17, 2022 | Quantization
Revisiting Saliency Metrics: Farthest-Neighbor Area Under Curve | Feb 24, 2020 | Quantization, Saliency Detection
DSConv: Efficient Convolution Operator | Jan 7, 2019 | Quantization
Cartesian K-Means | Jun 1, 2013 | Clustering, Object Recognition
Vector quantization loss analysis in VQGANs: a single-GPU ablation study for image-to-image synthesis | Aug 9, 2023 | GPU, Image Generation
Properties that allow or prohibit transferability of adversarial attacks among quantized networks | May 15, 2024 | Quantization
ACIQ: Analytical Clipping for Integer Quantization of neural networks | May 1, 2019 | Quantization
Continuous-variable neural-network quantum states and the quantum rotor model | Jul 15, 2021 | Quantization, Variational Monte Carlo
DQRM: Deep Quantized Recommendation Models | Oct 26, 2024 | Quantization
RGCNN: Regularized Graph CNN for Point Cloud Segmentation | Jun 8, 2018 | Point Cloud Classification, Point Cloud Segmentation
Spiking Neural Networks in the Alexiewicz Topology: A New Perspective on Analysis and Error Bounds | May 9, 2023 | Quantization
HERO: Hessian-Enhanced Robust Optimization for Unifying and Improving Generalization and Quantization Performance | Nov 23, 2021 | Quantization
RILQ: Rank-Insensitive LoRA-based Quantization Error Compensation for Boosting 2-bit Large Language Model Accuracy | Dec 2, 2024 | Computational Efficiency, Language Modeling
ProxQuant: Quantized Neural Networks via Proximal Operators | Oct 1, 2018 | Quantization
Continual Learning for Generative Retrieval over Dynamic Corpora | Aug 29, 2023 | Continual Learning, Quantization
BRIDLE: Generalized Self-supervised Learning with Quantization | Feb 4, 2025 | Image Classification
Advanced Knowledge Transfer: Refined Feature Distillation for Zero-Shot Quantization in Edge Computing | Dec 26, 2024 | Edge-computing, Quantization
Context Unaware Knowledge Distillation for Image Retrieval | Jul 19, 2022 | Image Retrieval, Knowledge Distillation
HEAM: High-Efficiency Approximate Multiplier Optimization for Deep Neural Networks | Jan 20, 2022 | Quantization, Vocal Bursts Intensity Prediction
Spreading vectors for similarity search | Jun 8, 2018 | Quantization, Triplet
HDRUNet: Single Image HDR Reconstruction with Denoising and Dequantization | May 27, 2021 | Decoder, Denoising
Harnessing Large Language Models Locally: Empirical Results and Implications for AI PC | May 21, 2025 | CPU, Quantization
Pruning vs XNOR-Net: A Comprehensive Study of Deep Learning for Audio Classification on Edge-devices | Aug 13, 2021 | Audio Classification, Classification
SPT: Fine-Tuning Transformer-based Language Models Efficiently with Sparsification | Dec 16, 2023 | Quantization
Ps and Qs: Quantization-aware pruning for efficient low latency neural network inference | Feb 22, 2021 | Bayesian Optimization, Computational Efficiency
Hardware Acceleration for Real-Time Wildfire Detection Onboard Drone Networks | Jan 16, 2024 | Classification, Image Classification