Adaptive Precision Training: Quantify Back Propagation in Neural Networks with Fixed-point Numbers Nov 1, 2019 Image Classification Machine Translation
— Unverified 00 Challenging GPU Dominance: When CPUs Outperform for On-Device LLM Inference May 9, 2025 CPU GPU
— Unverified 00 Empirical Evaluation of Post-Training Quantization Methods for Language Tasks Oct 29, 2022 Attribute Quantization
— Unverified 00 Order of Compression: A Systematic and Optimal Sequence to Combinationally Compress CNN Mar 26, 2024 Knowledge Distillation Model Compression
— Unverified 00 An Ultra-Efficient Memristor-Based DNN Framework with Structured Weight Pruning and Quantization Using ADMM Aug 29, 2019 Quantization
— Unverified 00 Emotion Recognition Using Speaker Cues Feb 4, 2020 Emotion Recognition Quantization
— Unverified 00 Emergent Quantized Communication Nov 4, 2022 Quantization
— Unverified 00 Embedding Compression with Isotropic Iterative Quantization Jan 11, 2020 Image Retrieval Quantization
— Unverified 00 Cell growth rate dictates the onset of glass to fluid-like transition and long time super-diffusion in an evolving cell colony Feb 14, 2018 Quantization
— Unverified 00 ANTLER: Bayesian Nonlinear Tensor Learning and Modeler for Unstructured, Varying-Size Point Cloud Data Feb 25, 2022 Dimensionality Reduction Quantization
— Unverified 00 Adaptive Periodic Averaging: A Practical Approach to Reducing Communication in Distributed Learning Jul 13, 2020 GPU image-classification
— Unverified 00 Embedding Compression for Efficient Re-Identification May 23, 2024 Dimensionality Reduction Quantization
— Unverified 00 Embedded Phase Shifting: Robust Phase Shifting With Embedded Signals Jun 1, 2015 Math Quantization
— Unverified 00 CEGI: Measuring the trade-off between efficiency and carbon emissions for SLMs and VLMs Dec 3, 2024 Image Captioning Quantization
— Unverified 00 ELMGS: Enhancing memory and computation scaLability through coMpression for 3D Gaussian Splatting Oct 30, 2024 Quantization
— Unverified 00 CEG4N: Counter-Example Guided Neural Network Quantization Refinement Jul 9, 2022 Quantization
— Unverified 00 Elastic Significant Bit Quantization and Acceleration for Deep Neural Networks Sep 8, 2021 CPU GPU
— Unverified 00 CDQuant: Greedy Coordinate Descent for Accurate LLM Quantization Jun 25, 2024 Quantization
— Unverified 00 Ef-QuantFace: Streamlined Face Recognition with Small Data and Low-Bit Precision Feb 28, 2024 Face Recognition Quantization
— Unverified 00 EfQAT: An Efficient Framework for Quantization-Aware Training Nov 17, 2024 Quantization
— Unverified 00 CDC: Classification Driven Compression for Bandwidth Efficient Edge-Cloud Collaborative Deep Learning May 4, 2020 Classification General Classification
— Unverified 00 An Overview on IEEE 802.11bf: WLAN Sensing Oct 20, 2023 Quantization
— Unverified 00 Efficient-VQGAN: Towards High-Resolution Image Generation with Efficient Vision Transformers Oct 9, 2023 Image Generation Image Reconstruction
— Unverified 00 CBQ: Cross-Block Quantization for Large Language Models Dec 13, 2023 GPU Quantization
— Unverified 00 Efficient Vision-based Vehicle Speed Estimation May 2, 2025 Quantization vehicle detection
— Unverified 00 Causal Speech Enhancement with Predicting Semantics based on Quantized Self-supervised Learning Features Dec 26, 2024 Multi-Task Learning Quantization
— Unverified 00 An Overview of Neural Network Compression Jun 5, 2020 Knowledge Distillation Model Compression
— Unverified 00 Efficient Systolic Array Based on Decomposable MAC for Quantized Deep Neural Networks Jan 1, 2020 Quantization
— Unverified 00 Efficient Super Resolution Using Binarized Neural Network Dec 16, 2018 Binarization image-classification
— Unverified 00 An Overview of Datatype Quantization Techniques for Convolutional Neural Networks Aug 22, 2018 Quantization
— Unverified 00 Adaptive Low-Precision Training for Embeddings in Click-Through Rate Prediction Dec 12, 2022 Click-Through Rate Prediction Prediction
— Unverified 00 Starting Positions Matter: A Study on Better Weight Initialization for Neural Network Quantization Jun 12, 2025 Quantization
— Unverified 00 Efficient Storage of Fine-Tuned Models via Low-Rank Approximation of Weight Residuals May 28, 2023 Quantization
— Unverified 00 Efficient Speech Representation Learning with Low-Bit Quantization Dec 14, 2022 Model Compression Quantization
— Unverified 00 Efficient Approximate Search for Sets of Vectors Jul 14, 2021 Quantization
— Unverified 00 Efficient Quantum Approximate kNN Algorithm via Granular-Ball Computing May 29, 2025 Quantization
— Unverified 00 A Novel Unified Model for Multi-exposure Stereo Coding Based on Low Rank Tucker-ALS and 3D-HEVC Apr 10, 2021 Quantization
— Unverified 00 Efficient Quantization Strategies for Latent Diffusion Models Dec 9, 2023 Image Generation Quantization
— Unverified 00 Can Large Language Models Understand Context? Feb 1, 2024 In-Context Learning Quantization
— Unverified 00 A Novel Structure-Agnostic Multi-Objective Approach for Weight-Sharing Compression in Deep Neural Networks Jan 6, 2025 Neural Network Compression Quantization
— Unverified 00 Efficient Point Transformer for Large-scale 3D Scene Understanding Sep 29, 2021 3D Semantic Segmentation Quantization
— Unverified 00 Efficient On-the-fly Category Retrieval using ConvNets and GPUs Jul 17, 2014 Binarization GPU
— Unverified 00 Can General-Purpose Large Language Models Generalize to English-Thai Machine Translation ? Oct 22, 2024 Machine Translation Quantization
— Unverified 00 A Novel Physics-based Channel Model for Reconfigurable Intelligent Surface-assisted Multi-user Communication Systems Aug 3, 2020 Quantization
— Unverified 00 Adaptive Joint Optimization for 3D Reconstruction with Differentiable Rendering Aug 15, 2022 3D Reconstruction Quantization
— Unverified 00 Efficient Neural PDE-Solvers using Quantization Aware Training Aug 14, 2023 Quantization
— Unverified 00 Efficient Neural Networks for Tiny Machine Learning: A Comprehensive Review Nov 20, 2023 Model Compression Quantization
— Unverified 00 Cancer Subtyping via Embedded Unsupervised Learning on Transcriptomics Data Apr 2, 2022 Quantization
— Unverified 00 Efficient Neural Compression with Inference-time Decoding Jun 10, 2024 Decoder Quantization
— Unverified 00 CAMBI: Contrast-aware Multiscale Banding Index Jan 29, 2021 Quantization Sensitivity
— Unverified 00