Characterising Bias in Compressed Models Oct 6, 2020 Fairness Quantization
2-bit Conformer quantization for automatic speech recognition May 26, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Channel-Wise Mixed-Precision Quantization for Large Language Models Oct 16, 2024 Quantization
APCodec+: A Spectrum-Coding-Based High-Fidelity and High-Compression-Rate Neural Audio Codec with Staged Training Paradigm Oct 30, 2024 Decoder Quantization
Adaptive Proximal Gradient Methods for Structured Neural Networks Dec 1, 2021 Quantization
Channel-wise Hessian Aware trace-Weighted Quantization of Neural Networks Aug 19, 2020 AutoML Deep Reinforcement Learning
Channel Pruning In Quantization-aware Training: An Adaptive Projection-gradient Descent-shrinkage-splitting Method Apr 9, 2022 Quantization
Channel Estimation in MIMO Systems with One-bit Spatial Sigma-delta ADCs Sep 19, 2021 Quantization
APack: Off-Chip, Lossless Data Compression for Efficient Deep Learning Inference Jan 21, 2022 Data Compression Quantization
Efficient Quantum Approximate kNN Algorithm via Granular-Ball Computing May 29, 2025 Quantization
Efficient Vision-based Vehicle Speed Estimation May 2, 2025 Quantization vehicle detection
ELMGS: Enhancing memory and computation scaLability through coMpression for 3D Gaussian Splatting Oct 30, 2024 Quantization
End-to-end fully-binarized network design: from Generic Learned Thermometer to Block Pruning May 5, 2025 Knowledge Distillation Quantization
Enhancing Bridge Deck Delamination Detection Based on Aerial Thermography Through Grayscale Morphologic Reconstruction: A Case Study Apr 11, 2019 Clustering Quantization
Channel Estimation for MIMO Hybrid Architectures with Low Resolution ADCs for mmWave Communication Oct 30, 2019 Quantization
Channel Balancing for Accurate Quantization of Winograd Convolutions Jan 1, 2022 Quantization
Channel-Aware Constellation Design for Digital OTA Computation Jan 24, 2025 Quantization
Adaptive Precision Training: Quantify Back Propagation in Neural Networks with Fixed-point Numbers Nov 1, 2019 Image Classification Machine Translation
HAWKEYE: Adversarial Example Detector for Deep Neural Networks Sep 22, 2019 Quantization
Challenging GPU Dominance: When CPUs Outperform for On-Device LLM Inference May 9, 2025 CPU GPU
Order of Compression: A Systematic and Optimal Sequence to Combinationally Compress CNN Mar 26, 2024 Knowledge Distillation Model Compression
An Ultra-Efficient Memristor-Based DNN Framework with Structured Weight Pruning and Quantization Using ADMM Aug 29, 2019 Quantization
ANTLER: Bayesian Nonlinear Tensor Learning and Modeler for Unstructured, Varying-Size Point Cloud Data Feb 25, 2022 Dimensionality Reduction Quantization
Cell growth rate dictates the onset of glass to fluid-like transition and long time super-diffusion in an evolving cell colony Feb 14, 2018 Quantization
Adaptive Periodic Averaging: A Practical Approach to Reducing Communication in Distributed Learning Jul 13, 2020 GPU image-classification
Efficient Neural PDE-Solvers using Quantization Aware Training Aug 14, 2023 Quantization
Efficient On-the-fly Category Retrieval using ConvNets and GPUs Jul 17, 2014 Binarization GPU
CEGI: Measuring the trade-off between efficiency and carbon emissions for SLMs and VLMs Dec 3, 2024 Image Captioning Quantization
CEG4N: Counter-Example Guided Neural Network Quantization Refinement Jul 9, 2022 Quantization
CDQuant: Greedy Coordinate Descent for Accurate LLM Quantization Jun 25, 2024 Quantization
CDC: Classification Driven Compression for Bandwidth Efficient Edge-Cloud Collaborative Deep Learning May 4, 2020 Classification General Classification
An Overview on IEEE 802.11bf: WLAN Sensing Oct 20, 2023 Quantization
CBQ: Cross-Block Quantization for Large Language Models Dec 13, 2023 GPU Quantization
Causal Speech Enhancement with Predicting Semantics based on Quantized Self-supervised Learning Features Dec 26, 2024 Multi-Task Learning Quantization
An Overview of Neural Network Compression Jun 5, 2020 Knowledge Distillation Model Compression
Adaptive Low-Precision Training for Embeddings in Click-Through Rate Prediction Dec 12, 2022 Click-Through Rate Prediction Prediction
An Overview of Datatype Quantization Techniques for Convolutional Neural Networks Aug 22, 2018 Quantization
Starting Positions Matter: A Study on Better Weight Initialization for Neural Network Quantization Jun 12, 2025 Quantization
Efficient Neural Compression with Inference-time Decoding Jun 10, 2024 Decoder Quantization
Efficient Neural Networks for Tiny Machine Learning: A Comprehensive Review Nov 20, 2023 Model Compression Quantization
Efficient Point Transformer for Large-scale 3D Scene Understanding Sep 29, 2021 3D Semantic Segmentation Quantization
Efficiently Scaling Transformer Inference Nov 9, 2022 Quantization
Discrete Audio Tokens: More Than a Survey! Jun 12, 2025 Language Modeling Language Modelling
Efficient Machine Translation with Model Pruning and Quantization Nov 1, 2021 CPU Decoder
A Novel Unified Model for Multi-exposure Stereo Coding Based on Low Rank Tucker-ALS and 3D-HEVC Apr 10, 2021 Quantization
STBLLM: Breaking the 1-Bit Barrier with Structured Binary LLMs Aug 3, 2024 Binarization Computational Efficiency
Efficient Match Kernel between Sets of Features for Visual Recognition Dec 1, 2009 Quantization
Can Large Language Models Understand Context? Feb 1, 2024 In-Context Learning Quantization
A Novel Structure-Agnostic Multi-Objective Approach for Weight-Sharing Compression in Deep Neural Networks Jan 6, 2025 Neural Network Compression Quantization
Can General-Purpose Large Language Models Generalize to English-Thai Machine Translation ? Oct 22, 2024 Machine Translation Quantization