Cluster-Promoting Quantization with Bit-Drop for Minimizing Network Quantization Loss Sep 5, 2021 Quantization
— Unverified 0Clustering with Bregman Divergences: an Asymptotic Analysis Dec 1, 2016 Clustering Quantization
— Unverified 0Approximately Invertible Neural Network for Learned Image Compression Aug 30, 2024 Denoising Image Compression
— Unverified 0Adaptive Resolution Inference (ARI): Energy-Efficient Machine Learning for Internet of Things Aug 26, 2024 Quantization
— Unverified 0Clustering-Based Evolutionary Federated Multiobjective Optimization and Learning Apr 29, 2025 Clustering Diversity
— Unverified 0Approximate DCT and Quantization Techniques for Energy-Constrained Image Sensors Jun 24, 2024 Quantization
— Unverified 0Cluster-Based Cooperative Digital Over-the-Air Aggregation for Wireless Federated Edge Learning Aug 3, 2020 Decoder Diversity
— Unverified 0ClusComp: A Simple Paradigm for Model Compression and Efficient Finetuning Mar 17, 2025 GPU Model Compression
— Unverified 0Approaching Rate-Distortion Limits in Neural Compression with Lattice Transform Coding Mar 12, 2024 Quantization
— Unverified 01-Bit Compressive Sensing for Efficient Federated Learning Over the Air Mar 30, 2021 Compressive Sensing Dimensionality Reduction
— Unverified 0Efficient Compression of Multitask Multilingual Speech Models May 2, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Efficient Decoupled Feature 3D Gaussian Splatting via Hierarchical Compression Jan 1, 2025 3DGS Quantization
— Unverified 0Accelerating Deep Learning with Dynamic Data Pruning Nov 24, 2021 Attribute Deep Learning
— Unverified 0CLIP-Q: Deep Network Compression Learning by In-Parallel Pruning-Quantization Jun 1, 2018 image-classification Image Classification
— Unverified 0Efficient Arbitrary Precision Acceleration for Large Language Models on GPU Tensor Cores Sep 26, 2024 GPU Management
— Unverified 0Adaptive quantization with mixed-precision based on low-cost proxy Feb 27, 2024 Neural Architecture Search Quantization
— Unverified 02-Bit Random Projections, NonLinear Estimators, and Approximate Near Neighbor Search Feb 21, 2016 Quantization Re-Ranking
— Unverified 0Efficient Asynchronous Federated Learning with Sparsification and Quantization Dec 23, 2023 Federated Learning Quantization
— Unverified 0A Post-coder Feedback Approach to Overcome Training Asymmetry in MIMO-TDD Jul 22, 2020 Quantization
— Unverified 0Click-through Rate Prediction with Auto-Quantized Contrastive Learning Sep 27, 2021 Click-Through Rate Prediction Contrastive Learning
— Unverified 0Adaptive Quantization Resolution and Power Control for Federated Learning over Cell-free Networks Dec 14, 2024 Federated Learning Quantization
— Unverified 0Classification Accuracy Improvement for Neuromorphic Computing Systems with One-level Precision Synapses Jan 7, 2017 General Classification image-classification
— Unverified 0Class-based Quantization for Neural Networks Nov 27, 2022 Quantization
— Unverified 0Efficient ANN-SNN Conversion with Error Compensation Learning May 12, 2025 Quantization
— Unverified 0Apollo-Forecast: Overcoming Aliasing and Inference Speed Challenges in Language Models for Time Series Forecasting Dec 16, 2024 Quantization Time Series
— Unverified 0CLAP-ART: Automated Audio Captioning with Semantic-rich Audio Representation Tokenizer Jun 1, 2025 Audio captioning Language Modeling
— Unverified 0Adaptive Quantization of Neural Networks Jan 1, 2018 Edge-computing Model Compression
— Unverified 0CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech Apr 3, 2024 Language Modeling Language Modelling
— Unverified 0Accelerating Deep Learning Model Inference on Arm CPUs with Ultra-Low Bit Quantization and Runtime Jul 18, 2022 Quantization
— Unverified 0A Planck Radiation and Quantization Scheme for Human Cognition and Language Jan 10, 2022 Quantization
— Unverified 0Choose Your Model Size: Any Compression by a Single Gradient Descent Feb 3, 2025 Quantization
— Unverified 0Adaptive Quantization of Model Updates for Communication-Efficient Federated Learning Feb 8, 2021 Federated Learning Quantization
— Unverified 0Efficient and Workload-Aware LLM Serving via Runtime Layer Swapping and KV Cache Resizing May 24, 2025 Model Compression Quantization
— Unverified 0Efficient Batch Homomorphic Encryption for Vertically Federated XGBoost Dec 8, 2021 Federated Learning Quantization
— Unverified 0CHIME: A Compressive Framework for Holistic Interest Modeling Apr 9, 2025 Contrastive Learning Quantization
— Unverified 0Cherry on Top: Parameter Heterogeneity and Quantization in Large Language Models Apr 3, 2024 Quantization
— Unverified 0A Picture is Worth a Billion Bits: Real-Time Image Reconstruction from Dense Binary Pixels Oct 15, 2015 Image Reconstruction Quantization
— Unverified 0Cheetah: Mixed Low-Precision Hardware & Software Co-Design Framework for DNNs on the Edge Aug 6, 2019 Quantization
— Unverified 0Check-N-Run: A Checkpointing System for Training Deep Learning Recommendation Models Oct 17, 2020 Quantization Recommendation Systems
— Unverified 0Adaptive Quantization for Key Generation in Low-Power Wide-Area Networks Oct 11, 2023 Quantization
— Unverified 0Accelerating Deep Learning Inference via Freezing Feb 7, 2020 Deep Learning Quantization
— Unverified 0Characterizing the Accuracy -- Efficiency Trade-off of Low-rank Decomposition in Language Models May 10, 2024 AI Agent Model Compression
— Unverified 0Characterizing Coherent Integrated Photonic Neural Networks under Imperfections Jul 22, 2022 Quantization
— Unverified 0APG-MOS: Auditory Perception Guided-MOS Predictor for Synthetic Speech Apr 29, 2025 Quantization
— Unverified 0Characterization of the frequency response of channel-interleaved photonic ADCs based on the optical time-division demultiplexer Sep 3, 2021 Quantization
— Unverified 0Adaptive Quantization for Deep Neural Network Dec 4, 2017 Quantization
— Unverified 0An approach to optimize inference of the DIART speaker diarization pipeline Aug 5, 2024 Inference Optimization Knowledge Distillation
— Unverified 0Efficient and Effective Methods for Mixed Precision Neural Network Quantization for Faster, Energy-efficient Inference Jan 30, 2023 Efficient Neural Network Quantization
— Unverified 0A Performance Analysis of You Only Look Once Models for Deployment on Constrained Computational Edge Devices in Drone Applications Feb 6, 2025 NVIDIA Jetson Orin Nano object-detection
— Unverified 0Characterising Bias in Compressed Models Oct 6, 2020 Fairness Quantization
— Unverified 0