LiDAR-PTQ: Post-Training Quantization for Point Cloud 3D Object Detection Jan 29, 2024 3D Object Detection Autonomous Vehicles
Code Code Available 25 HAQ: Hardware-Aware Automated Quantization with Mixed Precision Nov 21, 2018 Quantization Reinforcement Learning
Code Code Available 25 Harmonizing Visual Representations for Unified Multimodal Understanding and Generation Mar 27, 2025 Image Generation Quantization
Code Code Available 25 GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance May 11, 2025 Language Modeling Language Modelling
Code Code Available 25 A Closer Look at Hardware-Friendly Weight Quantization Oct 7, 2022 Quantization
Code Code Available 25 GPTAQ: Efficient Finetuning-Free Quantization for Asymmetric Calibration Apr 3, 2025 GPU Quantization
Code Code Available 25 An Empirical Study of Qwen3 Quantization May 4, 2025 Natural Language Understanding Quantization
Code Code Available 25 AiSAQ: All-in-Storage ANNS with Product Quantization for DRAM-free Information Retrieval Apr 9, 2024 All Information Retrieval
Code Code Available 25 GENIUS: A Generative Framework for Universal Multimodal Search Mar 25, 2025 Information Retrieval Quantization
Code Code Available 25 GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting Jan 26, 2025 Quantization
Code Code Available 25 GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM Mar 8, 2024 Quantization
Code Code Available 25 GLARE: Low Light Image Enhancement via Generative Latent Feature based Codebook Retrieval Jul 17, 2024 Decoder Image Enhancement
Code Code Available 25 BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation Jun 9, 2025 Quantization Vision-Language-Action
Code Code Available 25 From Tiny Machine Learning to Tiny Deep Learning: A Survey Jun 21, 2025 AutoML Model Optimization
Code Code Available 25 AnalogNAS-Bench: A NAS Benchmark for Analog In-Memory Computing Jun 23, 2025 Neural Architecture Search Quantization
Code Code Available 25 LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS Nov 28, 2023 Knowledge Distillation NeRF
Code Code Available 25 On-Device Training Under 256KB Memory Jun 30, 2022 Lifelong learning Quantization
Code Code Available 25 RaBitQ: Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor Search May 21, 2024 Quantization
Code Code Available 25 Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs Feb 16, 2024 Quantization
Code Code Available 25 any4: Learned 4-bit Numeric Representation for LLMs Jul 7, 2025 GPU GSM8K
Code Code Available 25 Designing Large Foundation Models for Efficient Training and Inference: A Survey Sep 3, 2024 Knowledge Distillation Model Compression
Code Code Available 15 Confounding Tradeoffs for Neural Network Quantization Feb 12, 2021 Quantization
Code Code Available 15 Fine-tuning Quantized Neural Networks with Zeroth-order Optimization May 19, 2025 GPU Quantization
Code Code Available 15 4-bit Shampoo for Memory-Efficient Network Training May 28, 2024 image-classification Image Classification
Code Code Available 15 A Greedy Algorithm for Quantizing Neural Networks Oct 29, 2020 Quantization
Code Code Available 15 Conditional Coding and Variable Bitrate for Practical Learned Video Coding Apr 19, 2021 Decoder Quantization
Code Code Available 15 Finite Scalar Quantization: VQ-VAE Made Simple Sep 27, 2023 Colorization Depth Estimation
Code Code Available 15 Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks Dec 30, 2021 CPU image-classification
Code Code Available 15 COMQ: A Backpropagation-Free Algorithm for Post-Training Quantization Mar 11, 2024 Quantization
Code Code Available 15 Fine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement Learning Jun 5, 2024 Quantization Reinforcement Learning (RL)
Code Code Available 15 Compression with Bayesian Implicit Neural Representations May 30, 2023 Audio Compression Quantization
Code Code Available 15 CondiQuant: Condition Number Based Low-Bit Quantization for Image Super-Resolution Feb 21, 2025 Image Super-Resolution Quantization
Code Code Available 15 Fine-grained Data Distribution Alignment for Post-Training Quantization Sep 9, 2021 Quantization
Code Code Available 15 Fixed-point Quantization of Convolutional Neural Networks for Quantized Inference on Embedded Platforms Feb 3, 2021 image-classification Image Classification
Code Code Available 15 Few shot font generation via transferring similarity guided global style and quantization local style Sep 2, 2023 Disentanglement Font Generation
Code Code Available 15 FedNew: A Communication-Efficient and Privacy-Preserving Newton-Type Method for Federated Learning Jun 17, 2022 Federated Learning Privacy Preserving
Code Code Available 15 Compressing LLMs: The Truth is Rarely Pure and Never Simple Oct 2, 2023 Quantization Retrieval
Code Code Available 15 Few-Bit Backward: Quantized Gradients of Activation Functions for Memory Footprint Reduction Feb 1, 2022 Neural Network Compression Quantization
Code Code Available 15 FFNeRV: Flow-Guided Frame-Wise Neural Representations for Videos Dec 23, 2022 Model Compression Quantization
Code Code Available 15 Compress Any Segment Anything Model (SAM) Jul 11, 2025 model Quantization
Code Code Available 15 AFPQ: Asymmetric Floating Point Quantization for LLMs Nov 3, 2023 Quantization
Code Code Available 15 Feature Quantization Improves GAN Training Apr 5, 2020 Conditional Image Generation Face Generation
Code Code Available 15 Comprehensive Graph-conditional Similarity Preserving Network for Unsupervised Cross-modal Hashing Dec 25, 2020 Quantization Retrieval
Code Code Available 15 Context-aware Communication for Multi-agent Reinforcement Learning Dec 25, 2023 Multi-agent Reinforcement Learning Quantization
Code Code Available 15 Federated Optimization Algorithms with Random Reshuffling and Gradient Compression Jun 14, 2022 Federated Learning Quantization
Code Code Available 15 FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation Jun 13, 2025 Model Compression Quantization
Code Code Available 15 FastText.zip: Compressing text classification models Dec 12, 2016 General Classification Quantization
Code Code Available 15 AffineQuant: Affine Transformation Quantization for Large Language Models Mar 19, 2024 Quantization
Code Code Available 15 FAT: Learning Low-Bitwidth Parametric Representation via Frequency-Aware Transformation Feb 15, 2021 Model Compression Neural Network Compression
Code Code Available 15 Compact representations of convolutional neural networks via weight pruning and quantization Aug 28, 2021 Quantization
Code Code Available 15