AnalogNAS-Bench: A NAS Benchmark for Analog In-Memory Computing Jun 23, 2025 Neural Architecture Search Quantization
Code Code Available 2hls4ml: An Open-Source Codesign Workflow to Empower Scientific Low-Power Machine Learning Devices Mar 9, 2021 BIG-bench Machine Learning Diagnostic
Code Code Available 2I-BERT: Integer-only BERT Quantization Jan 5, 2021 GPU Natural Language Inference
Code Code Available 2HAQ: Hardware-Aware Automated Quantization with Mixed Precision Nov 21, 2018 Quantization Reinforcement Learning
Code Code Available 2Harmonizing Visual Representations for Unified Multimodal Understanding and Generation Mar 27, 2025 Image Generation Quantization
Code Code Available 2GPTAQ: Efficient Finetuning-Free Quantization for Asymmetric Calibration Apr 3, 2025 GPU Quantization
Code Code Available 2A Closer Look at Hardware-Friendly Weight Quantization Oct 7, 2022 Quantization
Code Code Available 2Atom: Low-bit Quantization for Efficient and Accurate LLM Serving Oct 29, 2023 GPU Quantization
Code Code Available 2GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance May 11, 2025 Language Modeling Language Modelling
Code Code Available 2AiSAQ: All-in-Storage ANNS with Product Quantization for DRAM-free Information Retrieval Apr 9, 2024 All Information Retrieval
Code Code Available 2GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM Mar 8, 2024 Quantization
Code Code Available 2GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting Jan 26, 2025 Quantization
Code Code Available 2GENIUS: A Generative Framework for Universal Multimodal Search Mar 25, 2025 Information Retrieval Quantization
Code Code Available 2A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation Oct 2, 2024 Image Generation Quantization
Code Code Available 2any4: Learned 4-bit Numeric Representation for LLMs Jul 7, 2025 GPU GSM8K
Code Code Available 2Binarized Neural Machine Translation Feb 9, 2023 Binarization Machine Translation
Code Code Available 2Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs Feb 16, 2024 Quantization
Code Code Available 2From Tiny Machine Learning to Tiny Deep Learning: A Survey Jun 21, 2025 AutoML Model Optimization
Code Code Available 2GLARE: Low Light Image Enhancement via Generative Latent Feature based Codebook Retrieval Jul 17, 2024 Decoder Image Enhancement
Code Code Available 2RaBitQ: Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor Search May 21, 2024 Quantization
Code Code Available 2Fine-grained Data Distribution Alignment for Post-Training Quantization Sep 9, 2021 Quantization
Code Code Available 1Fine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement Learning Jun 5, 2024 Quantization Reinforcement Learning (RL)
Code Code Available 1Fine-tuning Quantized Neural Networks with Zeroth-order Optimization May 19, 2025 GPU Quantization
Code Code Available 14-bit Shampoo for Memory-Efficient Network Training May 28, 2024 image-classification Image Classification
Code Code Available 1A Greedy Algorithm for Quantizing Neural Networks Oct 29, 2020 Quantization
Code Code Available 1Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks Dec 30, 2021 CPU image-classification
Code Code Available 1Finite Scalar Quantization: VQ-VAE Made Simple Sep 27, 2023 Colorization Depth Estimation
Code Code Available 1Few shot font generation via transferring similarity guided global style and quantization local style Sep 2, 2023 Disentanglement Font Generation
Code Code Available 1Few-Bit Backward: Quantized Gradients of Activation Functions for Memory Footprint Reduction Feb 1, 2022 Neural Network Compression Quantization
Code Code Available 1FFNeRV: Flow-Guided Frame-Wise Neural Representations for Videos Dec 23, 2022 Model Compression Quantization
Code Code Available 1Federated Optimization Algorithms with Random Reshuffling and Gradient Compression Jun 14, 2022 Federated Learning Quantization
Code Code Available 1Feature Quantization Improves GAN Training Apr 5, 2020 Conditional Image Generation Face Generation
Code Code Available 1FedNew: A Communication-Efficient and Privacy-Preserving Newton-Type Method for Federated Learning Jun 17, 2022 Federated Learning Privacy Preserving
Code Code Available 1FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation Jun 13, 2025 Model Compression Quantization
Code Code Available 1Fixed-point Quantization of Convolutional Neural Networks for Quantized Inference on Embedded Platforms Feb 3, 2021 image-classification Image Classification
Code Code Available 1FAT: Learning Low-Bitwidth Parametric Representation via Frequency-Aware Transformation Feb 15, 2021 Model Compression Neural Network Compression
Code Code Available 1Fast-SNN: Fast Spiking Neural Network by Converting Quantized ANN May 31, 2023 image-classification Image Classification
Code Code Available 1Fast Nearest Convolution for Real-Time Efficient Image Super-Resolution Aug 24, 2022 Image Super-Resolution Quantization
Code Code Available 1FastText.zip: Compressing text classification models Dec 12, 2016 General Classification Quantization
Code Code Available 1Feature-based Federated Transfer Learning: Communication Efficiency, Robustness and Privacy May 15, 2024 Federated Learning image-classification
Code Code Available 1AFPQ: Asymmetric Floating Point Quantization for LLMs Nov 3, 2023 Quantization
Code Code Available 1Fast Lossless Neural Compression with Integer-Only Discrete Flows Jun 17, 2022 Quantization
Code Code Available 1Fast Distance-based Anomaly Detection in Images Using an Inception-like Autoencoder Mar 12, 2020 Anomaly Detection Quantization
Code Code Available 1Exploring Frequency-Inspired Optimization in Transformer for Efficient Single Image Super-Resolution Aug 9, 2023 Image Super-Resolution Quantization
Code Code Available 1AffineQuant: Affine Transformation Quantization for Large Language Models Mar 19, 2024 Quantization
Code Code Available 1Extremely Lightweight Quantization Robust Real-Time Single-Image Super Resolution for Mobile Devices May 21, 2021 image-classification Image Classification
Code Code Available 1Exploring Quantization for Efficient Pre-Training of Transformer Language Models Jul 16, 2024 Language Modeling Language Modelling
Code Code Available 1Exploring Parameter-Efficient Fine-Tuning Techniques for Code Generation with Large Language Models Aug 21, 2023 Code Generation In-Context Learning
Code Code Available 1Exploring the Connection Between Binary and Spiking Neural Networks Feb 24, 2020 Binarization Quantization
Code Code Available 1F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization Feb 10, 2022 Quantization
Code Code Available 1