NICE: Noise Injection and Clamping Estimation for Neural Network Quantization Sep 29, 2018 General Classification GPU
Continual Learning via Bit-Level Information Preserving May 10, 2021 Continual Learning Quantization
MBQuant: A Novel Multi-Branch Topology Method for Arbitrary Bit-width Network Quantization May 14, 2023 Quantization
Continuous Visual Autoregressive Generation via Score Maximization May 12, 2025 Quantization
Multi-Prize Lottery Ticket Hypothesis: Finding Accurate Binary Neural Networks by Pruning A Randomly Weighted Network Mar 17, 2021 Classification with Binary Neural Network Classification with Binary Weight Network
MxMoE: Mixed-precision Quantization for MoE with Accuracy and Performance Co-Design May 9, 2025 Mixture-of-Experts Quantization
Designing Large Foundation Models for Efficient Training and Inference: A Survey Sep 3, 2024 Knowledge Distillation Model Compression
Confounding Tradeoffs for Neural Network Quantization Feb 12, 2021 Quantization
ARB-LLM: Alternating Refined Binarizations for Large Language Models Oct 4, 2024 Binarization Quantization
LCS: Learning Compressible Subspaces for Adaptive Network Compression at Inference Time Oct 8, 2021 Quantization
Arch-Net: Model Distillation for Architecture Agnostic Model Deployment Nov 1, 2021 image-classification Image Classification
MPQ-DM: Mixed Precision Quantization for Extremely Low Bit Diffusion Models Dec 16, 2024 Quantization
Context-aware Communication for Multi-agent Reinforcement Learning Dec 25, 2023 Multi-agent Reinforcement Learning Quantization
MQBench: Towards Reproducible and Deployable Model Quantization Benchmark Nov 5, 2021 CPU GPU
N3H-Core: Neuron-designed Neural Network Accelerator via FPGA-based Heterogeneous Computing Cores Dec 15, 2021 Quantization
Graph Convolutional Network for Recommendation with Low-pass Collaborative Filters Jun 28, 2020 Quantization
COMQ: A Backpropagation-Free Algorithm for Post-Training Quantization Mar 11, 2024 Quantization
Learning Discrete Representations via Constrained Clustering for Effective and Efficient Dense Retrieval Oct 12, 2021 Clustering Constrained Clustering
Learning Cross-Scale Weighted Prediction for Efficient Neural Video Compression Dec 26, 2021 Motion Compensation Optical Flow Estimation
Compression with Bayesian Implicit Neural Representations May 30, 2023 Audio Compression Quantization
Learning from Students: Applying t-Distributions to Explore Accurate and Efficient Formats for LLMs May 6, 2024 Quantization
Learning Graph Quantized Tokenizers Oct 17, 2024 Graph Learning Quantization
A holistic approach to polyphonic music transcription with neural networks Oct 26, 2019 Beat Tracking Music Transcription
A Refined Analysis of Massive Activations in LLMs Mar 28, 2025 Quantization
Learning to Structure an Image with Few Colors Mar 17, 2020 Explainable artificial intelligence Image Compression
Learning Statistical Texture for Semantic Segmentation Mar 6, 2021 Quantization Segmentation
Learning to Groove with Inverse Sequence Transformations May 14, 2019 Generative Adversarial Network Quantization
Learning to Improve Image Compression without Changing the Standard Decoder Sep 27, 2020 Decoder Image Compression
CondiQuant: Condition Number Based Low-Bit Quantization for Image Super-Resolution Feb 21, 2025 Image Super-Resolution Quantization
L-GreCo: Layerwise-Adaptive Gradient Compression for Efficient and Accurate Deep Learning Oct 31, 2022 image-classification Image Classification
Compressing LLMs: The Truth is Rarely Pure and Never Simple Oct 2, 2023 Quantization Retrieval
Lexico: Extreme KV Cache Compression via Sparse Coding over Universal Dictionaries Dec 12, 2024 4k GSM8K
Conditional Coding and Variable Bitrate for Practical Learned Video Coding Apr 19, 2021 Decoder Quantization
ConveRT: Efficient and Accurate Conversational Representations from Transformers Nov 9, 2019 Conversational Response Selection intent-classification
NAPA-VQ: Neighborhood Aware Prototype Augmentation with Vector Quantization for Continual Learning Aug 18, 2023 class-incremental learning Class Incremental Learning
Training Multi-bit Quantized and Binarized Networks with A Learnable Symmetric Quantizer Apr 1, 2021 Binarization Quantization
NIPQ: Noise proxy-based Integrated Pseudo-Quantization Jun 2, 2022 Quantization
Transferable Sparse Adversarial Attack May 31, 2021 Adversarial Attack Quantization
Lightweight Super-Resolution Head for Human Pose Estimation Jul 31, 2023 Pose Estimation Quantization
Self-Adapting Large Visual-Language Models to Edge Devices across Visual Modalities Mar 7, 2024 Contrastive Learning Knowledge Distillation
Model-Aware Deep Architectures for One-Bit Compressive Variational Autoencoding Nov 27, 2019 Compressive Sensing Quantization
Model Compression Techniques in Biometrics Applications: A Survey Jan 18, 2024 Fairness Knowledge Distillation
Mixed-TD: Efficient Neural Network Accelerator with Layer-Specific Tensor Decomposition Jun 8, 2023 Efficient Neural Network Quantization
A Tale of Two Models: Constructing Evasive Attacks on Edge Models Apr 22, 2022 Quantization Vocal Bursts Valence Prediction
Mixed-Precision Quantization for Deep Vision Models with Integer Quadratic Programming Jul 11, 2023 Quantization Sensitivity
Model compression via distillation and quantization Feb 15, 2018 image-classification model
Mixed Non-linear Quantization for Vision Transformers Jul 26, 2024 Quantization
Agile-Quant: Activation-Guided Quantization for Faster Inference of LLMs on the Edge Dec 9, 2023 Language Modeling Language Modelling
Mitigating Quantization Errors Due to Activation Spikes in GLU-Based LLMs May 23, 2024 Quantization
Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization Apr 4, 2024 GPU Language Modeling