Exploiting LLM Quantization May 28, 2024 Code Generation Quantization
Code Code Available 1Object Discovery from Motion-Guided Tokens Mar 27, 2023 Decoder Object
Code Code Available 1Exploring the Connection Between Binary and Spiking Neural Networks Feb 24, 2020 Binarization Quantization
Code Code Available 1Evaluating the Generalization Ability of Quantized LLMs: Benchmark, Analysis, and Toolbox Jun 15, 2024 Quantization
Code Code Available 14-bit Shampoo for Memory-Efficient Network Training May 28, 2024 image-classification Image Classification
Code Code Available 1One Loss for All: Deep Hashing with a Single Cosine Similarity based Learning Objective Sep 29, 2021 All Deep Hashing
Code Code Available 1A Thorough Examination of Decoding Methods in the Era of LLMs Feb 10, 2024 Quantization
Code Code Available 1Online Learned Continual Compression with Adaptive Quantization Modules Nov 19, 2019 Continual Learning Decoder
Code Code Available 1AdANNS: A Framework for Adaptive Semantic Search May 30, 2023 Natural Questions Quantization
Code Code Available 1Optimal ANN-SNN Conversion for High-accuracy and Ultra-low-latency Spiking Neural Networks Mar 8, 2023 Quantization
Code Code Available 1Evaluation and Optimization of Gradient Compression for Distributed Deep Learning Jun 15, 2023 Deep Learning GPU
Code Code Available 1Error Diffusion: Post Training Quantization with Block-Scaled Number Formats for Neural Networks Oct 15, 2024 Quantization
Code Code Available 1AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer Jul 17, 2024 Instance Segmentation object-detection
Code Code Available 1EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search Oct 18, 2024 Model Compression Quantization
Code Code Available 1FAT: Learning Low-Bitwidth Parametric Representation via Frequency-Aware Transformation Feb 15, 2021 Model Compression Neural Network Compression
Code Code Available 1Enhancing Generalization of Universal Adversarial Perturbation through Gradient Aggregation Aug 11, 2023 Quantization
Code Code Available 1Environmental Sound Classification on the Edge: A Pipeline for Deep Acoustic Networks on Extremely Resource-Constrained Devices Mar 5, 2021 Audio Classification Environmental Sound Classification
Code Code Available 1EDA-DM: Enhanced Distribution Alignment for Post-Training Quantization of Diffusion Models Jan 9, 2024 Denoising Image Generation
Code Code Available 1Enabling Binary Neural Network Training on the Edge Feb 8, 2021 Quantization
Code Code Available 1EMQ: Evolving Training-free Proxies for Automated Mixed Precision Quantization Jul 20, 2023 Quantization
Code Code Available 1End-to-End Rate-Distortion Optimized 3D Gaussian Representation Apr 9, 2024 3DGS Quantization
Code Code Available 1Embedding in Recommender Systems: A Survey Oct 28, 2023 AutoML Collaborative Filtering
Code Code Available 1End-to-End Rate-Distortion Optimized Learned Hierarchical Bi-Directional Video Compression Dec 17, 2021 Motion Estimation MS-SSIM
Code Code Available 1EQ-Net: Elastic Quantization Neural Networks Aug 15, 2023 Quantization
Code Code Available 1And the Bit Goes Down: Revisiting the Quantization of Neural Networks Jul 12, 2019 CPU Quantization
Code Code Available 1Anchor-based Plain Net for Mobile Image Super-Resolution May 20, 2021 Image Super-Resolution Quantization
Code Code Available 1ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training Apr 29, 2021 Quantization
Code Code Available 1An Automatic Graph Construction Framework based on Large Language Models for Recommendation Dec 24, 2024 graph construction Quantization
Code Code Available 1Active Image Indexing Oct 5, 2022 Copy Detection Quantization
Code Code Available 1Efficient Quantized Sparse Matrix Operations on Tensor Cores Sep 14, 2022 GPU Quantization
Code Code Available 1Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs Oct 17, 2024 Quantization
Code Code Available 1HAWQ: Hessian AWare Quantization of Neural Networks with Mixed-Precision Apr 29, 2019 Quantization
Code Code Available 1Efficient and Robust Quantization-aware Training via Adaptive Coreset Selection Jun 12, 2023 Model Compression Quantization
Code Code Available 1Efficient-VDVAE: Less is more Mar 25, 2022 Image Generation Quantization
Code Code Available 1ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation Dec 31, 2021 Image Captioning Image Generation
Code Code Available 1Graph-less Neural Networks: Teaching Old MLPs New Tricks via Distillation Oct 17, 2021 Knowledge Distillation Node Classification
Code Code Available 1Edge AI-Based Vein Detector for Efficient Venipuncture in the Antecubital Fossa Oct 27, 2023 Quantization
Code Code Available 1EdgeQAT: Entropy and Distribution Guided Quantization-Aware Training for the Acceleration of Lightweight LLMs on the Edge Feb 16, 2024 Quantization
Code Code Available 1Dynamic Network Quantization for Efficient Video Inference Aug 23, 2021 Quantization Video Recognition
Code Code Available 1A Benchmark for Gaussian Splatting Compression and Quality Assessment Study Jul 19, 2024 Attribute Data Compression
Code Code Available 1EasyQuant: Post-training Quantization via Scale Optimization Jun 30, 2020 Quantization
Code Code Available 1EFaR 2023: Efficient Face Recognition Competition Aug 8, 2023 Face Recognition Lightweight Face Recognition
Code Code Available 1DQS3D: Densely-matched Quantization-aware Semi-supervised 3D Detection Apr 25, 2023 3D Object Detection object-detection
Code Code Available 1DVD-Quant: Data-free Video Diffusion Transformers Quantization May 24, 2025 Data Free Quantization Quantization
Code Code Available 1ABCD: Arbitrary Bitwise Coefficient for De-Quantization Jan 1, 2023 Quantization
Code Code Available 1DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and Quantization Mar 21, 2022 Knowledge Distillation Model Compression
Code Code Available 1Dynamic Dual Trainable Bounds for Ultra-low Precision Super-Resolution Networks Mar 8, 2022 Quantization Super-Resolution
Code Code Available 1Effectiveness of self-supervised pre-training for speech recognition Nov 10, 2019 Language Modelling Quantization
Code Code Available 1Do Emergent Abilities Exist in Quantized Large Language Models: An Empirical Study Jul 16, 2023 In-Context Learning Instruction Following
Code Code Available 1DNN+NeuroSim V2.0: An End-to-End Benchmarking Framework for Compute-in-Memory Accelerators for On-chip Training Mar 13, 2020 Benchmarking Quantization
Code Code Available 1