Adaptive Data-Free Quantization Mar 13, 2023 Data Free Quantization Quantization
Code Code Available 1End-to-End Rate-Distortion Optimized Learned Hierarchical Bi-Directional Video Compression Dec 17, 2021 Motion Estimation MS-SSIM
Code Code Available 1Efficient Quantized Sparse Matrix Operations on Tensor Cores Sep 14, 2022 GPU Quantization
Code Code Available 1LCS: Learning Compressible Subspaces for Adaptive Network Compression at Inference Time Oct 8, 2021 Quantization
Code Code Available 1Efficient-VDVAE: Less is more Mar 25, 2022 Image Generation Quantization
Code Code Available 1Learning Architectures for Binary Networks Feb 17, 2020 Quantization
Code Code Available 1Learning Cross-Scale Weighted Prediction for Efficient Neural Video Compression Dec 26, 2021 Motion Compensation Optical Flow Estimation
Code Code Available 1Learning Discrete Representations via Constrained Clustering for Effective and Efficient Dense Retrieval Oct 12, 2021 Clustering Constrained Clustering
Code Code Available 1Learning Statistical Texture for Semantic Segmentation Mar 6, 2021 Quantization Segmentation
Code Code Available 1Learning to Groove with Inverse Sequence Transformations May 14, 2019 Generative Adversarial Network Quantization
Code Code Available 1Abstracted Shapes as Tokens -- A Generalizable and Interpretable Model for Time-series Classification Nov 1, 2024 Quantization Representation Learning
Code Code Available 1Least squares binary quantization of neural networks Jan 9, 2020 Quantization
Code Code Available 1Efficient and Robust Quantization-aware Training via Adaptive Coreset Selection Jun 12, 2023 Model Compression Quantization
Code Code Available 1EDA-DM: Enhanced Distribution Alignment for Post-Training Quantization of Diffusion Models Jan 9, 2024 Denoising Image Generation
Code Code Available 1Exploiting LLM Quantization May 28, 2024 Code Generation Quantization
Code Code Available 1Lightweight Super-Resolution Head for Human Pose Estimation Jul 31, 2023 Pose Estimation Quantization
Code Code Available 1Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks Dec 30, 2021 CPU image-classification
Code Code Available 1Effectiveness of self-supervised pre-training for speech recognition Nov 10, 2019 Language Modelling Quantization
Code Code Available 1EFaR 2023: Efficient Face Recognition Competition Aug 8, 2023 Face Recognition Lightweight Face Recognition
Code Code Available 1LLMEasyQuant: Scalable Quantization for Parallel and Distributed LLM Inference Jun 28, 2024 GPU Quantization
Code Code Available 1Animation from Blur: Multi-modal Blur Decomposition with Motion Guidance Jul 20, 2022 Optical Flow Estimation Quantization
Code Code Available 1EdgeQAT: Entropy and Distribution Guided Quantization-Aware Training for the Acceleration of Lightweight LLMs on the Edge Feb 16, 2024 Quantization
Code Code Available 1Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads on Consumer-Grade Devices Oct 2, 2024 GPU Language Modeling
Code Code Available 1EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models Oct 5, 2023 Denoising Image Generation
Code Code Available 1Dynamic Network Quantization for Efficient Video Inference Aug 23, 2021 Quantization Video Recognition
Code Code Available 1Adapting LLaMA Decoder to Vision Transformer Apr 10, 2024 Computational Efficiency Decoder
Code Code Available 1"Lossless" Compression of Deep Neural Networks: A High-dimensional Neural Tangent Kernel Approach Mar 1, 2024 Model Compression Quantization
Code Code Available 1EasyQuant: Post-training Quantization via Scale Optimization Jun 30, 2020 Quantization
Code Code Available 1DVD-Quant: Data-free Video Diffusion Transformers Quantization May 24, 2025 Data Free Quantization Quantization
Code Code Available 1Dynamic Dual Trainable Bounds for Ultra-low Precision Super-Resolution Networks Mar 8, 2022 Quantization Super-Resolution
Code Code Available 1Edge AI-Based Vein Detector for Efficient Venipuncture in the Antecubital Fossa Oct 27, 2023 Quantization
Code Code Available 1Machine Unlearning of Federated Clusters Oct 28, 2022 Clustering Federated Learning
Code Code Available 1DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and Quantization Mar 21, 2022 Knowledge Distillation Model Compression
Code Code Available 1Catastrophic Failure of LLM Unlearning via Quantization Oct 21, 2024 Machine Unlearning Quantization
Code Code Available 1BAFFLE: A Baseline of Backpropagation-Free Federated Learning Jan 28, 2023 Federated Learning Quantization
Code Code Available 1Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation Aug 3, 2023 Decoder Quantization
Code Code Available 1Injecting Domain Adaptation with Learning-to-hash for Effective and Efficient Zero-shot Dense Retrieval May 23, 2022 Ad-Hoc Information Retrieval CPU
Code Code Available 1Matrix Compression via Randomized Low Rank and Low Precision Factorization Oct 17, 2023 Image Compression Quantization
Code Code Available 1DQS3D: Densely-matched Quantization-aware Semi-supervised 3D Detection Apr 25, 2023 3D Object Detection object-detection
Code Code Available 1Diverse Sample Generation: Pushing the Limit of Generative Data-free Quantization Sep 1, 2021 Data Free Quantization image-classification
Code Code Available 1AdANNS: A Framework for Adaptive Semantic Search May 30, 2023 Natural Questions Quantization
Code Code Available 1DNN+NeuroSim V2.0: An End-to-End Benchmarking Framework for Compute-in-Memory Accelerators for On-chip Training Mar 13, 2020 Benchmarking Quantization
Code Code Available 1DiTAS: Quantizing Diffusion Transformers via Enhanced Activation Smoothing Sep 12, 2024 Image Generation Quantization
Code Code Available 1MetaQuant: Learning to Quantize by Learning to Penetrate Non-differentiable Quantization Dec 1, 2019 Quantization
Code Code Available 1AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer Jul 17, 2024 Instance Segmentation object-detection
Code Code Available 1Do Emergent Abilities Exist in Quantized Large Language Models: An Empirical Study Jul 16, 2023 In-Context Learning Instruction Following
Code Code Available 1Mind the Gap: A Practical Attack on GGUF Quantization May 24, 2025 Code Generation Quantization
Code Code Available 1Mini-GPTs: Efficient Large Language Models through Contextual Pruning Dec 20, 2023 Articles Quantization
Code Code Available 1Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance Mar 16, 2022 GPU Quantization
Code Code Available 1Disentanglement via Latent Quantization May 28, 2023 Disentanglement Inductive Bias
Code Code Available 1