A Refined Analysis of Massive Activations in LLMs Mar 28, 2025 Quantization
Code Code Available 1VADMamba: Exploring State Space Models for Fast Video Anomaly Detection Mar 27, 2025 Anomaly Detection Computational Efficiency
Code Code Available 1LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation Mar 25, 2025 Code Completion Language Modeling
Code Code Available 1QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge Mar 20, 2025 Depth Estimation Monocular Depth Estimation
Code Code Available 1PCGS: Progressive Compression of 3D Gaussian Splatting Mar 11, 2025 3DGS Novel View Synthesis
Code Code Available 1QuantCache: Adaptive Importance-Guided Quantization with Hierarchical Latent and Layer Caching for Video Generation Mar 9, 2025 Quantization Video Generation
Code Code Available 1QArtSR: Quantization via Reverse-Module and Timestep-Retraining in One-Step Diffusion based Image Super-Resolution Mar 7, 2025 Denoising Image Super-Resolution
Code Code Available 1RSQ: Learning from Important Tokens Leads to Better Quantized LLMs Mar 3, 2025 Quantization
Code Code Available 1Towards Lossless Implicit Neural Representation via Bit Plane Decomposition Feb 28, 2025 Image Compression Quantization
Code Code Available 1Oscillation-Reduced MXFP4 Training for Vision Transformers Feb 28, 2025 GPU Quantization
Code Code Available 1Automatic Joint Structured Pruning and Quantization for Efficient Neural Network Training and Compression Feb 23, 2025 Efficient Neural Network Quantization
Code Code Available 1CondiQuant: Condition Number Based Low-Bit Quantization for Image Super-Resolution Feb 21, 2025 Image Super-Resolution Quantization
Code Code Available 1PTQ1.61: Push the Real Limit of Extremely Low-Bit Post-Training Quantization Methods for Large Language Models Feb 18, 2025 Binarization Quantization
Code Code Available 1CalibQuant: 1-Bit KV Cache Quantization for Multimodal LLMs Feb 15, 2025 Computational Efficiency GPU
Code Code Available 1CISSIR: Beam Codebooks with Self-Interference Reduction Guarantees for Integrated Sensing and Communication Beyond 5G Feb 14, 2025 Integrated sensing and communication ISAC
Code Code Available 1SQ-GAN: Semantic Image Communications Using Masked Vector Quantization Feb 13, 2025 Image Compression Quantization
Code Code Available 1Cache Me If You Must: Adaptive Key-Value Quantization for Large Language Models Jan 31, 2025 GPU Quantization
Code Code Available 1Quantized Spike-driven Transformer Jan 23, 2025 Quantization
Code Code Available 1D^2-DPM: Dual Denoising for Quantized Diffusion Probabilistic Models Jan 14, 2025 Denoising Image Generation
Code Code Available 1kANNolo: Sweet and Smooth Approximate k-Nearest Neighbors Search Jan 10, 2025 Information Retrieval Quantization
Code Code Available 1DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models Jan 8, 2025 Quantization
Code Code Available 1HALO: Hadamard-Assisted Lower-Precision Optimization for LLMs Jan 5, 2025 Efficient Neural Network parameter-efficient fine-tuning
Code Code Available 1PTQ4VM: Post-Training Quantization for Visual Mamba Dec 29, 2024 Mamba Quantization
Code Code Available 1An Automatic Graph Construction Framework based on Large Language Models for Recommendation Dec 24, 2024 graph construction Quantization
Code Code Available 1Hierarchical Vector Quantization for Unsupervised Action Segmentation Dec 23, 2024 Action Segmentation Clustering
Code Code Available 1ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals Dec 18, 2024 Quantization
Code Code Available 1Relation-Guided Adversarial Learning for Data-free Knowledge Transfer Dec 16, 2024 Data-free Knowledge Distillation Data Free Quantization
Code Code Available 1MPQ-DM: Mixed Precision Quantization for Extremely Low Bit Diffusion Models Dec 16, 2024 Quantization
Code Code Available 1Lexico: Extreme KV Cache Compression via Sparse Coding over Universal Dictionaries Dec 12, 2024 4k GSM8K
Code Code Available 1BiDM: Pushing the Limit of Quantization for Diffusion Models Dec 8, 2024 Binarization Image Generation
Code Code Available 1Temporally Compressed 3D Gaussian Splatting for Dynamic Scenes Dec 7, 2024 Quantization
Code Code Available 1Improving Detail in Pluralistic Image Inpainting with Feature Dequantization Dec 2, 2024 Image Inpainting Quantization
Code Code Available 1DFRot: Achieving Outlier-Free and Massive Activation-Free for Rotated LLMs with Refined Rotation Dec 1, 2024 Quantization
Code Code Available 1Quantization without Tears Nov 21, 2024 GPU Quantization
Code Code Available 1MicroScopiQ: Accelerating Foundational Models through Outlier-Aware Microscaling Quantization Nov 8, 2024 Quantization
Code Code Available 1Privacy-Preserving Graph-Based Machine Learning with Fully Homomorphic Encryption for Collaborative Anti-Money Laundering Nov 5, 2024 Computational Efficiency Graph Neural Network
Code Code Available 1VQ-Map: Bird's-Eye-View Map Layout Estimation in Tokenized Discrete Space via Vector Quantization Nov 3, 2024 Quantization Representation Learning
Code Code Available 1Abstracted Shapes as Tokens -- A Generalizable and Interpretable Model for Time-series Classification Nov 1, 2024 Quantization Representation Learning
Code Code Available 1BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments Oct 31, 2024 Quantization
Code Code Available 1IntLoRA: Integral Low-rank Adaptation of Quantized Diffusion Models Oct 29, 2024 parameter-efficient fine-tuning Quantization
Code Code Available 1Vector Quantization Prompting for Continual Learning Oct 27, 2024 Continual Learning Quantization
Code Code Available 1Catastrophic Failure of LLM Unlearning via Quantization Oct 21, 2024 Machine Unlearning Quantization
Code Code Available 1Residual vector quantization for KV cache compression in large language model Oct 21, 2024 Audio Compression Language Modeling
Code Code Available 1EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search Oct 18, 2024 Model Compression Quantization
Code Code Available 1Learning Graph Quantized Tokenizers Oct 17, 2024 Graph Learning Quantization
Code Code Available 1Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs Oct 17, 2024 Quantization
Code Code Available 1Error Diffusion: Post Training Quantization with Block-Scaled Number Formats for Neural Networks Oct 15, 2024 Quantization
Code Code Available 1SLiM: One-shot Quantization and Sparsity with Low-rank Approximation for LLM Weight Compression Oct 12, 2024 Model Compression Natural Language Understanding
Code Code Available 1QT-DoG: Quantization-aware Training for Domain Generalization Oct 8, 2024 Domain Generalization Model Compression
Code Code Available 1Mitigating Adversarial Perturbations for Deep Reinforcement Learning via Vector Quantization Oct 4, 2024 Deep Reinforcement Learning Quantization
Code Code Available 1