Exploiting Latent Properties to Optimize Neural Codecs Jan 2, 2025 Decoder Quantization
— Unverified 0MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization Jan 2, 2025 Contrastive Learning Key Detection
Code Code Available 3TabTreeFormer: Tabular Data Generation Using Hybrid Tree-Transformer Jan 2, 2025 Benchmarking Quantization
— Unverified 0BlockDialect: Block-wise Fine-grained Mixed Format Quantization for Energy-Efficient LLM Inference Jan 2, 2025 Quantization
Code Code Available 0PillarHist: A Quantization-aware Pillar Feature Encoder based on Height-aware Histogram Jan 1, 2025 3D Object Detection Autonomous Driving
— Unverified 0Pioneering 4-Bit FP Quantization for Diffusion Models: Mixup-Sign Quantization and Timestep-Aware Fine-Tuning Jan 1, 2025 Denoising Quantization
— Unverified 0Self-Supervised Learning for Color Spike Camera Reconstruction Jan 1, 2025 Motion Estimation Quantization
Code Code Available 0Enhancing Diversity for Data-free Quantization Jan 1, 2025 Data Free Quantization Diversity
— Unverified 0Multirate Neural Image Compression with Adaptive Lattice Vector Quantization Jan 1, 2025 Domain Adaptation Image Compression
— Unverified 0Secret Lies in Color: Enhancing AI-Generated Images Detection with Color Distribution Analysis Jan 1, 2025 Image Restoration Misinformation
— Unverified 0Frequency-Biased Synergistic Design for Image Compression and Compensation Jan 1, 2025 Image Compression Quantization
— Unverified 0CacheQuant: Comprehensively Accelerated Diffusion Models Jan 1, 2025 Image Generation Quantization
— Unverified 0STEPS: Sequential Probability Tensor Estimation for Text-to-Image Hard Prompt Search Jan 1, 2025 Computational Efficiency Quantization
— Unverified 0DynScene: Scalable Generation of Dynamic Robotic Manipulation Scenes for Embodied AI Jan 1, 2025 Dataset Generation Diversity
— Unverified 0Rethinking Diffusion for Text-Driven Human Motion Generation: Redundant Representations, Evaluation, and Masked Autoregression Jan 1, 2025 Motion Generation Quantization
— Unverified 0Efficient Decoupled Feature 3D Gaussian Splatting via Hierarchical Compression Jan 1, 2025 3DGS Quantization
— Unverified 0Intuitive Analysis of the Quantization-based Optimization: From Stochastic and Quantum Mechanical Perspective Dec 31, 2024 global-optimization Quantization
— Unverified 0PQD: Post-training Quantization for Efficient Diffusion Models Dec 30, 2024 Diversity Image Generation
— Unverified 0DoTA: Weight-Decomposed Tensor Adaptation for Large Language Models Dec 30, 2024 Arithmetic Reasoning Quantization
— Unverified 0Accelerating Energy-Efficient Federated Learning in Cell-Free Networks with Adaptive Quantization Dec 30, 2024 Federated Learning Quantization
— Unverified 0Improving Acoustic Scene Classification in Low-Resource Conditions Dec 30, 2024 Acoustic Scene Classification Classification
— Unverified 0PTQ4VM: Post-Training Quantization for Visual Mamba Dec 29, 2024 Mamba Quantization
Code Code Available 1IMSSA: Deploying modern state-space models on memristive in-memory compute hardware Dec 28, 2024 GPU Quantization
— Unverified 0Pushing the Envelope of Low-Bit LLM via Dynamic Error Compensation Dec 28, 2024 CPU GPU
— Unverified 0Data-Free Group-Wise Fully Quantized Winograd Convolution via Learnable Scales Dec 27, 2024 image-classification Image Classification
— Unverified 0A Survey on Large Language Model Acceleration based on KV Cache Management Dec 27, 2024 Language Modeling Language Modelling
Code Code Available 3MBQ: Modality-Balanced Quantization for Large Vision-Language Models Dec 27, 2024 GPU Quantization
Code Code Available 2Causal Speech Enhancement with Predicting Semantics based on Quantized Self-supervised Learning Features Dec 26, 2024 Multi-Task Learning Quantization
— Unverified 0Semantic Residual for Multimodal Unified Discrete Representation Dec 26, 2024 Disentanglement Quantization
— Unverified 0Advanced Knowledge Transfer: Refined Feature Distillation for Zero-Shot Quantization in Edge Computing Dec 26, 2024 Edge-computing Quantization
Code Code Available 0Resource-Efficient Transformer Architecture: Optimizing Memory and Execution Time for Real-Time Applications Dec 25, 2024 Quantization
— Unverified 0Recommending Pre-Trained Models for IoT Devices Dec 25, 2024 Model Selection Quantization
— Unverified 01.58-bit FLUX Dec 24, 2024 Computational Efficiency Image Generation
— Unverified 0Achieving Robustness in Blind Modulo Analog-to-Digital Conversion Dec 24, 2024 Quantization
— Unverified 0Unified Stochastic Framework for Neural Network Quantization and Pruning Dec 24, 2024 Quantization
— Unverified 0An Automatic Graph Construction Framework based on Large Language Models for Recommendation Dec 24, 2024 graph construction Quantization
Code Code Available 1LSAQ: Layer-Specific Adaptive Quantization for Large Language Model Deployment Dec 24, 2024 Language Modeling Language Modelling
— Unverified 0Highly Optimized Kernels and Fine-Grained Codebooks for LLM Inference on Arm CPUs Dec 23, 2024 Quantization
Code Code Available 0GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference Dec 23, 2024 GPU Language Modeling
— Unverified 0Hierarchical Vector Quantization for Unsupervised Action Segmentation Dec 23, 2024 Action Segmentation Clustering
Code Code Available 1The HalluRAG Dataset: Detecting Closed-Domain Hallucinations in RAG Applications Using an LLM's Internal States Dec 22, 2024 Quantization RAG
Code Code Available 0Adaptive Dataset Quantization Dec 22, 2024 Contrastive Learning Dataset Distillation
— Unverified 0TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models Dec 21, 2024 Quantization Video Generation
— Unverified 0Semantics Prompting Data-Free Quantization for Low-Bit Vision Transformers Dec 21, 2024 Data Free Quantization Model Compression
— Unverified 0Improving Quantization-aware Training of Low-Precision Network via Block Replacement on Full-Precision Counterpart Dec 20, 2024 Quantization
— Unverified 0Log-Time K-Means Clustering for 1D Data: Novel Approaches with Proof and Implementation Dec 19, 2024 Clustering Quantization
Code Code Available 0MixLLM: LLM Quantization with Global Mixed-precision between Output-features and Highly-efficient System Design Dec 19, 2024 MMLU Quantization
— Unverified 0Qua^2SeDiMo: Quantifiable Quantization Sensitivity of Diffusion Models Dec 19, 2024 Denoising Image Generation
— Unverified 0Preventing Local Pitfalls in Vector Quantization via Optimal Transport Dec 19, 2024 Image Reconstruction Quantization
Code Code Available 2Progressive Fine-to-Coarse Reconstruction for Accurate Low-Bit Post-Training Quantization in Vision Transformers Dec 19, 2024 Instance Segmentation POS
— Unverified 0