MCRB for Parameter Estimation from One-Bit Quantized and Oversampled Measurements Mar 28, 2025 Direction of Arrival Estimation parameter estimation
— Unverified 0Make Some Noise: Towards LLM audio reasoning and generation using sound tokens Mar 28, 2025 Audio Generation Quantization
— Unverified 0Long-Tail Crisis in Nearest Neighbor Language Models Mar 28, 2025 Language Modeling Language Modelling
— Unverified 0MoQa: Rethinking MoE Quantization with Multi-stage Data-model Distribution Awareness Mar 27, 2025 Language Modeling Language Modelling
— Unverified 0A 71.2-μW Speech Recognition Accelerator with Recurrent Spiking Neural Network Mar 27, 2025 Quantization speech-recognition
— Unverified 0Q-MambaIR: Accurate Quantized Mamba for Efficient Image Restoration Mar 27, 2025 Computational Efficiency Image Restoration
— Unverified 0HOT: Hadamard-based Optimized Training Mar 27, 2025 Quantization
Code Code Available 0MAR-3D: Progressive Masked Auto-regressor for High-Resolution 3D Generation Mar 26, 2025 3D Generation Denoising
— Unverified 0SINR: Sparsity Driven Compressed Implicit Neural Representations Mar 25, 2025 Quantization
— Unverified 0QUAD: Quantization and Parameter-Efficient Tuning of LLM with Activation Decomposition Mar 25, 2025 parameter-efficient fine-tuning Quantization
Code Code Available 0Oaken: Fast and Efficient LLM Serving with Online-Offline Hybrid KV Cache Quantization Mar 24, 2025 GPU Large Language Model
— Unverified 0QSID-MPC: Model Predictive Control with System Identification from Quantized Data Mar 24, 2025 Model Predictive Control Quantization
— Unverified 0GranQ: Granular Zero-Shot Quantization with Channel-Wise Activation Scaling in QAT Mar 24, 2025 Neural Network Compression Quantization
— Unverified 0FFN Fusion: Rethinking Sequential Computation in Large Language Models Mar 24, 2025 Quantization
— Unverified 04DGC: Rate-Aware 4D Gaussian Compression for Efficient Streamable Free-Viewpoint Video Mar 24, 2025 3DGS Quantization
— Unverified 0Energy-Aware LLMs: A step towards sustainable AI for downstream applications Mar 22, 2025 Quantization
— Unverified 0Variance Control via Weight Rescaling in LLM Pre-training Mar 21, 2025 Language Modeling Language Modelling
Code Code Available 0Improving Quantization with Post-Training Model Expansion Mar 21, 2025 Large Language Model model
— Unverified 0SpeCache: Speculative Key-Value Caching for Efficient Generation of LLMs Mar 20, 2025 CPU GPU
— Unverified 0Learning Linear Block Codes with Gradient Quantization Mar 20, 2025 Decoder Quantization
— Unverified 0Neural Networks: According to the Principles of Grassmann Algebra Mar 20, 2025 Quantization
— Unverified 0Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models Mar 20, 2025 Quantization
— Unverified 0Improving Autoregressive Image Generation through Coarse-to-Fine Token Prediction Mar 20, 2025 Image Generation Language Modeling
— Unverified 0Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation Mar 20, 2025 Quantization
— Unverified 0LeanTTA: A Backpropagation-Free and Stateless Approach to Quantized Test-Time Adaptation on Edge Devices Mar 20, 2025 Quantization Test-time Adaptation
— Unverified 0PARQ: Piecewise-Affine Regularized Quantization Mar 19, 2025 Quantization
— Unverified 0FP4DiT: Towards Effective Floating Point Quantization for Diffusion Transformers Mar 19, 2025 Image Generation Quantization
Code Code Available 0RAG-based User Profiling for Precision Planning in Mixed-precision Over-the-Air Federated Learning Mar 19, 2025 Federated Learning Quantization
— Unverified 0Natural Quantization of Neural Networks Mar 19, 2025 Quantization
Code Code Available 0Quantization-Free Autoregressive Action Transformer Mar 18, 2025 Imitation Learning Quantization
Code Code Available 0Robust Machine Unlearning for Quantized Neural Networks via Adaptive Gradient Reweighting with Similar Labels Mar 18, 2025 Machine Unlearning Quantization
— Unverified 0MAG: Multi-Modal Aligned Autoregressive Co-Speech Gesture Generation without Vector Quantization Mar 18, 2025 Gesture Generation Quantization
— Unverified 0CompMarkGS: Robust Watermarking for Compressed 3D Gaussian Splatting Mar 17, 2025 3DGS 3D Reconstruction
— Unverified 0ClusComp: A Simple Paradigm for Model Compression and Efficient Finetuning Mar 17, 2025 GPU Model Compression
— Unverified 0ML-SpecQD: Multi-Level Speculative Decoding with Quantized Drafts Mar 17, 2025 Quantization
— Unverified 0ACT360: An Efficient 360-Degree Action Detection and Summarization Framework for Mission-Critical Training and Debriefing Mar 17, 2025 Action Detection Disaster Response
— Unverified 0Versatile Physics-based Character Control with Hybrid Latent Representation Mar 17, 2025 Motion Generation motion in-betweening
— Unverified 0Pathology Image Compression with Pre-trained Autoencoders Mar 14, 2025 Computational Efficiency Image Compression
— Unverified 0Stabilizing Quantization-Aware Training by Implicit-Regularization on Hessian Matrix Mar 14, 2025 Neural Network Compression Quantization
— Unverified 0Understanding Flatness in Generative Models: Its Role and Benefits Mar 14, 2025 Noise Estimation Quantization
— Unverified 0Global synchronization of multi-agent systems with nonlinear interactions Mar 13, 2025 Quantization
— Unverified 0Dual Codebook VQ: Enhanced Image Reconstruction with Reduced Codebook Size Mar 13, 2025 Face Reconstruction Image Reconstruction
— Unverified 0OuroMamba: A Data-Free Quantization Framework for Vision Mamba Models Mar 13, 2025 channel selection Contrastive Learning
— Unverified 0Automated Tomato Maturity Estimation Using an Optimized Residual Model with Pruning and Quantization Techniques Mar 13, 2025 Classification Computational Efficiency
— Unverified 0Quantization for OpenAI's Whisper Models: A Comparative Analysis Mar 12, 2025 Quantization speech-recognition
Code Code Available 0Sometimes Painful but Certainly Promising: Feasibility and Trade-offs of Language Model Inference at the Edge Mar 12, 2025 CPU GPU
— Unverified 0ViM-VQ: Efficient Post-Training Vector Quantization for Visual Mamba Mar 12, 2025 Mamba Quantization
— Unverified 0Quantitative Analysis of Deeply Quantized Tiny Neural Networks Robust to Adversarial Attacks Mar 12, 2025 Adversarial Robustness Quantization
— Unverified 0Accurate INT8 Training Through Dynamic Block-Level Fallback Mar 11, 2025 Quantization
— Unverified 0PRISM: Privacy-Preserving Improved Stochastic Masking for Federated Generative Models Mar 11, 2025 Federated Learning Privacy Preserving
Code Code Available 0