Speech Enhancement Using Continuous Embeddings of Neural Audio Codec Feb 22, 2025 Quantization Speech Enhancement
— Unverified 0A 2-bit Wideband 5G mm-Wave RIS with Low Side Lobe Levels and no Quantization Lobe Feb 22, 2025 Quantization
— Unverified 0Verification of Bit-Flip Attacks against Quantized Neural Networks Feb 22, 2025 Neural Network Security Quantization
— Unverified 0Exact Recovery of Sparse Binary Vectors from Generalized Linear Measurements Feb 21, 2025 2k Quantization
— Unverified 0SVDq: 1.25-bit and 410x Key Cache Compression for LLM Attention Feb 21, 2025 Quantization
— Unverified 0FD-LSCIC: Frequency Decomposition-based Learned Screen Content Image Compression Feb 21, 2025 Image Compression MS-SSIM
— Unverified 0Q-PETR: Quant-aware Position Embedding Transformation for Multi-View 3D Object Detection Feb 21, 2025 3D Object Detection Autonomous Driving
— Unverified 0When Compression Meets Model Compression: Memory-Efficient Double Compression for Large Language Models Feb 21, 2025 Model Compression Quantization
— Unverified 0Interleaved Block-based Learned Image Compression with Feature Enhancement and Quantization Error Compensation Feb 21, 2025 Image Compression MS-SSIM
— Unverified 0Efficient AI in Practice: Training and Deployment of Efficient LLMs for Industry Applications Feb 20, 2025 Knowledge Distillation Model Compression
— Unverified 0Hardware-Friendly Static Quantization Method for Video Diffusion Transformers Feb 20, 2025 Quantization Video Generation
— Unverified 0More for Keys, Less for Values: Adaptive KV Cache Quantization Feb 20, 2025 Quantization
— Unverified 0A General Error-Theoretical Analysis Framework for Constructing Compression Strategies Feb 19, 2025 Quantization
— Unverified 0Benchmarking Post-Training Quantization in LLMs: Comprehensive Taxonomy, Unified Evaluation, and Comparative Analysis Feb 18, 2025 Benchmarking Mamba
Code Code Available 0Investigating the Impact of Quantization Methods on the Safety and Reliability of Large Language Models Feb 18, 2025 Quantization
Code Code Available 0A^2ATS: Retrieval-Based KV Cache Reduction via Windowed Rotary Position Embedding and Query-Aware Vector Quantization Feb 18, 2025 CPU Position
— Unverified 0Towards Reasoning Ability of Small Language Models Feb 17, 2025 Quantization
— Unverified 0Fate: Fast Edge Inference of Mixture-of-Experts Models via Cross-Layer Gate Feb 17, 2025 GPU Mixture-of-Experts
Code Code Available 0Continual Quantization-Aware Pre-Training: When to transition from 16-bit to 1.58-bit pre-training for BitNet language models? Feb 17, 2025 Quantization
— Unverified 0Rotate, Clip, and Partition: Towards W2A4KV4 Quantization by Integrating Rotation and Learnable Non-uniform Quantizer Feb 17, 2025 GPU Quantization
— Unverified 0Towards Efficient Pre-training: Exploring FP4 Precision in Large Language Models Feb 17, 2025 Quantization
— Unverified 0On Quantizing Neural Representation for Variable-Rate Video Coding Feb 17, 2025 Quantization
Code Code Available 0On the Logic Elements Associated with Round-Off Errors and Gaussian Blur in Image Registration: A Simple Case of Commingling Feb 17, 2025 Image Registration Quantization
— Unverified 0Unveiling Environmental Impacts of Large Language Model Serving: A Functional Unit View Feb 16, 2025 Language Modeling Language Modelling
Code Code Available 0Weighted quantization using MMD: From mean field to mean shift via gradient flows Feb 14, 2025 Clustering Quantization
Code Code Available 0EmbBERT-Q: Breaking Memory Barriers in Embedded NLP Feb 14, 2025 Mamba Quantization
Code Code Available 0Towards Watermarking of Open-Source LLMs Feb 14, 2025 Quantization
— Unverified 0Low-Complexity On-Grid Channel Estimation for Partially-Connected Hybrid XL-MIMO Feb 14, 2025 Quantization
— Unverified 0RoSTE: An Efficient Quantization-Aware Supervised Fine-Tuning Approach for Large Language Models Feb 13, 2025 Quantization
— Unverified 0NestQuant: Nested Lattice Quantization for Matrix Products and LLMs Feb 13, 2025 Quantization
— Unverified 0LowRA: Accurate and Efficient LoRA Fine-Tuning of LLMs under 2 Bits Feb 12, 2025 parameter-efficient fine-tuning Quantization
— Unverified 0Contextual Compression Encoding for Large Language Models: A Novel Framework for Multi-Layered Parameter Space Pruning Feb 12, 2025 Computational Efficiency Quantization
— Unverified 0Compression of Site-Specific Deep Neural Networks for Massive MIMO Precoding Feb 12, 2025 Neural Architecture Search Neural Network Compression
— Unverified 0Loss Landscape Analysis for Reliable Quantized ML Models for Scientific Sensing Feb 12, 2025 Quantization
Code Code Available 0Scalable Thermodynamic Second-order Optimization Feb 12, 2025 Quantization
— Unverified 0Exploiting Non-uniform Quantization for Enhanced ILC in Wideband Digital Pre-distortion Feb 12, 2025 Quantization
— Unverified 0Conditional Distribution Quantization in Machine Learning Feb 11, 2025 Quantization Uncertainty Quantification
— Unverified 0Column-wise Quantization of Weights and Partial Sums for Accurate and Efficient Compute-In-Memory Accelerators Feb 11, 2025 Quantization
Code Code Available 0Vision-Language Models for Edge Networks: A Comprehensive Survey Feb 11, 2025 Autonomous Vehicles Image Captioning
— Unverified 0HDCompression: Hybrid-Diffusion Image Compression for Ultra-Low Bitrates Feb 11, 2025 Image Compression Image Reconstruction
— Unverified 0MEMHD: Memory-Efficient Multi-Centroid Hyperdimensional Computing for Fully-Utilized In-Memory Computing Architectures Feb 11, 2025 Quantization
— Unverified 0GraNNite: Enabling High-Performance Execution of Graph Neural Networks on Resource-Constrained Neural Processing Units Feb 10, 2025 Event-based vision Quantization
Code Code Available 0Matryoshka Quantization Feb 10, 2025 Quantization
— Unverified 0Finetuning and Quantization of EEG-Based Foundational BioSignal Models on ECG and PPG Data for Blood Pressure Estimation Feb 10, 2025 Blood pressure estimation EEG
— Unverified 0Demystifying Singular Defects in Large Language Models Feb 10, 2025 Quantization
— Unverified 0Gradient Based Method for the Fusion of Lattice Quantizers Feb 9, 2025 Quantization
— Unverified 0Physics-Conditioned Diffusion Models for Lattice Gauge Theory Feb 8, 2025 Quantization
Code Code Available 0Scalable and consistent embedding of probability measures into Hilbert spaces via measure quantization Feb 7, 2025 Quantization
— Unverified 0Efficient Evaluation of Quantization-Effects in Neural Codecs Feb 7, 2025 Decoder Quantization
— Unverified 0QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation Feb 7, 2025 Image Generation Quantization
— Unverified 0