GSVR: 2D Gaussian-based Video Representation for 800+ FPS with Hybrid Deformation Field | Jul 8, 2025 | Quantization Video Compression | Unverified
EdgeCodec: Onboard Lightweight High Fidelity Neural Compressor with Residual Vector Quantization | Jul 8, 2025 | Quantization | Code Available
Rethinking Discrete Tokens: Treating Them as Conditions for Continuous Autoregressive Image Synthesis | Jul 2, 2025 | Density Estimation Image Generation | Unverified
PsyLite Technical Report | Jun 26, 2025 | Large Language Model Lightweight Deployment | Code Available
Analysis of Null Related Beampattern Measures and Signal Quantization Effects for Linear Differential Microphone Arrays | Jun 26, 2025 | Quantization | Unverified
OLALa: Online Learned Adaptive Lattice Codes for Heterogeneous Federated Learning | Jun 25, 2025 | Federated Learning Quantization | Code Available
Joint Quantization and Pruning Neural Networks Approach: A Case Study on FSO Receivers | Jun 25, 2025 | Quantization | Unverified
DipSVD: Dual-importance Protected SVD for Efficient LLM Compression | Jun 25, 2025 | Model Compression Quantization | Unverified
Cross-Layer Discrete Concept Discovery for Interpreting Language Models | Jun 24, 2025 | Diversity Quantization | Unverified
Variational Bayesian Channel Estimation and Data Detection for Cell-Free Massive MIMO with Low-Resolution Quantized Fronthaul Links | Jun 23, 2025 | CPU Quantization | Unverified
LVPNet: A Latent-variable-based Prediction-driven End-to-end Framework for Lossless Compression of Medical Images | Jun 22, 2025 | Image Compression Image Segmentation | Code Available
StainPIDR: A Pathological Image Decoupling and Reconstruction Method for Stain Normalization Based on Color Vector Quantization and Structure Restaining | Jun 22, 2025 | Diagnostic Quantization | Unverified
NestQuant: Post-Training Integer-Nesting Quantization for On-Device DNN | Jun 22, 2025 | Quantization | Code Available
TROJAN-GUARD: Hardware Trojans Detection Using GNN in RTL Designs | Jun 22, 2025 | Graph Neural Network Quantization | Unverified
RLRC: Reinforcement Learning-based Recovery for Compressed Vision-Language-Action Models | Jun 21, 2025 | Model Compression Quantization | Unverified
A Simple Contrastive Framework Of Item Tokenization For Generative Recommendation | Jun 20, 2025 | Contrastive Learning Descriptive | Unverified
Cross-Modal Epileptic Signal Harmonization: Frequency Domain Mapping Quantization for Pre-training a Unified Neurophysiological Transformer | Jun 20, 2025 | EEG Quantization | Code Available
The Hidden Cost of an Image: Quantifying the Energy Consumption of AI Image Generation | Jun 20, 2025 | Image Generation Quantization | Unverified
PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models | Jun 19, 2025 | Image Generation Quantization | Unverified
On Designing Modulation for Over-the-Air Computation -- Part I: Noise-Aware Design | Jun 19, 2025 | Low-latency processing Quantization | Unverified
J3DAI: A tiny DNN-Based Edge AI Accelerator for 3D-Stacked CMOS Image Sensor | Jun 18, 2025 | image-classification Image Classification | Unverified
Effect of Signal Quantization on Performance Measures of a 1st Order One Dimensional Differential Microphone Array | Jun 18, 2025 | Quantization | Unverified
Modulated Diffusion: Accelerating Generative Modeling with Modulated Quantization | Jun 18, 2025 | Quantization | Code Available
MoTE: Mixture of Ternary Experts for Memory-efficient Large Multimodal Models | Jun 17, 2025 | Mixture-of-Experts Quantization | Unverified
Compressed Video Super-Resolution based on Hierarchical Encoding | Jun 17, 2025 | Quantization Super-Resolution | Unverified
Cost-Aware Routing for Efficient Text-To-Image Generation | Jun 17, 2025 | Denoising Image Generation | Unverified
ROSAQ: Rotation-based Saliency-Aware Weight Quantization for Efficiently Compressing Large Language Models | Jun 16, 2025 | Quantization | Unverified
EAQuant: Enhancing Post-Training Quantization for MoE Models via Expert-Aware Optimization | Jun 16, 2025 | Mixture-of-Experts Model Compression | Code Available
Serving Large Language Models on Huawei CloudMatrix384 | Jun 15, 2025 | Mixture-of-Experts Quantization | Unverified
Quantizing Small-Scale State-Space Models for Edge AI | Jun 14, 2025 | Quantization State Space Models | Unverified
Relative Entropy Regularized Reinforcement Learning for Efficient Encrypted Policy Synthesis | Jun 14, 2025 | Model-based Reinforcement Learning Privacy Preserving | Unverified
Deep Learning Model Acceleration and Optimization Strategies for Real-Time Recommendation Systems | Jun 13, 2025 | Quantization Recommendation Systems | Unverified
GPLQ: A General, Practical, and Lightning QAT Method for Vision Transformers | Jun 13, 2025 | Fine-Grained Image Classification Quantization | Unverified
MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices | Jun 12, 2025 | CPU GPU | Unverified
Starting Positions Matter: A Study on Better Weight Initialization for Neural Network Quantization | Jun 12, 2025 | Quantization | Unverified
Discrete Audio Tokens: More Than a Survey! | Jun 12, 2025 | Language Modeling Language Modelling | Unverified
Post-Training Quantization for Video Matting | Jun 12, 2025 | Image Matting Model Compression | Unverified
SLED: A Speculative LLM Decoding Framework for Efficient Edge Serving | Jun 11, 2025 | Edge-computing Quantization | Unverified
HadaNorm: Diffusion Transformer Quantization through Mean-Centered Transformations | Jun 11, 2025 | Image Generation Quantization | Unverified
Q-SAM2: Accurate Quantization for Segment Anything Model 2 | Jun 11, 2025 | Quantization Video Segmentation | Unverified
AWP: Activation-Aware Weight Pruning and Quantization with Projected Gradient Descent | Jun 11, 2025 | Model Compression Quantization | Unverified
Hardware Limitations and Optimization Approach in 1-Bit RIS Design at 28 GHz | Jun 10, 2025 | Quantization | Unverified
Implementing Keyword Spotting on the MCUX947 Microcontroller with Integrated NPU | Jun 10, 2025 | CPU Keyword Spotting | Unverified
POLARON: Precision-aware On-device Learning and Adaptive Runtime-cONfigurable AI acceleration | Jun 10, 2025 | Quantization | Unverified
Optimizing Learned Image Compression on Scalar and Entropy-Constraint Quantization | Jun 10, 2025 | Image Compression Quantization | Unverified
Decentralized Optimization on Compact Submanifolds by Quantized Riemannian Gradient Tracking | Jun 9, 2025 | Distributed Optimization Quantization | Unverified
LiteVLM: A Low-Latency Vision-Language Model Inference Pipeline for Resource-Constrained Environments | Jun 9, 2025 | Autonomous Driving Language Modeling | Unverified
Evaluating Large Language Models on the Frame and Symbol Grounding Problems: A Zero-shot Benchmark | Jun 9, 2025 | Quantization | Code Available
QForce-RL: Quantized FPGA-Optimized Reinforcement Learning Compute Engine | Jun 8, 2025 | Decision Making Quantization | Unverified
Auditing Black-Box LLM APIs with a Rank-Based Uniformity Test | Jun 8, 2025 | Quantization | Unverified