Relative Entropy Regularized Reinforcement Learning for Efficient Encrypted Policy Synthesis Jun 14, 2025 Model-based Reinforcement Learning Privacy Preserving
— Unverified 0FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation Jun 13, 2025 Model Compression Quantization
Code Code Available 1Deep Learning Model Acceleration and Optimization Strategies for Real-Time Recommendation Systems Jun 13, 2025 Quantization Recommendation Systems
— Unverified 0GPLQ: A General, Practical, and Lightning QAT Method for Vision Transformers Jun 13, 2025 Fine-Grained Image Classification Quantization
— Unverified 0Starting Positions Matter: A Study on Better Weight Initialization for Neural Network Quantization Jun 12, 2025 Quantization
— Unverified 0Post-Training Quantization for Video Matting Jun 12, 2025 Image Matting Model Compression
— Unverified 0MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices Jun 12, 2025 CPU GPU
— Unverified 0Discrete Audio Tokens: More Than a Survey! Jun 12, 2025 Language Modeling Language Modelling
— Unverified 0SLED: A Speculative LLM Decoding Framework for Efficient Edge Serving Jun 11, 2025 Edge-computing Quantization
— Unverified 0Q-SAM2: Accurate Quantization for Segment Anything Model 2 Jun 11, 2025 Quantization Video Segmentation
— Unverified 0HadaNorm: Diffusion Transformer Quantization through Mean-Centered Transformations Jun 11, 2025 Image Generation Quantization
— Unverified 0AWP: Activation-Aware Weight Pruning and Quantization with Projected Gradient Descent Jun 11, 2025 Model Compression Quantization
— Unverified 0Hardware Limitations and Optimization Approach in 1-Bit RIS Design at 28 GHz Jun 10, 2025 Quantization
— Unverified 0Implementing Keyword Spotting on the MCUX947 Microcontroller with Integrated NPU Jun 10, 2025 CPU Keyword Spotting
— Unverified 0POLARON: Precision-aware On-device Learning and Adaptive Runtime-cONfigurable AI acceleration Jun 10, 2025 Quantization
— Unverified 0Optimizing Learned Image Compression on Scalar and Entropy-Constraint Quantization Jun 10, 2025 Image Compression Quantization
— Unverified 0Decentralized Optimization on Compact Submanifolds by Quantized Riemannian Gradient Tracking Jun 9, 2025 Distributed Optimization Quantization
— Unverified 0BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation Jun 9, 2025 Quantization Vision-Language-Action
Code Code Available 2Evaluating Large Language Models on the Frame and Symbol Grounding Problems: A Zero-shot Benchmark Jun 9, 2025 Quantization
Code Code Available 0LiteVLM: A Low-Latency Vision-Language Model Inference Pipeline for Resource-Constrained Environments Jun 9, 2025 Autonomous Driving Language Modeling
— Unverified 0Highly Compressed Tokenizer Can Generate Without Training Jun 9, 2025 Image Generation Quantization
Code Code Available 3Auditing Black-Box LLM APIs with a Rank-Based Uniformity Test Jun 8, 2025 Quantization
— Unverified 0QForce-RL: Quantized FPGA-Optimized Reinforcement Learning Compute Engine Jun 8, 2025 Decision Making Quantization
— Unverified 0Enabling On-Device Medical AI Assistants via Input-Driven Saliency Adaptation Jun 7, 2025 MedQA Quantization
— Unverified 0Towards AI-Native Fronthaul: Neural Compression for NextG Cloud RAN Jun 7, 2025 Quantization
— Unverified 0Bridging the Modality Gap: Softly Discretizing Audio Representation for LLM-based Automatic Speech Recognition Jun 6, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0EdgeProfiler: A Fast Profiling Framework for Lightweight LLMs on Edge Using Analytical Model Jun 6, 2025 Natural Language Understanding Quantization
Code Code Available 0RecGPT: A Foundation Model for Sequential Recommendation Jun 6, 2025 Decoder model
Code Code Available 2BEAST: Efficient Tokenization of B-Splines Encoded Action Sequences for Imitation Learning Jun 6, 2025 continuous-control Continuous Control
— Unverified 0PCDVQ: Enhancing Vector Quantization for Large Language Models via Polar Coordinate Decoupling Jun 5, 2025 Clustering Quantization
— Unverified 0Massive MIMO with 1-Bit DACs: Data Detection for Quantized Linear Precoding with Dithering Jun 5, 2025 Quantization
— Unverified 0Kernel k-Medoids as General Vector Quantization Jun 5, 2025 Data Compression Density Estimation
— Unverified 0FPTQuant: Function-Preserving Transforms for LLM Quantization Jun 5, 2025 Quantization
— Unverified 0TaDA: Training-free recipe for Decoding with Adaptive KV Cache Compression and Mean-centering Jun 5, 2025 Quantization
— Unverified 0FPSAttention: Training-Aware FP8 and Sparsity Co-Design for Fast Video Diffusion Jun 5, 2025 Denoising Quantization
— Unverified 0Nonlinear Sparse Bayesian Learning Methods with Application to Massive MIMO Channel Estimation with Hardware Impairments Jun 4, 2025 Quantization
— Unverified 0BitTTS: Highly Compact Text-to-Speech Using 1.58-bit Quantization and Weight Indexing Jun 4, 2025 Quantization text-to-speech
— Unverified 0STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization Jun 4, 2025 Action Generation Quantization
Code Code Available 0MUC-G4: Minimal Unsat Core-Guided Incremental Verification for Deep Neural Network Compression Jun 3, 2025 Neural Network Compression Quantization
— Unverified 0Quantized Dissipative Uncertain Model for Fractional T_S Fuzzy systems with Time_Varying Delays Under Networked Control System Jun 3, 2025 Quantization
— Unverified 0Enhancing Convergence, Privacy and Fairness for Wireless Personalized Federated Learning: Quantization-Assisted Min-Max Fair Scheduling Jun 3, 2025 Fairness Federated Learning
— Unverified 0Flexible Mixed Precision Quantization for Learned Image Compression Jun 2, 2025 Image Compression Quantization
Code Code Available 0Structured Pruning and Quantization for Learned Image Compression Jun 2, 2025 image-classification Image Classification
Code Code Available 0Quantitative Error Feedback for Quantization Noise Reduction of Filtering over Graphs Jun 2, 2025 Quantization
— Unverified 0Parameter Efficient Fine Tuning Llama 3.1 for Answering Arabic Legal Questions: A Case Study on Jordanian Laws Jun 2, 2025 Language Modeling Language Modelling
Code Code Available 0Enhancing Speech Emotion Recognition with Graph-Based Multimodal Fusion and Prosodic Features for the Speech Emotion Recognition in Naturalistic Conditions Challenge at Interspeech 2025 Jun 2, 2025 Audio Tagging Emotion Recognition
— Unverified 0CLAP-ART: Automated Audio Captioning with Semantic-rich Audio Representation Tokenizer Jun 1, 2025 Audio captioning Language Modeling
— Unverified 0Quantization-based Bounds on the Wasserstein Metric Jun 1, 2025 Computational Efficiency Domain Adaptation
— Unverified 0Power-of-Two (PoT) Weights in Large Language Models (LLMs) May 31, 2025 Quantization
— Unverified 0LittleBit: Ultra Low-Bit Quantization via Latent Factorization May 30, 2025 Quantization
— Unverified 0