HEPPO: Hardware-Efficient Proximal Policy Optimization -- A Universal Pipelined Architecture for Generalized Advantage Estimation | Jan 22, 2025 | CPU GPU | Unverified
UAV-Assisted Real-Time Disaster Detection Using Optimized Transformer Model | Jan 21, 2025 | image-classification Image Classification | Unverified
SplitQuant: Layer Splitting for Low-Bit Neural Network Quantization | Jan 21, 2025 | Quantization | Unverified
RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression | Jan 21, 2025 | Autonomous Driving Object Recognition | Unverified
Communication-Efficient Federated Learning by Quantized Variance Reduction for Heterogeneous Wireless Edge Networks | Jan 20, 2025 | Federated Learning Quantization | Unverified
Ditto: Accelerating Diffusion Model via Temporal Value Similarity | Jan 20, 2025 | Image Generation model | Unverified
Practical Modulo Sampling: Mitigating High-Frequency Components | Jan 20, 2025 | Quantization | Unverified
Personalized Federated Learning for Cellular VR: Online Learning and Dynamic Caching | Jan 20, 2025 | Edge-computing Federated Learning | Unverified
BeST -- A Novel Source Selection Metric for Transfer Learning | Jan 19, 2025 | Quantization Transfer Learning | Unverified
LiFT: Lightweight, FPGA-tailored 3D object detection based on LiDAR data | Jan 19, 2025 | 3D Object Detection object-detection | Code Available
DC-PCN: Point Cloud Completion Network with Dual-Codebook Guided Quantization | Jan 19, 2025 | Decoder Point Cloud Completion | Unverified
LUT-DLA: Lookup Table as Efficient Extreme Low-Bit Deep Learning Accelerator | Jan 18, 2025 | Quantization | Unverified
A Novel Hybrid Precoder With Low-Resolution Phase Shifters and Fronthaul Capacity Limitation | Jan 18, 2025 | Quantization | Unverified
4bit-Quantization in Vector-Embedding for RAG | Jan 17, 2025 | Quantization RAG | Code Available
Atleus: Accelerating Transformers on the Edge Enabled by 3D Heterogeneous Manycore Architectures | Jan 16, 2025 | Model Compression Quantization | Unverified
The Devil is in the Details: Simple Remedies for Image-to-LiDAR Representation Learning | Jan 16, 2025 | 3D Object Detection 3D Semantic Segmentation | Unverified
Real-time Indexing for Large-scale Recommendation by Streaming Vector Quantization Retriever | Jan 15, 2025 | Quantization | Unverified
Rethinking Post-Training Quantization: Introducing a Statistical Pre-Calibration Approach | Jan 15, 2025 | Quantization | Unverified
Large Language Models For Text Classification: Case Study And Comprehensive Review | Jan 14, 2025 | Articles Binary Classification | Unverified
Koopman Meets Limited Bandwidth: Effect of Quantization on Data-Driven Linear Prediction and Control of Nonlinear Systems | Jan 13, 2025 | Model Predictive Control Quantization | Unverified
QuantuneV2: Compiler-Based Local Metric-Driven Mixed Precision Quantization for Practical Embedded AI Applications | Jan 13, 2025 | Computational Efficiency Quantization | Unverified
Dataset Distillation as Pushforward Optimal Quantization | Jan 13, 2025 | Dataset Distillation Decoder | Unverified
FlexQuant: Elastic Quantization Framework for Locally Hosted LLM on Edge Devices | Jan 13, 2025 | Quantization | Unverified
ZOQO: Zero-Order Quantized Optimization | Jan 12, 2025 | Quantization | Unverified
DiscQuant: A Quantization Method for Neural Networks Inspired by Discrepancy Theory | Jan 11, 2025 | GSM8K Quantization | Code Available
Precoding Design for Limited-Feedback MISO Systems via Character-Polynomial Codes | Jan 10, 2025 | Quantization | Unverified
Estimation and Restoration of Unknown Nonlinear Distortion using Diffusion | Jan 10, 2025 | Audio Effects Modeling Quantization | Code Available
Mix-QViT: Mixed-Precision Vision Transformer Quantization Driven by Layer Importance and Quantization Sensitivity | Jan 10, 2025 | Quantization Sensitivity | Unverified
Neural Architecture Codesign for Fast Physics Applications | Jan 9, 2025 | High-Level Synthesis Model Compression | Code Available
JAQ: Joint Efficient Architecture Design and Low-Bit Quantization with Hardware-Software Co-Exploration | Jan 9, 2025 | Quantization | Unverified
Knowledge Transfer in Model-Based Reinforcement Learning Agents for Efficient Multi-Task Learning | Jan 9, 2025 | Model-based Reinforcement Learning Multi-Task Learning | Unverified
UPAQ: A Framework for Real-Time and Energy-Efficient 3D Object Detection in Autonomous Vehicles | Jan 8, 2025 | 3D Object Detection Autonomous Vehicles | Unverified
Histogram-Equalized Quantization for logic-gated Residual Neural Networks | Jan 8, 2025 | Quantization | Unverified
Effective and Efficient Mixed Precision Quantization of Speech Foundation Models | Jan 7, 2025 | Model Compression parameter estimation | Unverified
Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning | Jan 6, 2025 | Math Mathematical Reasoning | Unverified
A Novel Structure-Agnostic Multi-Objective Approach for Weight-Sharing Compression in Deep Neural Networks | Jan 6, 2025 | Neural Network Compression Quantization | Unverified
The Power of Negative Zero: Datatype Customization for Quantized Large Language Models | Jan 6, 2025 | Computational Efficiency Quantization | Code Available
Remote Inference over Dynamic Links via Adaptive Rate Deep Task-Oriented Vector Quantization | Jan 5, 2025 | Data Compression Quantization | Code Available
Scaling Laws for Floating Point Quantization Training | Jan 5, 2025 | Quantization | Unverified
TAPAS: Thermal- and Power-Aware Scheduling for LLM Inference in Cloud Platforms | Jan 5, 2025 | GPU Quantization | Unverified
Optimizing Small Language Models for In-Vehicle Function-Calling | Jan 4, 2025 | Model Compression Quantization | Unverified
Millimeter-Wave Energy-Efficient Hybrid Beamforming Architecture and Algorithm | Jan 3, 2025 | Quantization | Unverified
Compressed Domain Prior-Guided Video Super-Resolution for Cloud Gaming Content | Jan 3, 2025 | Quantization Super-Resolution | Unverified
Modulo Sampling: Performance Guarantees in The Presence of Quantization | Jan 2, 2025 | Quantization | Unverified
TreeLUT: An Efficient Alternative to Deep Neural Networks for Inference Acceleration Using Gradient Boosted Decision Trees | Jan 2, 2025 | Quantization | Code Available
BlockDialect: Block-wise Fine-grained Mixed Format Quantization for Energy-Efficient LLM Inference | Jan 2, 2025 | Quantization | Code Available
TabTreeFormer: Tabular Data Generation Using Hybrid Tree-Transformer | Jan 2, 2025 | Benchmarking Quantization | Unverified
Exploiting Latent Properties to Optimize Neural Codecs | Jan 2, 2025 | Decoder Quantization | Unverified
Enhancing Diversity for Data-free Quantization | Jan 1, 2025 | Data Free Quantization Diversity | Unverified
Efficient Decoupled Feature 3D Gaussian Splatting via Hierarchical Compression | Jan 1, 2025 | 3DGS Quantization | Unverified