RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression Jan 21, 2025 Autonomous Driving Object Recognition
— Unverified 0HAC++: Towards 100X Compression of 3D Gaussian Splatting Jan 21, 2025 3DGS Attribute
Code Code Available 3Practical Modulo Sampling: Mitigating High-Frequency Components Jan 20, 2025 Quantization
— Unverified 0Communication-Efficient Federated Learning by Quantized Variance Reduction for Heterogeneous Wireless Edge Networks Jan 20, 2025 Federated Learning Quantization
— Unverified 0Personalized Federated Learning for Cellular VR: Online Learning and Dynamic Caching Jan 20, 2025 Edge-computing Federated Learning
— Unverified 0Ditto: Accelerating Diffusion Model via Temporal Value Similarity Jan 20, 2025 Image Generation model
— Unverified 0DC-PCN: Point Cloud Completion Network with Dual-Codebook Guided Quantization Jan 19, 2025 Decoder Point Cloud Completion
— Unverified 0LiFT: Lightweight, FPGA-tailored 3D object detection based on LiDAR data Jan 19, 2025 3D Object Detection object-detection
Code Code Available 0BeST -- A Novel Source Selection Metric for Transfer Learning Jan 19, 2025 Quantization Transfer Learning
— Unverified 0A Novel Hybrid Precoder With Low-Resolution Phase Shifters and Fronthaul Capacity Limitation Jan 18, 2025 Quantization
— Unverified 0LUT-DLA: Lookup Table as Efficient Extreme Low-Bit Deep Learning Accelerator Jan 18, 2025 Quantization
— Unverified 04bit-Quantization in Vector-Embedding for RAG Jan 17, 2025 Quantization RAG
Code Code Available 0Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search Jan 16, 2025 Quantization
Code Code Available 2Atleus: Accelerating Transformers on the Edge Enabled by 3D Heterogeneous Manycore Architectures Jan 16, 2025 Model Compression Quantization
— Unverified 0The Devil is in the Details: Simple Remedies for Image-to-LiDAR Representation Learning Jan 16, 2025 3D Object Detection 3D Semantic Segmentation
— Unverified 0Real-time Indexing for Large-scale Recommendation by Streaming Vector Quantization Retriever Jan 15, 2025 Quantization
— Unverified 0Rethinking Post-Training Quantization: Introducing a Statistical Pre-Calibration Approach Jan 15, 2025 Quantization
— Unverified 0Large Language Models For Text Classification: Case Study And Comprehensive Review Jan 14, 2025 Articles Binary Classification
— Unverified 0D^2-DPM: Dual Denoising for Quantized Diffusion Probabilistic Models Jan 14, 2025 Denoising Image Generation
Code Code Available 1Koopman Meets Limited Bandwidth: Effect of Quantization on Data-Driven Linear Prediction and Control of Nonlinear Systems Jan 13, 2025 Model Predictive Control Quantization
— Unverified 0Dataset Distillation as Pushforward Optimal Quantization Jan 13, 2025 Dataset Distillation Decoder
— Unverified 0FlexQuant: Elastic Quantization Framework for Locally Hosted LLM on Edge Devices Jan 13, 2025 Quantization
— Unverified 0QuantuneV2: Compiler-Based Local Metric-Driven Mixed Precision Quantization for Practical Embedded AI Applications Jan 13, 2025 Computational Efficiency Quantization
— Unverified 0ZOQO: Zero-Order Quantized Optimization Jan 12, 2025 Quantization
— Unverified 0DiscQuant: A Quantization Method for Neural Networks Inspired by Discrepancy Theory Jan 11, 2025 GSM8K Quantization
Code Code Available 0Precoding Design for Limited-Feedback MISO Systems via Character-Polynomial Codes Jan 10, 2025 Quantization
— Unverified 0Estimation and Restoration of Unknown Nonlinear Distortion using Diffusion Jan 10, 2025 Audio Effects Modeling Quantization
Code Code Available 0Mix-QViT: Mixed-Precision Vision Transformer Quantization Driven by Layer Importance and Quantization Sensitivity Jan 10, 2025 Quantization Sensitivity
— Unverified 0kANNolo: Sweet and Smooth Approximate k-Nearest Neighbors Search Jan 10, 2025 Information Retrieval Quantization
Code Code Available 1Neural Architecture Codesign for Fast Physics Applications Jan 9, 2025 High-Level Synthesis Model Compression
Code Code Available 0Knowledge Transfer in Model-Based Reinforcement Learning Agents for Efficient Multi-Task Learning Jan 9, 2025 Model-based Reinforcement Learning Multi-Task Learning
— Unverified 0JAQ: Joint Efficient Architecture Design and Low-Bit Quantization with Hardware-Software Co-Exploration Jan 9, 2025 Quantization
— Unverified 0DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models Jan 8, 2025 Quantization
Code Code Available 1Histogram-Equalized Quantization for logic-gated Residual Neural Networks Jan 8, 2025 Quantization
— Unverified 0UPAQ: A Framework for Real-Time and Energy-Efficient 3D Object Detection in Autonomous Vehicles Jan 8, 2025 3D Object Detection Autonomous Vehicles
— Unverified 0Effective and Efficient Mixed Precision Quantization of Speech Foundation Models Jan 7, 2025 Model Compression parameter estimation
— Unverified 0The Power of Negative Zero: Datatype Customization for Quantized Large Language Models Jan 6, 2025 Computational Efficiency Quantization
Code Code Available 0Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning Jan 6, 2025 Math Mathematical Reasoning
— Unverified 0A Novel Structure-Agnostic Multi-Objective Approach for Weight-Sharing Compression in Deep Neural Networks Jan 6, 2025 Neural Network Compression Quantization
— Unverified 0Qinco2: Vector Compression and Search with Improved Implicit Neural Codebooks Jan 6, 2025 Decoder Quantization
Code Code Available 2Scaling Laws for Floating Point Quantization Training Jan 5, 2025 Quantization
— Unverified 0Remote Inference over Dynamic Links via Adaptive Rate Deep Task-Oriented Vector Quantization Jan 5, 2025 Data Compression Quantization
Code Code Available 0HALO: Hadamard-Assisted Lower-Precision Optimization for LLMs Jan 5, 2025 Efficient Neural Network parameter-efficient fine-tuning
Code Code Available 1TAPAS: Thermal- and Power-Aware Scheduling for LLM Inference in Cloud Platforms Jan 5, 2025 GPU Quantization
— Unverified 0Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies Jan 4, 2025 Edge-computing Knowledge Distillation
Code Code Available 2Optimizing Small Language Models for In-Vehicle Function-Calling Jan 4, 2025 Model Compression Quantization
— Unverified 0Millimeter-Wave Energy-Efficient Hybrid Beamforming Architecture and Algorithm Jan 3, 2025 Quantization
— Unverified 0Compressed Domain Prior-Guided Video Super-Resolution for Cloud Gaming Content Jan 3, 2025 Quantization Super-Resolution
— Unverified 0Modulo Sampling: Performance Guarantees in The Presence of Quantization Jan 2, 2025 Quantization
— Unverified 0TreeLUT: An Efficient Alternative to Deep Neural Networks for Inference Acceleration Using Gradient Boosted Decision Trees Jan 2, 2025 Quantization
Code Code Available 0