Distributed Deep Reinforcement Learning Based Gradient Quantization for Federated Learning Enabled Vehicle Edge Computing Jul 11, 2024 Deep Reinforcement Learning Edge-computing
— Unverified 0Autoregressive Speech Synthesis without Vector Quantization Jul 11, 2024 Audio Compression Diversity
— Unverified 0Applying generative neural networks for fast simulations of the ALICE (CERN) experiment Jul 10, 2024 Quantization
Code Code Available 0ERQ: Error Reduction for Post-Training Quantization of Vision Transformers Jul 9, 2024 Quantization regression
— Unverified 0Ternary Spike-based Neuromorphic Signal Processing System Jul 7, 2024 Quantization
— Unverified 0ZOBNN: Zero-Overhead Dependable Design of Binary Neural Networks with Deliberately Quantized Parameters Jul 6, 2024 Attribute Quantization
— Unverified 0Beyond Perplexity: Multi-dimensional Safety Evaluation of LLM Compression Jul 6, 2024 Language Modeling Language Modelling
Code Code Available 0Integer-only Quantized Transformers for Embedded FPGA-based Time-series Forecasting in AIoT Jul 6, 2024 Quantization Time Series
— Unverified 0Balance of Number of Embedding and their Dimensions in Vector Quantization Jul 6, 2024 Quantization
— Unverified 0Quantizing YOLOv7: A Comprehensive Study Jul 6, 2024 Model Compression object-detection
— Unverified 0Hybrid Receiver Design for Massive MIMO-OFDM with Low-Resolution ADCs and Oversampling Jul 5, 2024 Quantization
— Unverified 0The Impact of Quantization and Pruning on Deep Reinforcement Learning Models Jul 5, 2024 Deep Reinforcement Learning Model Compression
— Unverified 0Resource-Efficient Speech Quality Prediction through Quantization Aware Training and Binary Activation Maps Jul 5, 2024 Quantization
Code Code Available 0Low-latency machine learning FPGA accelerator for multi-qubit-state discrimination Jul 4, 2024 Quantization
— Unverified 0Joint Beamforming Design and Bit Allocation in Massive MIMO with Resolution-Adaptive ADCs Jul 4, 2024 Quantization
— Unverified 0Timestep-Aware Correction for Quantized Diffusion Models Jul 4, 2024 Attribute Noise Estimation
— Unverified 0QET: Enhancing Quantized LLM Parameters and KV cache Compression through Element Substitution and Residual Clustering Jul 4, 2024 Computational Efficiency Edge-computing
— Unverified 0GPTQT: Quantize Large Language Models Twice to Push the Efficiency Jul 3, 2024 Quantization
— Unverified 0Fisher-aware Quantization for DETR Detectors with Critical-category Objectives Jul 3, 2024 object-detection Object Detection
— Unverified 0ADFQ-ViT: Activation-Distribution-Friendly Post-Training Quantization for Vision Transformers Jul 3, 2024 Attribute image-classification
— Unverified 0Codec-ASR: Training Performant Automatic Speech Recognition Systems with Discrete Speech Representations Jul 3, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SFC: Achieve Accurate Fast Convolution under Low-precision Arithmetic Jul 3, 2024 Quantization
— Unverified 0Unified Anomaly Detection methods on Edge Device using Knowledge Distillation and Quantization Jul 3, 2024 Anomaly Detection CPU
— Unverified 0Improving Conversational Abilities of Quantized Large Language Models via Direct Preference Alignment Jul 3, 2024 Chatbot Computational Efficiency
— Unverified 0Edge AI-Enabled Chicken Health Detection Based on Enhanced FCOS-Lite and Knowledge Distillation Jul 3, 2024 Knowledge Distillation Quantization
— Unverified 0OSPC: Artificial VLM Features for Hateful Meme Detection Jul 3, 2024 Computational Efficiency Feature Engineering
— Unverified 0How Does Quantization Affect Multilingual LLMs? Jul 3, 2024 Mathematical Reasoning Quantization
— Unverified 0Joint Pruning and Channel-wise Mixed-Precision Quantization for Efficient Deep Neural Networks Jul 1, 2024 Quantization
Code Code Available 0Exploring FPGA designs for MX and beyond Jul 1, 2024 Efficient Neural Network Quantization
— Unverified 0Beyond Throughput and Compression Ratios: Towards High End-to-end Utility of Gradient Compression Jul 1, 2024 Quantization
— Unverified 0PQCache: Product Quantization-based KVCache for Long Context LLM Inference Jul 1, 2024 GPU Quantization
— Unverified 0Linear and Nonlinear MMSE Estimation in One-Bit Quantized Systems under a Gaussian Mixture Prior Jul 1, 2024 Quantization
— Unverified 0NeuroNAS: Enhancing Efficiency of Neuromorphic In-Memory Computing for Intelligent Mobile Agents through Hardware-Aware Spiking Neural Architecture Search Jun 30, 2024 Neural Architecture Search Quantization
— Unverified 0Toward a Diffusion-Based Generalist for Dense Vision Tasks Jun 29, 2024 Conditional Image Generation Image Generation
— Unverified 0Rateless Stochastic Coding for Delay-Constrained Semantic Communication Jun 28, 2024 Decoder Perceptual Distance
— Unverified 0Deep Fusion Model for Brain Tumor Classification Using Fine-Grained Gradient Preservation Jun 28, 2024 Brain Tumor Classification Classification
— Unverified 0Reliable edge machine learning hardware for scientific applications Jun 27, 2024 Quantization scientific discovery
— Unverified 0Fronthaul Quantization-Aware MU-MIMO Precoding for Sum Rate Maximization Jun 27, 2024 Quantization
— Unverified 0Efficient course recommendations with T5-based ranking and summarization Jun 27, 2024 In-Context Learning Quantization
Code Code Available 0MCNC: Manifold Constrained Network Compression Jun 27, 2024 Model Compression Quantization
— Unverified 0OutlierTune: Efficient Channel-Wise Quantization for Large Language Models Jun 27, 2024 Quantization
— Unverified 0FedAQ: Communication-Efficient Federated Edge Learning via Joint Uplink and Downlink Adaptive Quantization Jun 26, 2024 Federated Learning Quantization
— Unverified 0A Quantization-based Technique for Privacy Preserving Distributed Learning Jun 26, 2024 Privacy Preserving Quantization
— Unverified 0Differential error feedback for communication-efficient decentralized learning Jun 26, 2024 Quantization
— Unverified 0CDQuant: Greedy Coordinate Descent for Accurate LLM Quantization Jun 25, 2024 Quantization
— Unverified 0Layer-Wise Quantization: A Pragmatic and Effective Method for Quantizing LLMs Beyond Integer Bit-Levels Jun 25, 2024 Language Modelling Large Language Model
Code Code Available 0Reducing the Memory Footprint of 3D Gaussian Splatting Jun 24, 2024 Novel View Synthesis Quantization
— Unverified 0Compensate Quantization Errors: Make Weights Hierarchical to Compensate Each Other Jun 24, 2024 Quantization
— Unverified 0Approximate DCT and Quantization Techniques for Energy-Constrained Image Sensors Jun 24, 2024 Quantization
— Unverified 0BitNet b1.58 Reloaded: State-of-the-art Performance Also on Smaller Networks Jun 24, 2024 Quantization
— Unverified 0