QERA: an Analytical Framework for Quantization Error Reconstruction Oct 8, 2024 parameter-efficient fine-tuning Quantization
— Unverified 0Designing a Classifier for Active Fire Detection from Multispectral Satellite Imagery Using Neural Architecture Search Oct 7, 2024 Fire Detection Neural Architecture Search
— Unverified 0Variable Resolution Pixel Quantization for Low Power Machine Vision Application on Edge Oct 7, 2024 Edge-computing image-classification
— Unverified 0Continuous Approximations for Improving Quantization Aware Training of LLMs Oct 6, 2024 MMLU Model Compression
— Unverified 0HALL-E: Hierarchical Neural Codec Language Model for Minute-Long Zero-Shot Text-to-Speech Synthesis Oct 6, 2024 Language Modeling Language Modelling
— Unverified 0PalmBench: A Comprehensive Benchmark of Compressed Large Language Models on Mobile Platforms Oct 5, 2024 Benchmarking GPU
— Unverified 0EXAQ: Exponent Aware Quantization For LLMs Acceleration Oct 4, 2024 Quantization Question Answering
Code Code Available 0Resource-aware Mixed-precision Quantization for Enhancing Deployability of Transformers for Time-series Forecasting on Embedded FPGAs Oct 4, 2024 Neural Architecture Search Quantization
— Unverified 0Generative Semantic Communication for Text-to-Speech Synthesis Oct 4, 2024 Quantization Semantic Communication
— Unverified 0MIMO Detection with Spatial Sigma-Delta ADCs: A Variational Bayesian Approach Oct 4, 2024 Quantization
— Unverified 0SEAL: SEmantic-Augmented Imitation Learning via Language Model Oct 3, 2024 Decision Making Imitation Learning
— Unverified 0Overcoming Representation Bias in Fairness-Aware data Repair using Optimal Transport Oct 3, 2024 Attribute Fairness
— Unverified 0Remember and Recall: Associative-Memory-based Trajectory Prediction Oct 3, 2024 Autonomous Driving Computational Efficiency
— Unverified 0Getting Free Bits Back from Rotational Symmetries in LLMs Oct 2, 2024 Quantization
— Unverified 0Restorative Speech Enhancement: A Progressive Approach Using SE and Codec Modules Oct 2, 2024 Quantization Speech Enhancement
— Unverified 0Trainable pruned ternary quantization for medical signal classification models Oct 1, 2024 Model Compression Quantization
Code Code Available 0Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging Oct 1, 2024 Computational Efficiency Knowledge Distillation
— Unverified 0STanH : Parametric Quantization for Variable Rate Learned Image Compression Oct 1, 2024 Decoder Image Compression
— Unverified 0Deep activity propagation via weight initialization in spiking neural networks Oct 1, 2024 Quantization
— Unverified 0Aggressive Post-Training Compression on Extremely Large Language Models Sep 30, 2024 Model Compression Network Pruning
— Unverified 0Constraint Guided Model Quantization of Neural Networks Sep 30, 2024 model Quantization
— Unverified 0Accelerating PoT Quantization on Edge Devices Sep 30, 2024 CPU Quantization
Code Code Available 0Rotated Runtime Smooth: Training-Free Activation Smoother for accurate INT4 inference Sep 30, 2024 Quantization
— Unverified 0Mixed-Precision Embeddings for Large-Scale Recommendation Models Sep 30, 2024 Quantization Recommendation Systems
— Unverified 0Quantized and Asynchronous Federated Learning Sep 30, 2024 Federated Learning Quantization
— Unverified 0InfantCryNet: A Data-driven Framework for Intelligent Analysis of Infant Cries Sep 29, 2024 Knowledge Distillation Model Compression
— Unverified 0Efficient Federated Intrusion Detection in 5G ecosystem using optimized BERT-based model Sep 28, 2024 Federated Learning Intrusion Detection
Code Code Available 0Asymptotic tracking control of dynamic reference over homomorphically encrypted data with finite modulus Sep 27, 2024 Quantization
— Unverified 0A method of using RSVD in residual calculation of LowBit GEMM Sep 27, 2024 Data Free Quantization Quantization
— Unverified 0Heterogeneous quantization regularizes spiking neural network activity Sep 27, 2024 Denoising Quantization
— Unverified 0Fronthaul-Constrained Distributed Radar Sensing Sep 26, 2024 Quantization
— Unverified 0Language Models as Zero-shot Lossless Gradient Compressors: Towards General Neural Parameter Prior Models Sep 26, 2024 Neural Network Compression Quantization
Code Code Available 0MoGenTS: Motion Generation based on Spatial-Temporal Joint Modeling Sep 26, 2024 Motion Generation Quantization
— Unverified 0Efficient Arbitrary Precision Acceleration for Large Language Models on GPU Tensor Cores Sep 26, 2024 GPU Management
— Unverified 0Digital and Hybrid Precoding Designs in Massive MIMO with Low-Resolution ADCs Sep 26, 2024 Quantization
Code Code Available 0P4Q: Learning to Prompt for Quantization in Visual-language Models Sep 26, 2024 Quantization
— Unverified 0Reinforcement Learning for Finite Space Mean-Field Type Games Sep 25, 2024 Deep Reinforcement Learning Q-Learning
— Unverified 0A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms Sep 25, 2024 Quantization
— Unverified 0Accumulator-Aware Post-Training Quantization Sep 25, 2024 image-classification Image Classification
— Unverified 0LLaMa-SciQ: An Educational Chatbot for Answering Science MCQ Sep 25, 2024 Chatbot GSM8K
— Unverified 0Using Random Codebooks for Audio Neural AutoEncoders Sep 25, 2024 Audio Compression Quantization
— Unverified 0PTQ4RIS: Post-Training Quantization for Referring Image Segmentation Sep 25, 2024 Image Segmentation Quantization
Code Code Available 0AlignedKV: Reducing Memory Access of KV-Cache with Precision-Aligned Quantization Sep 25, 2024 Quantization
Code Code Available 0A Formalization of Image Vectorization by Region Merging Sep 24, 2024 Image Segmentation Quantization
— Unverified 0Ultra-low latency quantum-inspired machine learning predictors implemented on FPGA Sep 24, 2024 Quantization Tensor Networks
— Unverified 0Communication and Energy Efficient Federated Learning using Zero-Order Optimization Technique Sep 24, 2024 Federated Learning Quantization
— Unverified 0Twin Network Augmentation: A Novel Training Strategy for Improved Spiking Neural Networks and Efficient Weight Quantization Sep 24, 2024 Knowledge Distillation Quantization
— Unverified 0Disentanglement with Factor Quantized Variational Autoencoders Sep 23, 2024 Disentanglement Inductive Bias
Code Code Available 0Thinking in Granularity: Dynamic Quantization for Image Super-Resolution by Intriguing Multi-Granularity Clues Sep 22, 2024 Image Super-Resolution Quantization
Code Code Available 0SPAQ-DL-SLAM: Towards Optimizing Deep Learning-based SLAM for Resource-Constrained Embedded Platforms Sep 22, 2024 Quantization Simultaneous Localization and Mapping
— Unverified 0