Deep activity propagation via weight initialization in spiking neural networks Oct 1, 2024 Quantization
— Unverified 0Trainable pruned ternary quantization for medical signal classification models Oct 1, 2024 Model Compression Quantization
Code Code Available 0Quantized and Asynchronous Federated Learning Sep 30, 2024 Federated Learning Quantization
— Unverified 0Mixed-Precision Embeddings for Large-Scale Recommendation Models Sep 30, 2024 Quantization Recommendation Systems
— Unverified 0Constraint Guided Model Quantization of Neural Networks Sep 30, 2024 model Quantization
— Unverified 0Accelerating PoT Quantization on Edge Devices Sep 30, 2024 CPU Quantization
Code Code Available 0Aggressive Post-Training Compression on Extremely Large Language Models Sep 30, 2024 Model Compression Network Pruning
— Unverified 0Rotated Runtime Smooth: Training-Free Activation Smoother for accurate INT4 inference Sep 30, 2024 Quantization
— Unverified 0InfantCryNet: A Data-driven Framework for Intelligent Analysis of Infant Cries Sep 29, 2024 Knowledge Distillation Model Compression
— Unverified 0Efficient Federated Intrusion Detection in 5G ecosystem using optimized BERT-based model Sep 28, 2024 Federated Learning Intrusion Detection
Code Code Available 0Asymptotic tracking control of dynamic reference over homomorphically encrypted data with finite modulus Sep 27, 2024 Quantization
— Unverified 0Heterogeneous quantization regularizes spiking neural network activity Sep 27, 2024 Denoising Quantization
— Unverified 0A method of using RSVD in residual calculation of LowBit GEMM Sep 27, 2024 Data Free Quantization Quantization
— Unverified 0Fronthaul-Constrained Distributed Radar Sensing Sep 26, 2024 Quantization
— Unverified 0Digital and Hybrid Precoding Designs in Massive MIMO with Low-Resolution ADCs Sep 26, 2024 Quantization
Code Code Available 0Language Models as Zero-shot Lossless Gradient Compressors: Towards General Neural Parameter Prior Models Sep 26, 2024 Neural Network Compression Quantization
Code Code Available 0Efficient Arbitrary Precision Acceleration for Large Language Models on GPU Tensor Cores Sep 26, 2024 GPU Management
— Unverified 0P4Q: Learning to Prompt for Quantization in Visual-language Models Sep 26, 2024 Quantization
— Unverified 0MoGenTS: Motion Generation based on Spatial-Temporal Joint Modeling Sep 26, 2024 Motion Generation Quantization
— Unverified 0Using Random Codebooks for Audio Neural AutoEncoders Sep 25, 2024 Audio Compression Quantization
— Unverified 0Reinforcement Learning for Finite Space Mean-Field Type Games Sep 25, 2024 Deep Reinforcement Learning Q-Learning
— Unverified 0Search for Efficient Large Language Models Sep 25, 2024 GPU Model Compression
Code Code Available 1AlignedKV: Reducing Memory Access of KV-Cache with Precision-Aligned Quantization Sep 25, 2024 Quantization
Code Code Available 0A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms Sep 25, 2024 Quantization
— Unverified 0Accumulator-Aware Post-Training Quantization Sep 25, 2024 image-classification Image Classification
— Unverified 0INT-FlashAttention: Enabling Flash Attention for INT8 Quantization Sep 25, 2024 GPU Quantization
Code Code Available 2VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models Sep 25, 2024 Quantization
Code Code Available 4BitQ: Tailoring Block Floating Point Precision for Improved DNN Efficiency on Resource-Constrained Devices Sep 25, 2024 image-classification Image Classification
Code Code Available 1PTQ4RIS: Post-Training Quantization for Referring Image Segmentation Sep 25, 2024 Image Segmentation Quantization
Code Code Available 0LLaMa-SciQ: An Educational Chatbot for Answering Science MCQ Sep 25, 2024 Chatbot GSM8K
— Unverified 0Communication and Energy Efficient Federated Learning using Zero-Order Optimization Technique Sep 24, 2024 Federated Learning Quantization
— Unverified 0A Formalization of Image Vectorization by Region Merging Sep 24, 2024 Image Segmentation Quantization
— Unverified 0Twin Network Augmentation: A Novel Training Strategy for Improved Spiking Neural Networks and Efficient Weight Quantization Sep 24, 2024 Knowledge Distillation Quantization
— Unverified 0TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control Sep 24, 2024 Clustering Language Modelling
Code Code Available 3Ultra-low latency quantum-inspired machine learning predictors implemented on FPGA Sep 24, 2024 Quantization Tensor Networks
— Unverified 0Disentanglement with Factor Quantized Variational Autoencoders Sep 23, 2024 Disentanglement Inductive Bias
Code Code Available 0MICSim: A Modular Simulator for Mixed-signal Compute-in-Memory based AI Accelerator Sep 23, 2024 Quantization
Code Code Available 1SPAQ-DL-SLAM: Towards Optimizing Deep Learning-based SLAM for Resource-Constrained Embedded Platforms Sep 22, 2024 Quantization Simultaneous Localization and Mapping
— Unverified 0Thinking in Granularity: Dynamic Quantization for Image Super-Resolution by Intriguing Multi-Granularity Clues Sep 22, 2024 Image Super-Resolution Quantization
Code Code Available 0DilateQuant: Accurate and Efficient Diffusion Quantization via Weight Dilation Sep 22, 2024 Image Generation Knowledge Distillation
— Unverified 0CorBin-FL: A Differentially Private Federated Learning Mechanism using Common Randomness Sep 20, 2024 Federated Learning Quantization
— Unverified 0Reduced bit median quantization: A middle process for Efficient Image Compression Sep 20, 2024 Image Compression Quantization
— Unverified 0PTQ4ADM: Post-Training Quantization for Efficient Text Conditional Audio Diffusion Models Sep 20, 2024 Audio Generation Audio Synthesis
— Unverified 0TalkMosaic: Interactive PhotoMosaic with Multi-modal LLM Q&A Interactions Sep 20, 2024 Quantization
— Unverified 0NDVQ: Robust Neural Audio Codec with Normal Distribution-Based Vector Quantization Sep 19, 2024 Audio Compression Audio Generation
— Unverified 0Scaling FP8 training to trillion-token LLMs Sep 19, 2024 Quantization
— Unverified 0Impact of ML Optimization Tactics on Greener Pre-Trained ML Models Sep 19, 2024 GPU image-classification
— Unverified 0Art and Science of Quantizing Large-Scale Models: A Comprehensive Overview Sep 18, 2024 Quantization
— Unverified 0Low Frame-rate Speech Codec: a Codec Designed for Fast High-quality Speech LLM Training and Inference Sep 18, 2024 Audio Compression Language Modeling
— Unverified 0Pareto Data Framework: Steps Towards Resource-Efficient Decision Making Using Minimum Viable Data (MVD) Sep 18, 2024 Decision Making Quantization
— Unverified 0