Implementation of a framework for deploying AI inference engines in FPGAs May 30, 2023 Quantization Resynthesis
— Unverified 0Low Precision Quantization-aware Training in Spiking Neural Networks with Differentiable Quantization Function May 30, 2023 Edge-computing Quantization
— Unverified 0Intriguing Properties of Quantization at Scale May 30, 2023 Quantization
— Unverified 0Stochastic Gradient Langevin Dynamics Based on Quantization with Increasing Resolution May 30, 2023 Quantization
— Unverified 0DeCoR: Defy Knowledge Forgetting by Predicting Earlier Audio Codes May 29, 2023 Acoustic Scene Classification Continual Learning
— Unverified 0Global-QSGD: Practical Floatless Quantization for Distributed Learning with Theoretical Guarantees May 29, 2023 Quantization
— Unverified 0Reducing Communication for Split Learning by Randomized Top-k Sparsification May 29, 2023 Federated Learning Quantization
— Unverified 0BRICS: Bi-level feature Representation of Image CollectionS May 29, 2023 Decoder Image Generation
— Unverified 0SlimFit: Memory-Efficient Fine-Tuning of Transformer-based Models Using Training Dynamics May 29, 2023 GPU Quantization
— Unverified 0Reversible Quantization Index Modulation for Static Deep Neural Network Watermarking May 29, 2023 Quantization
— Unverified 0A Transfer Learning and Explainable Solution to Detect mpox from Smartphones images May 29, 2023 image-classification Image Classification
Code Code Available 0Efficient Storage of Fine-Tuned Models via Low-Rank Approximation of Weight Residuals May 28, 2023 Quantization
— Unverified 0Examining the Role and Limits of Batchnorm Optimization to Mitigate Diverse Hardware-noise in In-memory Computing May 28, 2023 Quantization
— Unverified 02-bit Conformer quantization for automatic speech recognition May 26, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Scissorhands: Exploiting the Persistence of Importance Hypothesis for LLM KV Cache Compression at Test Time May 26, 2023 Quantization
— Unverified 0PQA: Exploring the Potential of Product Quantization in DNN Hardware Acceleration May 25, 2023 Quantization
Code Code Available 0BinaryViT: Towards Efficient and Accurate Binary Vision Transformers May 24, 2023 Binarization Quantization
— Unverified 0Just CHOP: Embarrassingly Simple LLM Compression May 24, 2023 Knowledge Distillation Language Modeling
— Unverified 0RAND: Robustness Aware Norm Decay For Quantized Seq2seq Models May 24, 2023 Machine Translation Model Compression
— Unverified 0Downlink Clustering-Based Scheduling of IRS-Assisted Communications With Reconfiguration Constraints May 23, 2023 Clustering Quantization
— Unverified 0Memory-Efficient Fine-Tuning of Compressed Large Language Models via sub-4-bit Integer Quantization May 23, 2023 In-Context Learning Language Modeling
— Unverified 0Combining Multi-Objective Bayesian Optimization with Reinforcement Learning for TinyML May 23, 2023 Bayesian Optimization Hyperparameter Optimization
— Unverified 0Adversarial Defenses via Vector Quantization May 23, 2023 Quantization
— Unverified 0Differential Privacy with Random Projections and Sign Random Projections May 22, 2023 Information Retrieval Quantization
— Unverified 0TSPTQ-ViT: Two-scaled post-training quantization for vision transformer May 22, 2023 Quantization
— Unverified 0Digital-SC: Digital Semantic Communication with Adaptive Network Split and Learned Non-Linear Quantization May 22, 2023 image-classification Image Classification
— Unverified 0TinyissimoYOLO: A Quantized, Low-Memory Footprint, TinyML Object Detection Network for Low Power Microcontrollers May 22, 2023 Object object-detection
— Unverified 0Revisiting Data Augmentation in Model Compression: An Empirical and Comprehensive Study May 22, 2023 Data Augmentation Knowledge Distillation
— Unverified 0Bi-ViT: Pushing the Limit of Vision Transformer Quantization May 21, 2023 Binarization Quantization
— Unverified 0FAQ: Mitigating the Impact of Faults in the Weight Memory of DNN Accelerators through Fault-Aware Quantization May 21, 2023 Quantization
— Unverified 0Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models May 21, 2023 GPU Quantization
— Unverified 0Atomic Anatomy of Low-Inertia Power Systems May 21, 2023 Anatomy Quantization
— Unverified 0ReTAG: Reasoning Aware Table to Analytic Text Generation May 19, 2023 Data-to-Text Generation Descriptive
— Unverified 0Two-Bit RIS-Aided Communications at 3.5GHz: Some Insights from the Measurement Results Under Multiple Practical Scenes May 19, 2023 Intelligent Communication Quantization
— Unverified 0Boost Vision Transformer with GPU-Friendly Sparsity and Quantization May 18, 2023 Benchmarking GPU
— Unverified 0DQ-Whisper: Joint Distillation and Quantization for Efficient Multilingual Speech Recognition May 18, 2023 Knowledge Distillation Quantization
— Unverified 0Q-SHED: Distributed Optimization at the Edge via Hessian Eigenvectors Quantization May 18, 2023 Distributed Optimization Quantization
— Unverified 0Compress, Then Prompt: Improving Accuracy-Efficiency Trade-off of LLM Inference with Transferable Prompt May 17, 2023 GPU Model Compression
— Unverified 0Component Training of Turbo Autoencoders May 16, 2023 Quantization
— Unverified 0MINT: Multiplier-less INTeger Quantization for Energy Efficient Spiking Neural Networks May 16, 2023 Quantization
Code Code Available 0Task-Oriented Communication Design at Scale May 15, 2023 Quantization Reinforcement Learning (RL)
— Unverified 0Straightening Out the Straight-Through Estimator: Overcoming Optimization Challenges in Vector Quantized Networks May 15, 2023 image-classification Image Classification
— Unverified 0Fast Inference of Tree Ensembles on ARM Devices May 15, 2023 Quantization
— Unverified 0Designing Discontinuities May 15, 2023 Econometrics Quantization
— Unverified 0Federated TD Learning over Finite-Rate Erasure Channels: Linear Speedup under Markovian Sampling May 14, 2023 Distributed Optimization Federated Learning
— Unverified 0Analyzing Compression Techniques for Computer Vision May 14, 2023 Knowledge Distillation Quantization
— Unverified 0Quantization in Spiking Neural Networks May 13, 2023 Quantization
Code Code Available 0GSB: Group Superposition Binarization for Vision Transformer with Limited Training Samples May 13, 2023 Binarization Knowledge Distillation
Code Code Available 0Accelerator-Aware Training for Transducer-Based Speech Recognition May 12, 2023 CPU Quantization
— Unverified 0Speaker Diaphragm Excursion Prediction: deep attention and online adaptation May 11, 2023 Deep Attention Quantization
— Unverified 0