Reducing Communication for Split Learning by Randomized Top-k Sparsification | May 29, 2023 | Federated Learning, Quantization | Unverified | 0
SlimFit: Memory-Efficient Fine-Tuning of Transformer-based Models Using Training Dynamics | May 29, 2023 | GPU, Quantization | Unverified | 0
DeCoR: Defy Knowledge Forgetting by Predicting Earlier Audio Codes | May 29, 2023 | Acoustic Scene Classification, Continual Learning | Unverified | 0
LLM-QAT: Data-Free Quantization Aware Training for Large Language Models | May 29, 2023 | Data Free Quantization, Quantization | Code Available | 3
BRICS: Bi-level feature Representation of Image CollectionS | May 29, 2023 | Decoder, Image Generation | Unverified | 0
A Transfer Learning and Explainable Solution to Detect mpox from Smartphones images | May 29, 2023 | image-classification, Image Classification | Code Available | 0
Reversible Quantization Index Modulation for Static Deep Neural Network Watermarking | May 29, 2023 | Quantization | Unverified | 0
Disentanglement via Latent Quantization | May 28, 2023 | Disentanglement, Inductive Bias | Code Available | 1
Examining the Role and Limits of Batchnorm Optimization to Mitigate Diverse Hardware-noise in In-memory Computing | May 28, 2023 | Quantization | Unverified | 0
Efficient Storage of Fine-Tuned Models via Low-Rank Approximation of Weight Residuals | May 28, 2023 | Quantization | Unverified | 0
Scissorhands: Exploiting the Persistence of Importance Hypothesis for LLM KV Cache Compression at Test Time | May 26, 2023 | Quantization | Unverified | 0
2-bit Conformer quantization for automatic speech recognition | May 26, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
PQA: Exploring the Potential of Product Quantization in DNN Hardware Acceleration | May 25, 2023 | Quantization | Code Available | 0
NVTC: Nonlinear Vector Transform Coding | May 25, 2023 | Image Compression, Quantization | Code Available | 1
KeyPosS: Plug-and-Play Facial Landmark Detection through GPS-Inspired True-Range Multilateration | May 25, 2023 | Benchmarking, Face Recognition | Code Available | 1
RAND: Robustness Aware Norm Decay For Quantized Seq2seq Models | May 24, 2023 | Machine Translation, Model Compression | Unverified | 0
Just CHOP: Embarrassingly Simple LLM Compression | May 24, 2023 | Knowledge Distillation, Language Modeling | Unverified | 0
BinaryViT: Towards Efficient and Accurate Binary Vision Transformers | May 24, 2023 | Binarization, Quantization | Unverified | 0
QLoRA: Efficient Finetuning of Quantized LLMs | May 23, 2023 | Chatbot, GPU | Code Available | 6
Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation | May 23, 2023 | All, Image Generation | Code Available | 1
Adversarial Defenses via Vector Quantization | May 23, 2023 | Quantization | Unverified | 0
Downlink Clustering-Based Scheduling of IRS-Assisted Communications With Reconfiguration Constraints | May 23, 2023 | Clustering, Quantization | Unverified | 0
Memory-Efficient Fine-Tuning of Compressed Large Language Models via sub-4-bit Integer Quantization | May 23, 2023 | In-Context Learning, Language Modeling | Unverified | 0
Combining Multi-Objective Bayesian Optimization with Reinforcement Learning for TinyML | May 23, 2023 | Bayesian Optimization, Hyperparameter Optimization | Unverified | 0
Differential Privacy with Random Projections and Sign Random Projections | May 22, 2023 | Information Retrieval, Quantization | Unverified | 0
TinyissimoYOLO: A Quantized, Low-Memory Footprint, TinyML Object Detection Network for Low Power Microcontrollers | May 22, 2023 | Object, object-detection | Unverified | 0
Digital-SC: Digital Semantic Communication with Adaptive Network Split and Learned Non-Linear Quantization | May 22, 2023 | image-classification, Image Classification | Unverified | 0
TSPTQ-ViT: Two-scaled post-training quantization for vision transformer | May 22, 2023 | Quantization | Unverified | 0
Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline | May 22, 2023 | Quantization, Scheduling | Code Available | 1
Revisiting Data Augmentation in Model Compression: An Empirical and Comprehensive Study | May 22, 2023 | Data Augmentation, Knowledge Distillation | Unverified | 0
FAQ: Mitigating the Impact of Faults in the Weight Memory of DNN Accelerators through Fault-Aware Quantization | May 21, 2023 | Quantization | Unverified | 0
Atomic Anatomy of Low-Inertia Power Systems | May 21, 2023 | Anatomy, Quantization | Unverified | 0
Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models | May 21, 2023 | GPU, Quantization | Unverified | 0
Bi-ViT: Pushing the Limit of Vision Transformer Quantization | May 21, 2023 | Binarization, Quantization | Unverified | 0
Two-Bit RIS-Aided Communications at 3.5GHz: Some Insights from the Measurement Results Under Multiple Practical Scenes | May 19, 2023 | Intelligent Communication, Quantization | Unverified | 0
Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization | May 19, 2023 | Image Generation, Position | Code Available | 1
ReTAG: Reasoning Aware Table to Analytic Text Generation | May 19, 2023 | Data-to-Text Generation, Descriptive | Unverified | 0
PTQD: Accurate Post-Training Quantization for Diffusion Models | May 18, 2023 | Denoising, Image Generation | Code Available | 1
Boost Vision Transformer with GPU-Friendly Sparsity and Quantization | May 18, 2023 | Benchmarking, GPU | Unverified | 0
QPGesture: Quantization-Based and Phase-Guided Motion Matching for Natural Speech-Driven Gesture Generation | May 18, 2023 | Gesture Generation, Quantization | Code Available | 1
Q-SHED: Distributed Optimization at the Edge via Hessian Eigenvectors Quantization | May 18, 2023 | Distributed Optimization, Quantization | Unverified | 0
DQ-Whisper: Joint Distillation and Quantization for Efficient Multilingual Speech Recognition | May 18, 2023 | Knowledge Distillation, Quantization | Unverified | 0
Compress, Then Prompt: Improving Accuracy-Efficiency Trade-off of LLM Inference with Transferable Prompt | May 17, 2023 | GPU, Model Compression | Unverified | 0
MINT: Multiplier-less INTeger Quantization for Energy Efficient Spiking Neural Networks | May 16, 2023 | Quantization | Code Available | 0
Component Training of Turbo Autoencoders | May 16, 2023 | Quantization | Unverified | 0
Fast Inference of Tree Ensembles on ARM Devices | May 15, 2023 | Quantization | Unverified | 0
Task-Oriented Communication Design at Scale | May 15, 2023 | Quantization, Reinforcement Learning (RL) | Unverified | 0
Designing Discontinuities | May 15, 2023 | Econometrics, Quantization | Unverified | 0
Straightening Out the Straight-Through Estimator: Overcoming Optimization Challenges in Vector Quantized Networks | May 15, 2023 | image-classification, Image Classification | Unverified | 0
Federated TD Learning over Finite-Rate Erasure Channels: Linear Speedup under Markovian Sampling | May 14, 2023 | Distributed Optimization, Federated Learning | Unverified | 0