Spectral Codecs: Improving Non-Autoregressive Speech Synthesis with Spectrogram-Based Audio Codecs | Jun 7, 2024 | Quantization, Speech Synthesis | Unverified | 0
Activation Map-based Vector Quantization for 360-degree Image Semantic Communication | Jun 7, 2024 | Quantization, Semantic Communication | Unverified | 0
Winner-takes-all learners are geometry-aware conditional density estimators | Jun 7, 2024 | All, Density Estimation | Code Available | 0
Real-Time Spacecraft Pose Estimation Using Mixed-Precision Quantized Neural Network on COTS Reconfigurable MPSoC | Jun 6, 2024 | Pose Estimation, Quantization | Code Available | 0
Proofread: Fixes All Errors with One Tap | Jun 6, 2024 | All, Quantization | Unverified | 0
BitsFusion: 1.99 bits Weight Quantization of Diffusion Model | Jun 6, 2024 | Image Generation, model | Unverified | 0
USM RNN-T model weights binarization | Jun 5, 2024 | Binarization, model | Unverified | 0
Zeroth-Order Fine-Tuning of LLMs with Extreme Sparsity | Jun 5, 2024 | GPU, Quantization | Unverified | 0
VQUNet: Vector Quantization U-Net for Defending Adversarial Atacks by Regularizing Unwanted Noise | Jun 5, 2024 | Adversarial Attack, Quantization | Unverified | 0
Fine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement Learning | Jun 5, 2024 | Quantization, Reinforcement Learning (RL) | Code Available | 1
QJL: 1-Bit Quantized JL Transform for KV Cache Quantization with Zero Overhead | Jun 5, 2024 | Quantization | Code Available | 1
Mixed-Precision Federated Learning via Multi-Precision Over-The-Air Aggregation | Jun 4, 2024 | Computational Efficiency, Edge-computing | Unverified | 0
ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation | Jun 4, 2024 | Quantization, Video Generation | Code Available | 1
SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining | Jun 4, 2024 | Quantization, Sparse Learning | Code Available | 1
Toward Efficient Deep Spiking Neuron Networks: A Survey On Compression | Jun 3, 2024 | Knowledge Distillation, Quantization | Unverified | 0
DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs | Jun 3, 2024 | Management, Quantization | Code Available | 2
CE-VAE: Capsule Enhanced Variational AutoEncoder for Underwater Image Enhancement | Jun 3, 2024 | Image Enhancement, Image Generation | Code Available | 1
Log-Scale Quantization in Distributed First-Order Methods: Gradient-based Learning from Distributed Data | Jun 2, 2024 | Distributed Optimization, Quantization | Unverified | 0
MagR: Weight Magnitude Reduction for Enhancing Post-Training Quantization | Jun 2, 2024 | Quantization | Code Available | 1
Privacy-Aware Randomized Quantization via Linear Programming | Jun 1, 2024 | Quantization | Code Available | 0
Effective Interplay between Sparsity and Quantization: From Theory to Practice | May 31, 2024 | Computational Efficiency, Model Compression | Unverified | 0
Outliers and Calibration Sets have Diminishing Effect on Quantization of Modern LLMs | May 31, 2024 | Quantization | Unverified | 0
Locking Machine Learning Models into Hardware | May 31, 2024 | Quantization | Unverified | 0
LCQ: Low-Rank Codebook based Quantization for Large Language Models | May 31, 2024 | Model Compression, Quantization | Unverified | 0
An Efficient Network with Novel Quantization Designed for Massive MIMO CSI Feedback | May 30, 2024 | Quantization | Unverified | 0
S3D: A Simple and Cost-Effective Self-Speculative Decoding Scheme for Low-Memory GPUs | May 30, 2024 | GPU, Quantization | Unverified | 0
HQ-DiT: Efficient Diffusion Transformer with FP4 Hybrid Quantization | May 30, 2024 | Quantization | Unverified | 0
P^2-ViT: Power-of-Two Post-Training Quantization and Acceleration for Fully Quantized Vision Transformer | May 30, 2024 | Quantization | Code Available | 1
One QuantLLM for ALL: Fine-tuning Quantized LLMs Once for Efficient Deployments | May 30, 2024 | All, Quantization | Unverified | 0
CV-VAE: A Compatible Video VAE for Latent Generative Video Models | May 30, 2024 | Quantization | Code Available | 3
Information Entropy Guided Height-aware Histogram for Quantization-friendly Pillar Feature Encoder | May 29, 2024 | 3D Object Detection, Autonomous Driving | Unverified | 0
Compressing Large Language Models using Low Rank and Low Precision Decomposition | May 29, 2024 | Quantization | Code Available | 2
4-bit Shampoo for Memory-Efficient Network Training | May 28, 2024 | image-classification, Image Classification | Code Available | 1
LLaMA-NAS: Efficient Neural Architecture Search for Large Language Models | May 28, 2024 | Neural Architecture Search, Quantization | Unverified | 0
I-LLM: Efficient Integer-Only Inference for Fully-Quantized Low-Bit Large Language Models | May 28, 2024 | Quantization | Unverified | 0
SLMRec: Distilling Large Language Models into Small for Sequential Recommendation | May 28, 2024 | Knowledge Distillation, Language Modeling | Code Available | 1
Exploiting LLM Quantization | May 28, 2024 | Code Generation, Quantization | Code Available | 1
The Binary Quantized Neural Network for Dense Prediction via Specially Designed Upsampling and Attention | May 28, 2024 | object-detection, Object Detection | Unverified | 0
MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization | May 28, 2024 | Denoising, Quantization | Unverified | 0
CLAQ: Pushing the Limits of Low-Bit Post-Training Quantization for LLMs | May 27, 2024 | Computational Efficiency, Quantization | Code Available | 0
UniCompress: Enhancing Multi-Data Medical Image Compression with Knowledge Distillation | May 27, 2024 | Image Compression, Knowledge Distillation | Unverified | 0
Di^2Pose: Discrete Diffusion Model for Occluded 3D Human Pose Estimation | May 27, 2024 | 3D Human Pose Estimation, Monocular 3D Human Pose Estimation | Unverified | 0
BeamVQ: Aligning Space-Time Forecasting Model via Self-training on Physics-aware Metrics | May 27, 2024 | Decoder, Quantization | Unverified | 0
Node Identifiers: Compact, Discrete Representations for Efficient Graph Learning | May 26, 2024 | Computational Efficiency, Graph Classification | Code Available | 1
SpinQuant: LLM quantization with learned rotations | May 26, 2024 | Quantization | Code Available | 5
LoQT: Low-Rank Adapters for Quantized Pretraining | May 26, 2024 | GPU, Language Modeling | Code Available | 2
M^3GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and Generation | May 25, 2024 | Language Modeling, Language Modelling | Code Available | 1
PTQ4DiT: Post-training Quantization for Diffusion Transformers | May 25, 2024 | Image Generation, Quantization | Code Available | 1
FastQuery: Communication-efficient Embedding Table Query for Private LLM Inference | May 25, 2024 | Quantization | Unverified | 0
Massive MIMO-ISAC System With 1-Bit ADCs/DACs | May 24, 2024 | Integrated sensing and communication, ISAC | Unverified | 0