Magnificent Minified Models | Jun 16, 2023 | Quantization | Unverified 0
ZeRO++: Extremely Efficient Collective Communication for Giant Model Training | Jun 16, 2023 | GPU, Quantization | Unverified 0
HiNeRV: Video Compression with Hierarchical Encoding-based Neural Representation | Jun 16, 2023 | Model Compression, Quantization | Code Available 1
Evaluation and Optimization of Gradient Compression for Distributed Deep Learning | Jun 15, 2023 | Deep Learning, GPU | Code Available 1
Neural Network Compression using Binarization and Few Full-Precision Weights | Jun 15, 2023 | Binarization, CPU | Unverified 0
PUGAN: Physical Model-Guided Underwater Image Enhancement Using GAN with Dual-Discriminators | Jun 15, 2023 | Image Enhancement, Quantization | Code Available 0
High-performance deep spiking neural networks with 0.3 spikes per neuron | Jun 14, 2023 | image-classification, Image Classification | Unverified 0
GQFedWAvg: Optimization-Based Quantized Federated Learning in General Edge Computing Systems | Jun 13, 2023 | Edge-computing, Federated Learning | Code Available 0
INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank Adaptation | Jun 13, 2023 | Language Modeling, Language Modelling | Code Available 4
SqueezeLLM: Dense-and-Sparse Quantization | Jun 13, 2023 | GPU, Quantization | Code Available 6
Discrete Graph Auto-Encoder | Jun 13, 2023 | Graph Generation, Quantization | Unverified 0
MFSN: Multi-perspective Fusion Search Network For Pre-training Knowledge in Speech Emotion Recognition | Jun 12, 2023 | Emotion Recognition, Quantization | Unverified 0
NF4 Isn't Information Theoretically Optimal (and that's Good) | Jun 12, 2023 | Quantization | Code Available 1
Resource Efficient Neural Networks Using Hessian Based Pruning | Jun 12, 2023 | GPU, image-classification | Unverified 0
Efficient and Robust Quantization-aware Training via Adaptive Coreset Selection | Jun 12, 2023 | Model Compression, Quantization | Code Available 1
Sparse-Inductive Generative Adversarial Hashing for Nearest Neighbor Search | Jun 12, 2023 | compressed sensing, Quantization | Unverified 0
High-Fidelity Audio Compression with Improved RVQGAN | Jun 11, 2023 | Audio Compression, Audio Generation | Code Available 3
End-to-End Neural Network Compression via ℓ1/ℓ2 Regularized Latency Surrogates | Jun 9, 2023 | Neural Architecture Search, Neural Network Compression | Unverified 0
Mixed-TD: Efficient Neural Network Accelerator with Layer-Specific Tensor Decomposition | Jun 8, 2023 | Efficient Neural Network, Quantization | Code Available 0
Precision-aware Latency and Energy Balancing on Multi-Accelerator Platforms for DNN Inference | Jun 8, 2023 | Quantization | Unverified 0
Iterative Signal Processing for Integrated Sensing and Communication Systems | Jun 8, 2023 | Integrated sensing and communication, ISAC | Unverified 0
Augmenting Hessians with Inter-Layer Dependencies for Mixed-Precision Post-Training Quantization | Jun 8, 2023 | Quantization | Unverified 0
MobileNMT: Enabling Translation in 15MB and 30ms | Jun 7, 2023 | Model Compression, NMT | Code Available 1
SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression | Jun 5, 2023 | GPU, Language Modelling | Code Available 2
Sensitivity-Aware Finetuning for Accuracy Recovery on Deep Learning Hardware | Jun 5, 2023 | Deep Learning, Quantization | Unverified 0
OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Models | Jun 4, 2023 | parameter-efficient fine-tuning, Quantization | Code Available 1
Temporal Dynamic Quantization for Diffusion Models | Jun 4, 2023 | Quantization | Unverified 0
Modular Transformers: Compressing Transformers into Modularized Layers for Flexible Efficient Inference | Jun 4, 2023 | Decoder, Knowledge Distillation | Unverified 0
An Information-Theoretic Analysis of Self-supervised Discrete Representations of Speech | Jun 4, 2023 | Quantization, Representation Learning | Code Available 0
Binary and Ternary Natural Language Generation | Jun 2, 2023 | Machine Translation, Quantization | Code Available 1
Group channel pruning and spatial attention distilling for object detection | Jun 2, 2023 | Knowledge Distillation, Model Compression | Unverified 0
Adaptive Message Quantization and Parallelization for Distributed Full-graph GNN Training | Jun 2, 2023 | Quantization | Code Available 1
Quantization-Aware and Tensor-Compressed Training of Transformers for Natural Language Understanding | Jun 1, 2023 | Natural Language Understanding, Quantization | Unverified 0
Towards Learning Discrete Representations via Self-Supervision for Wearables-Based Human Activity Recognition | Jun 1, 2023 | Activity Recognition, Human Activity Recognition | Unverified 0
FlexRound: Learnable Rounding based on Element-wise Division for Post-Training Quantization | Jun 1, 2023 | image-classification, Image Classification | Code Available 0
On the Effectiveness of Hybrid Mutual Information Estimation | Jun 1, 2023 | Mutual Information Estimation, Quantization | Unverified 0
Dynamic quantized consensus under DoS attacks: Towards a tight zooming-out factor | Jun 1, 2023 | Quantization | Unverified 0
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration | Jun 1, 2023 | Autonomous Driving, Cloud Computing | Code Available 6
Asymptotic Performance Analysis of Large-Scale Active IRS-Aided Wireless Network | May 31, 2023 | Quantization | Unverified 0
Fast-SNN: Fast Spiking Neural Network by Converting Quantized ANN | May 31, 2023 | image-classification, Image Classification | Code Available 1
MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training | May 31, 2023 | Language Modelling, Quantization | Code Available 2
Compression with Bayesian Implicit Neural Representations | May 30, 2023 | Audio Compression, Quantization | Code Available 1
AdANNS: A Framework for Adaptive Semantic Search | May 30, 2023 | Natural Questions, Quantization | Code Available 1
PreQuant: A Task-agnostic Quantization Approach for Pre-trained Language Models | May 30, 2023 | parameter-efficient fine-tuning, Quantization | Unverified 0
Low Precision Quantization-aware Training in Spiking Neural Networks with Differentiable Quantization Function | May 30, 2023 | Edge-computing, Quantization | Unverified 0
Implementation of a framework for deploying AI inference engines in FPGAs | May 30, 2023 | Quantization, Resynthesis | Unverified 0
Intriguing Properties of Quantization at Scale | May 30, 2023 | Quantization | Unverified 0
Towards Accurate Post-training Quantization for Diffusion Models | May 30, 2023 | Data Free Quantization, Image Generation | Code Available 1
Stochastic Gradient Langevin Dynamics Based on Quantization with Increasing Resolution | May 30, 2023 | Quantization | Unverified 0
Global-QSGD: Practical Floatless Quantization for Distributed Learning with Theoretical Guarantees | May 29, 2023 | Quantization | Unverified 0