GaussianImage: 1000 FPS Image Representation and Compression by 2D Gaussian Splatting Mar 13, 2024 GPU Quantization
Code Code Available 3Vector Quantization for Deep-Learning-Based CSI Feedback in Massive MIMO Systems Mar 12, 2024 Quantization
— Unverified 0Approaching Rate-Distortion Limits in Neural Compression with Lattice Transform Coding Mar 12, 2024 Quantization
— Unverified 0Chronos: Learning the Language of Time Series Mar 12, 2024 Gaussian Processes Language Modeling
Code Code Available 7COMQ: A Backpropagation-Free Algorithm for Post-Training Quantization Mar 11, 2024 Quantization
Code Code Available 1What Makes Quantization for Large Language Models Hard? An Empirical Study from the Lens of Perturbation Mar 11, 2024 Computational Efficiency Quantization
— Unverified 0FlowVQTalker: High-Quality Emotional Talking Face Generation through Normalizing Flow and Quantization Mar 11, 2024 Face Generation Quantization
— Unverified 0QuantTune: Optimizing Model Quantization with Adaptive Outlier-Driven Fine Tuning Mar 11, 2024 Quantization
— Unverified 0FrameQuant: Flexible Low-Bit Quantization for Transformers Mar 10, 2024 Quantization
Code Code Available 1Micro-Fracture Detection in Photovoltaic Cells with Hardware-Constrained Devices and Computer Vision Mar 8, 2024 Fracture detection Quantization
— Unverified 0The Impact of Quantization on the Robustness of Transformer-based Text Classifiers Mar 8, 2024 Quantization SST-2
— Unverified 0Enhancing Multimodal Unified Representations for Cross Modal Generalization Mar 8, 2024 Contrastive Learning Disentanglement
— Unverified 0Algorithm-Hardware Co-Design of Distribution-Aware Logarithmic-Posit Encodings for Efficient DNN Inference Mar 8, 2024 Quantization
Code Code Available 0GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM Mar 8, 2024 Quantization
Code Code Available 2Self-Adapting Large Visual-Language Models to Edge Devices across Visual Modalities Mar 7, 2024 Contrastive Learning Knowledge Distillation
Code Code Available 1QAQ: Quality Adaptive Quantization for LLM KV Cache Mar 7, 2024 Quantization Question Answering
Code Code Available 2On-demand Quantization for Green Federated Generative Diffusion in Mobile Edge Networks Mar 7, 2024 Diversity Federated Learning
— Unverified 0LoCoDL: Communication-Efficient Distributed Learning with Local Training and Compression Mar 7, 2024 Distributed Optimization Federated Learning
— Unverified 0ShortGPT: Layers in Large Language Models are More Redundant Than You Expect Mar 6, 2024 Quantization
Code Code Available 2Adaptive Integrate-and-Fire Time Encoding Machine with Quantization Mar 5, 2024 Quantization
— Unverified 0Design of Stochastic Quantizers for Privacy Preservation Mar 5, 2024 Privacy Preserving Quantization
— Unverified 0EasyQuant: An Efficient Data-free Quantization Algorithm for LLMs Mar 5, 2024 Data Free Quantization Quantization
— Unverified 0NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models Mar 5, 2024 Quantization Speech Synthesis
Code Code Available 3Behavior Generation with Latent Actions Mar 5, 2024 Autonomous Driving Decision Making
Code Code Available 3VQSynery: Robust Drug Synergy Prediction With Vector Quantization Mechanism Mar 5, 2024 Quantization
— Unverified 0Deep-Learned Compression for Radio-Frequency Signal Classification Mar 5, 2024 Classification Decision Making
— Unverified 0FlowPrecision: Advancing FPGA-Based Real-Time Fluid Flow Estimation with Linear Quantization Mar 4, 2024 Quantization
— Unverified 0Towards efficient deep autoencoders for multivariate time series anomaly detection Mar 4, 2024 Anomaly Detection Model Compression
— Unverified 0Neural Network Assisted Lifting Steps For Improved Fully Scalable Lossy Image Compression in JPEG 2000 Mar 4, 2024 Image Compression Quantization
Code Code Available 0Better Schedules for Low Precision Training of Deep Neural Networks Mar 4, 2024 Node Classification Quantization
— Unverified 0A Hierarchical Federated Learning Approach for the Internet of Things Mar 3, 2024 Federated Learning Quantization
— Unverified 0On the Compressibility of Quantized Large Language Models Mar 3, 2024 Data Compression Quantization
— Unverified 0Extracting Usable Predictions from Quantized Networks through Uncertainty Quantification for OOD Detection Mar 2, 2024 Quantization Uncertainty Quantification
Code Code Available 0LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization Mar 2, 2024 GPU Quantization
Code Code Available 1IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact Mar 2, 2024 Language Modeling Language Modelling
Code Code Available 3"Lossless" Compression of Deep Neural Networks: A High-dimensional Neural Tangent Kernel Approach Mar 1, 2024 Model Compression Quantization
Code Code Available 1BasedAI: A decentralized P2P network for Zero Knowledge Large Language Models (ZK-LLMs) Mar 1, 2024 Language Modeling Language Modelling
— Unverified 0NeuraLUT: Hiding Neural Network Density in Boolean Synthesizable Functions Feb 29, 2024 Quantization
Code Code Available 1Variable-Rate Learned Image Compression with Multi-Objective Optimization and Quantization-Reconstruction Offsets Feb 29, 2024 Image Compression Quantization
— Unverified 0T3DNet: Compressing Point Cloud Models for Lightweight 3D Recognition Feb 29, 2024 Autonomous Driving Quantization
— Unverified 0Ef-QuantFace: Streamlined Face Recognition with Small Data and Low-Bit Precision Feb 28, 2024 Face Recognition Quantization
— Unverified 0No Token Left Behind: Reliable KV Cache Compression via Importance-Aware Mixed Precision Quantization Feb 28, 2024 Quantization
— Unverified 0FlattenQuant: Breaking Through the Inference Compute-bound for Large Language Models with Per-tensor Quantization Feb 28, 2024 GPU Quantization
— Unverified 0Evaluating Quantized Large Language Models Feb 28, 2024 Mamba Quantization
Code Code Available 2Inpainting Computational Fluid Dynamics with Deep Learning Feb 27, 2024 Deep Learning Quantization
— Unverified 0Neural Video Compression with Feature Modulation Feb 27, 2024 Blocking Quantization
— Unverified 0Adaptive quantization with mixed-precision based on low-cost proxy Feb 27, 2024 Neural Architecture Search Quantization
— Unverified 0Rethinking Mutual Information for Language Conditioned Skill Discovery on Imitation Learning Feb 27, 2024 Imitation Learning Quantization
— Unverified 0Distortion-Controlled Dithering with Reduced Recompression Rate Feb 26, 2024 Data Compression Image Compression
— Unverified 0SPC-NeRF: Spatial Predictive Compression for Voxel Based Radiance Field Feb 26, 2024 Image Compression NeRF
— Unverified 0