MicroScopiQ: Accelerating Foundational Models through Outlier-Aware Microscaling Quantization Nov 8, 2024 Quantization
Code Code Available 1QuanCrypt-FL: Quantized Homomorphic Encryption with Pruning for Secure Federated Learning Nov 8, 2024 Computational Efficiency Federated Learning
— Unverified 0Green My LLM: Studying the key factors affecting the energy consumption of code assistants Nov 7, 2024 Quantization
— Unverified 0Saliency Assisted Quantization for Neural Networks Nov 7, 2024 image-classification Image Classification
— Unverified 0Compressive Spectrum Sensing with 1-bit ADCs Nov 7, 2024 compressed sensing Quantization
— Unverified 0Scaling Laws for Precision Nov 7, 2024 Quantization
Code Code Available 2BitNet a4.8: 4-bit Activations for 1-bit LLMs Nov 7, 2024 Quantization
Code Code Available 4SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models Nov 7, 2024 GPU Quantization
Code Code Available 4Multi-bit Distributed Detection of Sparse Stochastic Signals over Error-Prone Reporting Channels Nov 6, 2024 Quantization
— Unverified 0Interactions Across Blocks in Post-Training Quantization of Large Language Models Nov 6, 2024 Quantization
— Unverified 0An Edge Computing-Based Solution for Real-Time Leaf Disease Classification using Thermal Imaging Nov 6, 2024 Deep Learning Edge-computing
Code Code Available 0Sum Rate Maximization in the Constant Envelope MIMO Downlink with the RZF Precoder Nov 5, 2024 Quantization
— Unverified 0Hybrid Beamforming for Integrated Sensing and Communications With Low Resolution DACs Nov 5, 2024 ISAC Quantization
— Unverified 0Privacy-Preserving Graph-Based Machine Learning with Fully Homomorphic Encryption for Collaborative Anti-Money Laundering Nov 5, 2024 Computational Efficiency Graph Neural Network
Code Code Available 1Stochastic Monkeys at Play: Random Augmentations Cheaply Break LLM Safety Alignment Nov 5, 2024 Quantization Safety Alignment
Code Code Available 0Transferable Sequential Recommendation via Vector Quantized Meta Learning Nov 4, 2024 Meta-Learning Quantization
— Unverified 0"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization Nov 4, 2024 GPU Large Language Model
— Unverified 0Addressing Representation Collapse in Vector Quantized Models with One Linear Layer Nov 4, 2024 Quantization Representation Learning
Code Code Available 3BF-IMNA: A Bit Fluid In-Memory Neural Architecture for Neural Network Acceleration Nov 3, 2024 Quantization
— Unverified 0VQ-Map: Bird's-Eye-View Map Layout Estimation in Tokenized Discrete Space via Vector Quantization Nov 3, 2024 Quantization Representation Learning
Code Code Available 1Conformalized High-Density Quantile Regression via Dynamic Prototypes-based Probability Density Estimation Nov 2, 2024 Density Estimation quantile regression
Code Code Available 0Fundamental Trade-offs in Quantized Hybrid Radar Fusion: A CRB-Rate Perspective Nov 1, 2024 Integrated sensing and communication ISAC
— Unverified 0Optimizing Contextual Speech Recognition Using Vector Quantization for Efficient Retrieval Nov 1, 2024 Quantization Retrieval
— Unverified 0Abstracted Shapes as Tokens -- A Generalizable and Interpretable Model for Time-series Classification Nov 1, 2024 Quantization Representation Learning
Code Code Available 1Breaking Determinism: Fuzzy Modeling of Sequential Recommendation Using Discrete State Space Diffusion Model Oct 31, 2024 Quantization Sequential Recommendation
— Unverified 0ARQ: A Mixed-Precision Quantization Framework for Accurate and Certifiably Robust DNNs Oct 31, 2024 Quantization
— Unverified 0BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments Oct 31, 2024 Quantization
Code Code Available 1ALISE: Accelerating Large Language Model Serving with Speculative Scheduling Oct 31, 2024 Blocking Language Modeling
— Unverified 0Accelerated AI Inference via Dynamic Execution Methods Oct 30, 2024 Quantization
— Unverified 0A Comprehensive Study on Quantization Techniques for Large Language Models Oct 30, 2024 Quantization
— Unverified 0GWQ: Gradient-Aware Weight Quantization for Large Language Models Oct 30, 2024 Outlier Detection Quantization
— Unverified 0APCodec+: A Spectrum-Coding-Based High-Fidelity and High-Compression-Rate Neural Audio Codec with Staged Training Paradigm Oct 30, 2024 Decoder Quantization
— Unverified 0ELMGS: Enhancing memory and computation scaLability through coMpression for 3D Gaussian Splatting Oct 30, 2024 Quantization
— Unverified 0IntLoRA: Integral Low-rank Adaptation of Quantized Diffusion Models Oct 29, 2024 parameter-efficient fine-tuning Quantization
Code Code Available 1Data Generation for Hardware-Friendly Post-Training Quantization Oct 29, 2024 Data Augmentation GPU
Code Code Available 3HRPVT: High-Resolution Pyramid Vision Transformer for medium and small-scale human pose estimation Oct 29, 2024 Pose Estimation Quantization
— Unverified 0The Impact of Inference Acceleration Strategies on Bias of LLMs Oct 29, 2024 Quantization
Code Code Available 0NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks Oct 28, 2024 Quantization
Code Code Available 2EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation Oct 28, 2024 ARC Math
— Unverified 0Logarithmically Quantized Distributed Optimization over Dynamic Multi-Agent Networks Oct 27, 2024 Distributed Optimization Quantization
— Unverified 0Vector Quantization Prompting for Continual Learning Oct 27, 2024 Continual Learning Quantization
Code Code Available 1Unsupervised Panoptic Interpretation of Latent Spaces in GANs Using Space-Filling Vector Quantization Oct 27, 2024 Data Augmentation Quantization
Code Code Available 0Unleashing Dynamic Range and Resolution in Unlimited Sensing Framework via Novel Hardware Oct 26, 2024 Quantization
— Unverified 0DQRM: Deep Quantized Recommendation Models Oct 26, 2024 Quantization
Code Code Available 0You Never Know: Quantization Induces Inconsistent Biases in Vision-Language Foundation Models Oct 26, 2024 Quantization
— Unverified 0A Survey of Small Language Models Oct 25, 2024 Benchmarking Model Compression
— Unverified 0Content-Aware Radiance Fields: Aligning Model Complexity with Scene Intricacy Through Learned Bitwidth Quantization Oct 25, 2024 NeRF Quantization
Code Code Available 0COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training Oct 25, 2024 Language Modeling Language Modelling
Code Code Available 3Learning ID-free Item Representation with Token Crossing for Multimodal Recommendation Oct 25, 2024 Multimodal Recommendation Quantization
— Unverified 0Sliding DFT-based Signal Recovery for Modulo ADC with 1-bit Folding Information Oct 24, 2024 Quantization
— Unverified 0