HarmLevelBench: Evaluating Harm-Level Compliance and the Impact of Quantization on Model Alignment Nov 11, 2024 Quantization
— Unverified 0HAFLQ: Heterogeneous Adaptive Federated LoRA Fine-tuned LLM with Quantization Nov 10, 2024 Quantization text-classification
— Unverified 0Expansion Quantization Network: An Efficient Micro-emotion Annotation and Detection Framework Nov 9, 2024 Emotion Detection and Classification Quantization
Code Code Available 0An asymmetric heuristic for trained ternary quantization based on the statistics of the weights: an application to medical signal classification Nov 9, 2024 Quantization
Code Code Available 0Optimizing Large Language Models through Quantization: A Comparative Analysis of PTQ and QAT Techniques Nov 9, 2024 Quantization
— Unverified 0Intelligent Fault Diagnosis of Type and Severity in Low-Frequency, Low Bit-Depth Signals Nov 9, 2024 Fault Diagnosis Quantization
— Unverified 0When are 1.58 bits enough? A Bottom-up Exploration of BitNet Quantization Nov 8, 2024 Decoder Quantization
— Unverified 0Rate-aware Compression for NeRF-based Volumetric Video Nov 8, 2024 NeRF Quantization
— Unverified 0QuanCrypt-FL: Quantized Homomorphic Encryption with Pruning for Secure Federated Learning Nov 8, 2024 Computational Efficiency Federated Learning
— Unverified 0Aligned Vector Quantization for Edge-Cloud Collabrative Vision-Language Models Nov 8, 2024 Quantization Question Answering
— Unverified 0Qwen2.5-32B: Leveraging Self-Consistent Tool-Integrated Reasoning for Bengali Mathematical Olympiad Problem Solving Nov 8, 2024 Prompt Engineering Quantization
— Unverified 0Compressive Spectrum Sensing with 1-bit ADCs Nov 7, 2024 compressed sensing Quantization
— Unverified 0Saliency Assisted Quantization for Neural Networks Nov 7, 2024 image-classification Image Classification
— Unverified 0Green My LLM: Studying the key factors affecting the energy consumption of code assistants Nov 7, 2024 Quantization
— Unverified 0Interactions Across Blocks in Post-Training Quantization of Large Language Models Nov 6, 2024 Quantization
— Unverified 0Multi-bit Distributed Detection of Sparse Stochastic Signals over Error-Prone Reporting Channels Nov 6, 2024 Quantization
— Unverified 0An Edge Computing-Based Solution for Real-Time Leaf Disease Classification using Thermal Imaging Nov 6, 2024 Deep Learning Edge-computing
Code Code Available 0Stochastic Monkeys at Play: Random Augmentations Cheaply Break LLM Safety Alignment Nov 5, 2024 Quantization Safety Alignment
Code Code Available 0Hybrid Beamforming for Integrated Sensing and Communications With Low Resolution DACs Nov 5, 2024 ISAC Quantization
— Unverified 0Sum Rate Maximization in the Constant Envelope MIMO Downlink with the RZF Precoder Nov 5, 2024 Quantization
— Unverified 0"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization Nov 4, 2024 GPU Large Language Model
— Unverified 0Transferable Sequential Recommendation via Vector Quantized Meta Learning Nov 4, 2024 Meta-Learning Quantization
— Unverified 0BF-IMNA: A Bit Fluid In-Memory Neural Architecture for Neural Network Acceleration Nov 3, 2024 Quantization
— Unverified 0Conformalized High-Density Quantile Regression via Dynamic Prototypes-based Probability Density Estimation Nov 2, 2024 Density Estimation quantile regression
Code Code Available 0Optimizing Contextual Speech Recognition Using Vector Quantization for Efficient Retrieval Nov 1, 2024 Quantization Retrieval
— Unverified 0Fundamental Trade-offs in Quantized Hybrid Radar Fusion: A CRB-Rate Perspective Nov 1, 2024 Integrated sensing and communication ISAC
— Unverified 0ARQ: A Mixed-Precision Quantization Framework for Accurate and Certifiably Robust DNNs Oct 31, 2024 Quantization
— Unverified 0Breaking Determinism: Fuzzy Modeling of Sequential Recommendation Using Discrete State Space Diffusion Model Oct 31, 2024 Quantization Sequential Recommendation
— Unverified 0ALISE: Accelerating Large Language Model Serving with Speculative Scheduling Oct 31, 2024 Blocking Language Modeling
— Unverified 0GWQ: Gradient-Aware Weight Quantization for Large Language Models Oct 30, 2024 Outlier Detection Quantization
— Unverified 0APCodec+: A Spectrum-Coding-Based High-Fidelity and High-Compression-Rate Neural Audio Codec with Staged Training Paradigm Oct 30, 2024 Decoder Quantization
— Unverified 0ELMGS: Enhancing memory and computation scaLability through coMpression for 3D Gaussian Splatting Oct 30, 2024 Quantization
— Unverified 0Accelerated AI Inference via Dynamic Execution Methods Oct 30, 2024 Quantization
— Unverified 0A Comprehensive Study on Quantization Techniques for Large Language Models Oct 30, 2024 Quantization
— Unverified 0HRPVT: High-Resolution Pyramid Vision Transformer for medium and small-scale human pose estimation Oct 29, 2024 Pose Estimation Quantization
— Unverified 0The Impact of Inference Acceleration Strategies on Bias of LLMs Oct 29, 2024 Quantization
Code Code Available 0EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation Oct 28, 2024 ARC Math
— Unverified 0Unsupervised Panoptic Interpretation of Latent Spaces in GANs Using Space-Filling Vector Quantization Oct 27, 2024 Data Augmentation Quantization
Code Code Available 0Logarithmically Quantized Distributed Optimization over Dynamic Multi-Agent Networks Oct 27, 2024 Distributed Optimization Quantization
— Unverified 0Unleashing Dynamic Range and Resolution in Unlimited Sensing Framework via Novel Hardware Oct 26, 2024 Quantization
— Unverified 0You Never Know: Quantization Induces Inconsistent Biases in Vision-Language Foundation Models Oct 26, 2024 Quantization
— Unverified 0DQRM: Deep Quantized Recommendation Models Oct 26, 2024 Quantization
Code Code Available 0A Survey of Small Language Models Oct 25, 2024 Benchmarking Model Compression
— Unverified 0Content-Aware Radiance Fields: Aligning Model Complexity with Scene Intricacy Through Learned Bitwidth Quantization Oct 25, 2024 NeRF Quantization
Code Code Available 0Learning ID-free Item Representation with Token Crossing for Multimodal Recommendation Oct 25, 2024 Multimodal Recommendation Quantization
— Unverified 0TesseraQ: Ultra Low-Bit LLM Post-Training Quantization with Block Reconstruction Oct 24, 2024 Quantization
— Unverified 0The Nature of Mathematical Modeling and Probabilistic Optimization Engineering in Generative AI Oct 24, 2024 Quantization
— Unverified 0Sliding DFT-based Signal Recovery for Modulo ADC with 1-bit Folding Information Oct 24, 2024 Quantization
— Unverified 0A Counterexample in Cross-Correlation Template Matching Oct 24, 2024 Image Registration Quantization
— Unverified 0Adaptive Wireless Image Semantic Transmission: Design, Simulation, and Prototype Validation Oct 23, 2024 Image Reconstruction Quantization
— Unverified 0