Exploring the Trade-Offs: Quantization Methods, Task Difficulty, and Model Size in Large Language Models From Edge to Giant Sep 17, 2024 Hallucination Instruction Following
Code Code Available 0LASERS: LAtent Space Encoding for Representations with Sparsity for Generative Modeling Sep 16, 2024 Dictionary Learning Quantization
— Unverified 0Practical and Asymptotically Optimal Quantization of High-Dimensional Vectors in Euclidean Space for Approximate Nearest Neighbor Search Sep 16, 2024 Quantization
Code Code Available 2Forearm Ultrasound based Gesture Recognition on Edge Sep 16, 2024 Gesture Recognition Hand Gesture Recognition
— Unverified 0Language Models and Retrieval Augmented Generation for Automated Structured Data Extraction from Diagnostic Reports Sep 15, 2024 Diagnostic Model Selection
— Unverified 0Improving Statistical Significance in Human Evaluation of Automatic Metrics via Soft Pairwise Accuracy Sep 15, 2024 Quantization
— Unverified 0MesonGS: Post-training Compression of 3D Gaussians via Efficient Attribute Transformation Sep 15, 2024 Attribute Novel View Synthesis
— Unverified 0Privacy-Preserving SAM Quantization for Efficient Edge Intelligence in Healthcare Sep 14, 2024 Data Free Quantization Image Segmentation
— Unverified 0Robust Training of Neural Networks at Arbitrary Precision and Sparsity Sep 14, 2024 Denoising Quantization
— Unverified 0S-STE: Continuous Pruning Function for Efficient 2:4 Sparse Pre-training Sep 13, 2024 Quantization
Code Code Available 2Investigating Disentanglement in a Phoneme-level Speech Codec for Prosody Modeling Sep 13, 2024 Decoder Disentanglement
— Unverified 0Dequantization of a signal from two parallel quantized observations Sep 12, 2024 Quantization
— Unverified 0DiTAS: Quantizing Diffusion Transformers via Enhanced Activation Smoothing Sep 12, 2024 Image Generation Quantization
Code Code Available 1Efficient and Reliable Vector Similarity Search Using Asymmetric Encoding with NAND-Flash for Many-Class Few-Shot Learning Sep 12, 2024 Few-Shot Learning Quantization
— Unverified 0Distributed Convolutional Neural Network Training on Mobile and Edge Clusters Sep 11, 2024 object-detection Object Detection
— Unverified 0STORE: Streamlining Semantic Tokenization and Generative Recommendation with A Single LLM Sep 11, 2024 Language Modelling Large Language Model
— Unverified 0Adaptive Error-Bounded Hierarchical Matrices for Efficient Neural Network Compression Sep 11, 2024 Efficient Neural Network Neural Network Compression
— Unverified 0NVRC: Neural Video Representation Compression Sep 11, 2024 Model Compression Quantization
— Unverified 0AgileIR: Memory-Efficient Group Shifted Windows Attention for Agile Image Restoration Sep 10, 2024 Image Restoration Quantization
— Unverified 0Rate-Constrained Quantization for Communication-Efficient Federated Learning Sep 10, 2024 Data Compression Federated Learning
— Unverified 0Distributed Optimization with Finite Bit Adaptive Quantization for Efficient Communication and Precision Enhancement Sep 9, 2024 Distributed Optimization Quantization
— Unverified 0ECG Biometric Authentication Using Self-Supervised Learning for IoT Edge Sensors Sep 9, 2024 Contrastive Learning CPU
— Unverified 0BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec Sep 9, 2024 Quantization
Code Code Available 3SGC-VQGAN: Towards Complex Scene Representation via Semantic Guided Clustering Codebook Sep 9, 2024 Clustering Online Clustering
— Unverified 0Estimating the Completeness of Discrete Speech Units Sep 9, 2024 Disentanglement Quantization
— Unverified 0TriplePlay: Enhancing Federated Learning with CLIP for Non-IID Data and Resource Efficiency Sep 9, 2024 Fairness Federated Learning
— Unverified 0BBS: Bi-directional Bit-level Sparsity for Deep Learning Acceleration Sep 8, 2024 Deep Learning Quantization
Code Code Available 1Blind-Adaptive Quantizers Sep 6, 2024 Quantization
— Unverified 0OPAL: Outlier-Preserved Microscaling Quantization Accelerator for Generative Large Language Models Sep 6, 2024 Decoder Quantization
— Unverified 0Recursive Quantization for L_2 Stabilization of a Finite Capacity Stochastic Control Loop with Intermittent State Observations Sep 5, 2024 Quantization
— Unverified 0WaterMAS: Sharpness-Aware Maximization for Neural Network Watermarking Sep 5, 2024 image-classification Image Classification
— Unverified 0Investigating Privacy Bias in Training Data of Language Models Sep 5, 2024 Quantization
— Unverified 0LAST: Language Model Aware Speech Tokenization Sep 5, 2024 Language Modeling Language Modelling
— Unverified 0Sorbet: A Neuromorphic Hardware-Compatible Transformer-Based Spiking Language Model Sep 4, 2024 Knowledge Distillation Language Modeling
— Unverified 0Learning Task-Based Trainable Neuromorphic ADCs via Power-Aware Distillation Sep 4, 2024 Quantization
— Unverified 0Gaussian Rate-Distortion-Perception Coding and Entropy-Constrained Scalar Quantization Sep 4, 2024 Quantization
— Unverified 0Task-Oriented Communication for Graph Data: A Graph Information Bottleneck Approach Sep 4, 2024 Quantization
— Unverified 0CoAst: Validation-Free Contribution Assessment for Federated Learning based on Cross-Round Valuation Sep 4, 2024 Contribution Assessment Federated Learning
— Unverified 0Optimization and Deployment of Deep Neural Networks for PPG-based Blood Pressure Estimation Targeting Low-power Wearables Sep 3, 2024 Blood pressure estimation Neural Architecture Search
— Unverified 0Designing Large Foundation Models for Efficient Training and Inference: A Survey Sep 3, 2024 Knowledge Distillation Model Compression
Code Code Available 1Foundations of Large Language Model Compression -- Part 1: Weight Quantization Sep 3, 2024 Language Modeling Language Modelling
Code Code Available 0Robust Clustering on High-Dimensional Data with Stochastic Quantization Sep 3, 2024 Clustering Computational Efficiency
Code Code Available 0Compressing VAE-Based Out-of-Distribution Detectors for Embedded Deployment Sep 2, 2024 CPU GPU
— Unverified 0VQ-Flow: Taming Normalizing Flows for Multi-Class Anomaly Detection via Hierarchical Vector Quantization Sep 2, 2024 Anomaly Detection Multi-class Anomaly Detection
Code Code Available 1One-Index Vector Quantization Based Adversarial Attack on Image Classification Sep 2, 2024 Adversarial Attack image-classification
— Unverified 0Edge AI: Evaluation of Model Compression Techniques for Convolutional Neural Networks Sep 2, 2024 Edge-computing image-classification
— Unverified 0Enhancing Multi-Stream Beamforming Through CQIs For 5G NR FDD Massive MIMO Communications: A Tuning-Free Scheme Sep 1, 2024 Quantization
— Unverified 0TinyAgent: Function Calling at the Edge Sep 1, 2024 Language Modelling Quantization
Code Code Available 3Federated Aggregation of Mallows Rankings: A Comparative Analysis of Borda and Lehmer Coding Sep 1, 2024 Privacy Preserving Quantization
— Unverified 0Hyper-Compression: Model Compression via Hyperfunction Sep 1, 2024 model Model Compression
Code Code Available 1