Fuzzy Norm-Explicit Product Quantization for Recommender Systems Dec 8, 2024 Quantization Recommendation Systems
— Unverified 0SizeGS: Size-aware Compression of 3D Gaussians with Hierarchical Mixed Precision Quantization Dec 8, 2024 3DGS Attribute
— Unverified 0Efficient Distributed Training through Gradient Compression with Sparsification and Quantization Techniques Dec 7, 2024 Quantization
— Unverified 0Sensor Selection and Distributed Quantization for Energy Efficiency in Massive MTC Dec 7, 2024 Quantization
— Unverified 0Error Feedback Approach for Quantization Noise Reduction of Distributed Graph Filters Dec 7, 2024 Quantization
— Unverified 0ULMRec: User-centric Large Language Model for Sequential Recommendation Dec 7, 2024 Language Modeling Language Modelling
— Unverified 0Temporally Compressed 3D Gaussian Splatting for Dynamic Scenes Dec 7, 2024 Quantization
Code Code Available 1GAQAT: gradient-adaptive quantization-aware training for domain generalization Dec 7, 2024 Domain Generalization Quantization
— Unverified 0Trimming Down Large Spiking Vision Transformers via Heterogeneous Quantization Search Dec 7, 2024 Model Compression Quantization
— Unverified 0APOLLO: SGD-like Memory, AdamW-level Performance Dec 6, 2024 GPU Quantization
Code Code Available 3SKIM: Any-bit Quantization Pushing The Limits of Post-Training Quantization Dec 5, 2024 Clustering GPU
— Unverified 0Quantized and Interpretable Learning Scheme for Deep Neural Networks in Classification Task Dec 5, 2024 image-classification Image Classification
— Unverified 0QUEEN: QUantized Efficient ENcoding of Dynamic Gaussians for Streaming Free-viewpoint Videos Dec 5, 2024 Attribute Quantization
Code Code Available 2Prompting Large Language Models for Clinical Temporal Relation Extraction Dec 4, 2024 Decoder Quantization
— Unverified 0Evaluating Single Event Upsets in Deep Neural Networks for Semantic Segmentation: an embedded system perspective Dec 4, 2024 Autonomous Driving Quantization
Code Code Available 0FlashAttention on a Napkin: A Diagrammatic Approach to Deep Learning IO-Awareness Dec 4, 2024 GPU Quantization
— Unverified 0Unifying KV Cache Compression for Large Language Models with LeanKV Dec 4, 2024 GPU Quantization
— Unverified 0Mixed-Precision Quantization: Make the Best Use of Bits Where They Matter Most Dec 4, 2024 Quantization
— Unverified 0TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation Dec 4, 2024 Image Generation Image Reconstruction
Code Code Available 3Designing DNNs for a trade-off between robustness and processing performance in embedded devices Dec 4, 2024 Autonomous Driving Quantization
— Unverified 0CPTQuant -- A Novel Mixed Precision Post-Training Quantization Techniques for Large Language Models Dec 3, 2024 Language Modeling Language Modelling
— Unverified 0Robust Precoding for Multi-User Visible Light Communications with Quantized Channel Information Dec 3, 2024 Quantization
— Unverified 03D representation in 512-Byte:Variational tokenizer is the key for autoregressive 3D generation Dec 3, 2024 3D Generation Image Generation
— Unverified 0Lean classical-quantum hybrid neural network model for image classification Dec 3, 2024 Classification Decision Making
— Unverified 0Scaling Image Tokenizers with Grouped Spherical Quantization Dec 3, 2024 Quantization
Code Code Available 0Taming Scalable Visual Tokenizer for Autoregressive Image Generation Dec 3, 2024 Image Generation Image Reconstruction
Code Code Available 4CEGI: Measuring the trade-off between efficiency and carbon emissions for SLMs and VLMs Dec 3, 2024 Image Captioning Quantization
— Unverified 0Optimizing Domain-Specific Image Retrieval: A Benchmark of FAISS and Annoy with Fine-Tuned Features Dec 2, 2024 Image Retrieval Quantization
— Unverified 0Memory-Efficient Training for Deep Speaker Embedding Learning in Speaker Verification Dec 2, 2024 GPU Quantization
— Unverified 0XQ-GAN: An Open-source Image Tokenization Framework for Autoregressive Generation Dec 2, 2024 Image Reconstruction Quantization
Code Code Available 3Reducing Inference Energy Consumption Using Dual Complementary CNNs Dec 2, 2024 Quantization
Code Code Available 0Improving Detail in Pluralistic Image Inpainting with Feature Dequantization Dec 2, 2024 Image Inpainting Quantization
Code Code Available 1RILQ: Rank-Insensitive LoRA-based Quantization Error Compensation for Boosting 2-bit Large Language Model Accuracy Dec 2, 2024 Computational Efficiency Language Modeling
Code Code Available 0Quantization-Aware Imitation-Learning for Resource-Efficient Robotic Control Dec 2, 2024 Autonomous Driving Decision Making
— Unverified 0DFRot: Achieving Outlier-Free and Massive Activation-Free for Rotated LLMs with Refined Rotation Dec 1, 2024 Quantization
Code Code Available 1A Wave is Worth 100 Words: Investigating Cross-Domain Transferability in Time Series Dec 1, 2024 Imputation Quantization
— Unverified 0LAMBDA: Covering the Multimodal Critical Scenarios for Automated Driving Systems by Search Space Quantization Nov 30, 2024 Quantization
— Unverified 0Privacy-Preserving Orthogonal Aggregation for Guaranteeing Gender Fairness in Federated Recommendation Nov 29, 2024 Attribute Fairness
— Unverified 0CogACT: A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation Nov 29, 2024 Quantization Vision-Language-Action
— Unverified 0DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding Nov 29, 2024 Motion Synthesis Quantization
— Unverified 0Scaling Transformers for Low-Bitrate High-Quality Speech Coding Nov 29, 2024 Quantization
Code Code Available 3Quantized Delta Weight Is Safety Keeper Nov 29, 2024 Quantization
— Unverified 0Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Heads Nov 28, 2024 GPU Language Modeling
— Unverified 0On the effectiveness of discrete representations in sparse mixture of experts Nov 28, 2024 Mixture-of-Experts Quantization
— Unverified 0FAMES: Fast Approximate Multiplier Substitution for Mixed-Precision Quantized DNNs--Down to 2 Bits! Nov 27, 2024 Quantization
— Unverified 0COAP: Memory-Efficient Training with Correlation-Aware Gradient Projection Nov 26, 2024 Quantization
— Unverified 0LiteVAR: Compressing Visual Autoregressive Modelling with Efficient Attention and Quantization Nov 26, 2024 Image Generation Quantization
Code Code Available 0Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens Nov 26, 2024 Quantization
— Unverified 0MotionLLaMA: A Unified Framework for Motion Synthesis and Comprehension Nov 26, 2024 Language Modeling Language Modelling
Code Code Available 2Rapid Deployment of Domain-specific Hyperspectral Image Processors with Application to Autonomous Driving Nov 26, 2024 Autonomous Driving Quantization
— Unverified 0