ProFe: Communication-Efficient Decentralized Federated Learning via Distillation and Prototypes | Dec 15, 2024 | Federated Learning, Knowledge Distillation | Unverified
Nanoscaling Floating-Point (NxFP): NanoMantissa, Adaptive Microexponents, and Code Recycling for Direct-Cast Compression of Large Language Models | Dec 15, 2024 | MMLU, Quantization | Unverified
Progressive Compression with Universally Quantized Diffusion Models | Dec 14, 2024 | Image Compression, Image Generation | Unverified
Adaptive Quantization Resolution and Power Control for Federated Learning over Cell-free Networks | Dec 14, 2024 | Federated Learning, Quantization | Unverified
TinySubNets: An efficient and low capacity continual learning strategy | Dec 14, 2024 | Continual Learning, Quantization | Code available
Enhancing Off-Grid One-Bit DOA Estimation with Learning-Based Sparse Bayesian Approach for Non-Uniform Sparse Array | Dec 14, 2024 | Computational Efficiency, Quantization | Unverified
Memory-Efficient 4-bit Preconditioned Stochastic Optimization | Dec 14, 2024 | Quantization, Stochastic Optimization | Unverified
Efficient Generative Modeling with Residual Vector Quantization-Based Tokens | Dec 13, 2024 | Conditional Image Generation, Image Generation | Unverified
MVQ: Towards Efficient DNN Compression and Acceleration with Masked Vector Quantization | Dec 13, 2024 | image-classification, Image Classification | Unverified
TTAQ: Towards Stable Post-training Quantization in Continuous Domain Adaptation | Dec 13, 2024 | Domain Adaptation, Quantization | Unverified
VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization | Dec 13, 2024 | Face Generation, Motion Generation | Unverified
Panacea: Novel DNN Accelerator using Accuracy-Preserving Asymmetric Quantization and Energy-Saving Bit-Slice Sparsity | Dec 13, 2024 | Quantization | Unverified
On Round-Off Errors and Gaussian Blur in Superresolution and in Image Registration | Dec 12, 2024 | Image Registration, Quantization | Unverified
DQA: An Efficient Method for Deep Quantization of Deep Neural Network Activations | Dec 12, 2024 | image-classification, Image Classification | Unverified
Optimising TinyML with Quantization and Distillation of Transformer and Mamba Models for Indoor Localisation on Edge Devices | Dec 12, 2024 | Knowledge Distillation, Mamba | Unverified
CRVQ: Channel-relaxed Vector Quantization for Extreme Compression of LLMs | Dec 12, 2024 | Quantization | Unverified
Breaking the Bias: Recalibrating the Attention of Industrial Anomaly Detection | Dec 11, 2024 | Anomaly Detection, Computational Efficiency | Unverified
TurboAttention: Efficient Attention Approximation For High Throughputs LLMs | Dec 11, 2024 | Computational Efficiency, Language Modeling | Unverified
Low-Rank Correction for Quantized LLMs | Dec 10, 2024 | Model Compression, Quantization | Unverified
QuantFormer: Learning to Quantize for Neural Activity Forecasting in Mouse Visual Cortex | Dec 10, 2024 | Quantization | Unverified
Post-Training Non-Uniform Quantization for Convolutional Neural Networks | Dec 10, 2024 | image-classification, Image Classification | Unverified
Machine learning-driven conservative-to-primitive conversion in hybrid piecewise polytropic and tabulated equations of state | Dec 10, 2024 | CPU, GPU | Unverified
Compression for Better: A General and Stable Lossless Compression Framework | Dec 9, 2024 | Computational Efficiency, Model Compression | Unverified
Efficiency Meets Fidelity: A Novel Quantization Framework for Stable Diffusion | Dec 9, 2024 | Denoising, Image Generation | Unverified
FP=xINT: A Low-Bit Series Expansion Algorithm for Post-Training Quantization | Dec 9, 2024 | Quantization | Unverified
Federated Split Learning with Model Pruning and Gradient Quantization in Wireless Networks | Dec 9, 2024 | Federated Learning, Quantization | Unverified
Fuzzy Norm-Explicit Product Quantization for Recommender Systems | Dec 8, 2024 | Quantization, Recommendation Systems | Unverified
Vision Transformer-based Semantic Communications With Importance-Aware Quantization | Dec 8, 2024 | image-classification, Image Classification | Unverified
SizeGS: Size-aware Compression of 3D Gaussians with Hierarchical Mixed Precision Quantization | Dec 8, 2024 | 3DGS, Attribute | Unverified
Taming Sensitive Weights: Noise Perturbation Fine-tuning for Robust LLM Quantization | Dec 8, 2024 | Quantization | Unverified
Error Feedback Approach for Quantization Noise Reduction of Distributed Graph Filters | Dec 7, 2024 | Quantization | Unverified
Sensor Selection and Distributed Quantization for Energy Efficiency in Massive MTC | Dec 7, 2024 | Quantization | Unverified
GAQAT: gradient-adaptive quantization-aware training for domain generalization | Dec 7, 2024 | Domain Generalization, Quantization | Unverified
Efficient Distributed Training through Gradient Compression with Sparsification and Quantization Techniques | Dec 7, 2024 | Quantization | Unverified
Trimming Down Large Spiking Vision Transformers via Heterogeneous Quantization Search | Dec 7, 2024 | Model Compression, Quantization | Unverified
ULMRec: User-centric Large Language Model for Sequential Recommendation | Dec 7, 2024 | Language Modeling, Language Modelling | Unverified
SKIM: Any-bit Quantization Pushing The Limits of Post-Training Quantization | Dec 5, 2024 | Clustering, GPU | Unverified
Quantized and Interpretable Learning Scheme for Deep Neural Networks in Classification Task | Dec 5, 2024 | image-classification, Image Classification | Unverified
Unifying KV Cache Compression for Large Language Models with LeanKV | Dec 4, 2024 | GPU, Quantization | Unverified
FlashAttention on a Napkin: A Diagrammatic Approach to Deep Learning IO-Awareness | Dec 4, 2024 | GPU, Quantization | Unverified
Prompting Large Language Models for Clinical Temporal Relation Extraction | Dec 4, 2024 | Decoder, Quantization | Unverified
Designing DNNs for a trade-off between robustness and processing performance in embedded devices | Dec 4, 2024 | Autonomous Driving, Quantization | Unverified
Evaluating Single Event Upsets in Deep Neural Networks for Semantic Segmentation: an embedded system perspective | Dec 4, 2024 | Autonomous Driving, Quantization | Code available
Mixed-Precision Quantization: Make the Best Use of Bits Where They Matter Most | Dec 4, 2024 | Quantization | Unverified
CPTQuant -- A Novel Mixed Precision Post-Training Quantization Techniques for Large Language Models | Dec 3, 2024 | Language Modeling, Language Modelling | Unverified
3D representation in 512-Byte: Variational tokenizer is the key for autoregressive 3D generation | Dec 3, 2024 | 3D Generation, Image Generation | Unverified
CEGI: Measuring the trade-off between efficiency and carbon emissions for SLMs and VLMs | Dec 3, 2024 | Image Captioning, Quantization | Unverified
Robust Precoding for Multi-User Visible Light Communications with Quantized Channel Information | Dec 3, 2024 | Quantization | Unverified
Scaling Image Tokenizers with Grouped Spherical Quantization | Dec 3, 2024 | Quantization | Code available
Lean classical-quantum hybrid neural network model for image classification | Dec 3, 2024 | Classification, Decision Making | Unverified