Fast Point Cloud Geometry Compression with Context-based Residual Coding and INR-based Refinement | Aug 6, 2024 | Quantization | Code Available | 0
Synaptic Modulation using Interspike Intervals Increases Energy Efficiency of Spiking Neural Networks | Aug 6, 2024 | Quantization | Unverified | 0
Self-Supervised Learning for Multi-Channel Neural Transducer | Aug 6, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
DopQ-ViT: Towards Distribution-Friendly and Outlier-Aware Post-Training Quantization for Vision Transformers | Aug 6, 2024 | Model Compression, Quantization | Unverified | 0
Winning Amazon KDD Cup'24 | Aug 5, 2024 | Data Augmentation, Multiple-choice | Unverified | 0
HQOD: Harmonious Quantization for Object Detection | Aug 5, 2024 | Object, object-detection | Code Available | 0
Nonlinear Perturbation-based Non-Convex Optimization over Time-Varying Networks | Aug 5, 2024 | Quantization | Unverified | 0
An approach to optimize inference of the DIART speaker diarization pipeline | Aug 5, 2024 | Inference Optimization, Knowledge Distillation | Unverified | 0
STBLLM: Breaking the 1-Bit Barrier with Structured Binary LLMs | Aug 3, 2024 | Binarization, Computational Efficiency | Unverified | 0
HMDN: Hierarchical Multi-Distribution Network for Click-Through Rate Prediction | Aug 2, 2024 | Click-Through Rate Prediction, Mixture-of-Experts | Unverified | 0
UniMoT: Unified Molecule-Text Language Model with Discrete Token Representation | Aug 1, 2024 | Language Modeling, Language Modelling | Unverified | 0
Reclaiming Residual Knowledge: A Novel Paradigm to Low-Bit Quantization | Aug 1, 2024 | Quantization | Unverified | 0
CDFGNN: a Systematic Design of Cache-based Distributed Full-Batch Graph Neural Network Training with Communication Reduction | Aug 1, 2024 | Graph Neural Network, Quantization | Unverified | 0
Exploiting Change Blindness for Video Coding: Perspectives from a Less Promising User Study | Jul 31, 2024 | Computational Efficiency, Quantization | Unverified | 0
A Simple Low-bit Quantization Framework for Video Snapshot Compressive Imaging | Jul 31, 2024 | Quantization, Video Reconstruction | Code Available | 0
On the Perturbed States for Transformed Input-robust Reinforcement Learning | Jul 31, 2024 | Denoising, MuJoCo | Code Available | 0
Breaking the Hourglass Phenomenon of Residual Quantization: Enhancing the Upper Bound of Generative Retrieval | Jul 31, 2024 | Quantization, Recommendation Systems | Unverified | 0
Abstractive summarization from Audio Transcription | Jul 30, 2024 | Abstractive Text Summarization, Quantization | Unverified | 0
Integer-Valued Training and Spike-Driven Inference Spiking Neural Network for High-performance and Energy-efficient Object Detection | Jul 30, 2024 | object-detection, Object Detection | Code Available | 3
ThinK: Thinner Key Cache by Query-Driven Pruning | Jul 30, 2024 | GPU, Quantization | Unverified | 0
Palu: Compressing KV-Cache with Low-Rank Projection | Jul 30, 2024 | GPU, Quantization | Code Available | 2
Pruning Large Language Models with Semi-Structural Adaptive Sparse Training | Jul 30, 2024 | GPU, Knowledge Distillation | Code Available | 1
MimiQ: Low-Bit Data-Free Quantization of Vision Transformers with Encouraging Inter-Head Attention Similarity | Jul 29, 2024 | Data Free Quantization, Quantization | Unverified | 0
Model Agnostic Hybrid Sharding For Heterogeneous Distributed Inference | Jul 29, 2024 | Quantization | Unverified | 0
Temporal Feature Matters: A Framework for Diffusion Model Quantization | Jul 28, 2024 | Denoising, Image Generation | Code Available | 2
Reputation-Driven Asynchronous Federated Learning for Enhanced Trajectory Prediction with Blockchain | Jul 28, 2024 | Autonomous Driving, Deep Reinforcement Learning | Unverified | 0
The Interpretability of Codebooks in Model-Based Reinforcement Learning is Limited | Jul 28, 2024 | Deep Reinforcement Learning, Disentanglement | Unverified | 0
Mixed Non-linear Quantization for Vision Transformers | Jul 26, 2024 | Quantization | Code Available | 0
Quasar-ViT: Hardware-Oriented Quantization-Aware Architecture Search for Vision Transformers | Jul 25, 2024 | Quantization | Unverified | 0
Unlocking Tokens as Data Points for Generalization Bounds on Larger Language Models | Jul 25, 2024 | Generalization Bounds, Quantization | Unverified | 0
Accurate and Efficient Fine-Tuning of Quantized Large Language Models Through Optimal Balance | Jul 24, 2024 | Quantization | Code Available | 0
Low dimensional representation of multi-patient flow cytometry datasets using optimal transport for minimal residual disease detection in leukemia | Jul 24, 2024 | Dimensionality Reduction, Prognosis | Code Available | 0
Pixel Embedding: Fully Quantized Convolutional Neural Network with Differentiable Lookup Table | Jul 23, 2024 | Quantization | Unverified | 0
Comprehensive Study on Performance Evaluation and Optimization of Model Compression: Bridging Traditional Deep Learning and Large Language Models | Jul 22, 2024 | Deep Learning, image-classification | Unverified | 0
Compensate Quantization Errors+: Quantized Models Are Inquisitive Learners | Jul 22, 2024 | Lightweight Deployment, Quantization | Unverified | 0
Differentiable Product Quantization for Memory Efficient Camera Relocalization | Jul 22, 2024 | Camera Relocalization, Quantization | Code Available | 0
Uplink Transmit Power Optimization for Distributed Massive MIMO Systems with 1-Bit ADCs | Jul 22, 2024 | Quantization | Unverified | 0
Power Measurement Enabled Channel Autocorrelation Matrix Estimation for IRS-Assisted Wireless Communication | Jul 20, 2024 | Quantization | Unverified | 0
MetaAug: Meta-Data Augmentation for Post-Training Quantization | Jul 20, 2024 | Data Augmentation, Meta-Learning | Code Available | 0
FedDM: Enhancing Communication Efficiency and Handling Data Heterogeneity in Federated Diffusion Models | Jul 20, 2024 | Quantization | Unverified | 0
A Benchmark for Gaussian Splatting Compression and Quality Assessment Study | Jul 19, 2024 | Attribute, Data Compression | Code Available | 1
Mixed-precision Neural Networks on RISC-V Cores: ISA extensions for Multi-Pumped Soft SIMD Operations | Jul 19, 2024 | CPU, Quantization | Code Available | 1
Mixture of Experts with Mixture of Precisions for Tuning Quality of Service | Jul 19, 2024 | CPU, GPU | Unverified | 0
Asymptotically Optimal Closed-Form Phase Configuration of 1-bit RISs via Sign Alignment | Jul 18, 2024 | Form, Quantization | Unverified | 0
LiNR: Model Based Neural Retrieval on GPUs at LinkedIn | Jul 18, 2024 | Attribute, GPU | Unverified | 0
MCU-MixQ: A HW/SW Co-optimized Mixed-precision Neural Network Design Framework for MCUs | Jul 17, 2024 | Neural Architecture Search, Quantization | Unverified | 0
SmartQuant: CXL-based AI Model Store in Support of Runtime Configurable Weight Quantization | Jul 17, 2024 | GPU, Quantization | Unverified | 0
AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer | Jul 17, 2024 | Instance Segmentation, object-detection | Code Available | 1
Spectra: Surprising Effectiveness of Pretraining Ternary Language Models at Scale | Jul 17, 2024 | GPU, LAMBADA | Code Available | 2
Toward INT4 Fixed-Point Training via Exploring Quantization Error for Gradients | Jul 17, 2024 | image-classification, Image Classification | Unverified | 0