Privacy-Preserving UCB Decision Process Verification via zk-SNARKs | Apr 18, 2024 | Decision Making, Privacy Preserving | Unverified 0
LongVQ: Long Sequence Modeling with Vector Quantization on Structured Memory | Apr 17, 2024 | Computational Efficiency, Language Modeling | Unverified 0
Variational quantization for state space models | Apr 17, 2024 | Quantization, State Space Models | Code Available 0
QGen: On the Ability to Generalize in Quantization Aware Training | Apr 17, 2024 | Quantization | Unverified 0
Neural Network Approach for Non-Markovian Dissipative Dynamics of Many-Body Open Quantum Systems | Apr 17, 2024 | Benchmarking, Quantization | Unverified 0
Comprehensive Survey of Model Compression and Speed up for Vision Transformers | Apr 16, 2024 | Computational Efficiency, Edge-computing | Unverified 0
Tripod: Three Complementary Inductive Biases for Disentangled Representation Learning | Apr 16, 2024 | Data Compression, Decoder | Code Available 1
SQUAT: Stateful Quantization-Aware Training in Recurrent Spiking Neural Networks | Apr 15, 2024 | Quantization | Code Available 5
Quantization of Large Language Models with an Overdetermined Basis | Apr 15, 2024 | Data Compression, Quantization | Unverified 0
Efficient and accurate neural field reconstruction using resistive memory | Apr 15, 2024 | CPU, Novel View Synthesis | Unverified 0
TMPQ-DM: Joint Timestep Reduction and Quantization Precision Selection for Efficient Diffusion Models | Apr 15, 2024 | Denoising, Model Optimization | Unverified 0
SNN4Agents: A Framework for Developing Energy-Efficient Embodied Spiking Neural Networks for Autonomous Agents | Apr 14, 2024 | Quantization | Code Available 0
Bullion: A Column Store for Machine Learning | Apr 13, 2024 | Quantization, Recommendation Systems | Unverified 0
Lossy Image Compression with Foundation Diffusion Models | Apr 12, 2024 | Denoising, Image Compression | Unverified 0
Full-Duplex Beyond Self-Interference: The Unlimited Sensing Way | Apr 12, 2024 | Quantization | Unverified 0
Edge-Efficient Deep Learning Models for Automatic Modulation Classification: A Performance Analysis | Apr 11, 2024 | Knowledge Distillation, Model Optimization | Unverified 0
1-bit Quantized On-chip Hybrid Diffraction Neural Network Enabled by Authentic All-optical Fully-connected Architecture | Apr 11, 2024 | All, Lesion Detection | Unverified 0
Frame Quantization of Neural Networks | Apr 11, 2024 | Quantization | Unverified 0
Differentiable Search for Finding Optimal Quantization Strategy | Apr 10, 2024 | image-classification, Image Classification | Unverified 0
CQIL: Inference Latency Optimization with Concurrent Computation of Quasi-Independent Layers | Apr 10, 2024 | Quantization | Code Available 0
Adapting LLaMA Decoder to Vision Transformer | Apr 10, 2024 | Computational Efficiency, Decoder | Code Available 1
End-to-End Rate-Distortion Optimized 3D Gaussian Representation | Apr 9, 2024 | 3DGS, Quantization | Code Available 1
Encoder-Quantization-Motion-based Video Quality Metrics | Apr 9, 2024 | Quantization, Video Compression | Unverified 0
AiSAQ: All-in-Storage ANNS with Product Quantization for DRAM-free Information Retrieval | Apr 9, 2024 | All, Information Retrieval | Code Available 2
Collaborative Edge AI Inference over Cloud-RAN | Apr 9, 2024 | Quantization | Unverified 0
Exploring Quantization and Mapping Synergy in Hardware-Aware Deep Neural Network Accelerators | Apr 8, 2024 | Quantization, Scheduling | Code Available 0
Have You Merged My Model? On The Robustness of Large Language Model IP Protection Methods Against Model Merging | Apr 8, 2024 | Language Modeling, Language Modelling | Code Available 1
BinaryDM: Accurate Weight Binarization for Efficient Diffusion Models | Apr 8, 2024 | Binarization, Quantization | Code Available 1
Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws | Apr 8, 2024 | Quantization | Unverified 0
Investigating the Impact of Quantization on Adversarial Robustness | Apr 8, 2024 | Adversarial Robustness, Quantization | Unverified 0
David and Goliath: An Empirical Evaluation of Attacks and Defenses for QNNs at the Deep Edge | Apr 8, 2024 | Edge-computing, Quantization | Code Available 0
Nanometer Scanning with Micrometer Sensing: Beating Quantization Constraints in Lissajous Trajectory Tracking | Apr 7, 2024 | Quantization | Unverified 0
Gull: A Generative Multifunctional Audio Codec | Apr 7, 2024 | Audio Compression, Audio Source Separation | Unverified 0
Weakly Supervised Deep Hyperspherical Quantization for Image Retrieval | Apr 7, 2024 | Image Retrieval, Quantization | Code Available 0
What Happens When Small Is Made Smaller? Exploring the Impact of Compression on Small Data Pretrained Language Models | Apr 6, 2024 | Knowledge Distillation, Language Modeling | Unverified 0
Fine-Tuning, Quantization, and LLMs: Navigating Unintended Outcomes | Apr 5, 2024 | Quantization | Unverified 0
Outlier-Efficient Hopfield Layers for Large Transformer-Based Models | Apr 4, 2024 | Benchmarking, Quantization | Code Available 1
Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization | Apr 4, 2024 | GPU, Language Modeling | Code Available 0
TinyVQA: Compact Multimodal Deep Neural Network for Visual Question Answering on Resource-Constrained Devices | Apr 4, 2024 | Quantization, Question Answering | Unverified 0
AdaBM: On-the-Fly Adaptive Bit Mapping for Image Super-Resolution | Apr 4, 2024 | Image Super-Resolution, Quantization | Code Available 2
DI-Retinex: Digital-Imaging Retinex Theory for Low-Light Image Enhancement | Apr 4, 2024 | Image Enhancement, Low-Light Image Enhancement | Unverified 0
CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech | Apr 3, 2024 | Language Modeling, Language Modelling | Unverified 0
Cherry on Top: Parameter Heterogeneity and Quantization in Large Language Models | Apr 3, 2024 | Quantization | Unverified 0
Efficient Multi-Vector Dense Retrieval Using Bit Vectors | Apr 3, 2024 | Quantization, Retrieval | Code Available 2
DNN Memory Footprint Reduction via Post-Training Intra-Layer Multi-Precision Quantization | Apr 3, 2024 | Edge-computing, Quantization | Unverified 0
PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models | Apr 3, 2024 | GSM8K, Quantization | Code Available 3
NeRFCodec: Neural Feature Compression Meets Neural Radiance Fields for Memory-Efficient Scene Representation | Apr 2, 2024 | Decoder, Feature Compression | Unverified 0
On the Effect of Quantization on Dynamic Mode Decomposition | Apr 2, 2024 | Quantization | Unverified 0
RefQSR: Reference-based Quantization for Image Super-Resolution Networks | Apr 2, 2024 | Image Super-Resolution, Quantization | Unverified 0
Minimize Quantization Output Error with Bias Compensation | Apr 2, 2024 | Quantization | Code Available 0