What Does a One-Bit Quanta Image Sensor Offer? Aug 19, 2022 Quantization
— Unverified 00 What Happens When Small Is Made Smaller? Exploring the Impact of Compression on Small Data Pretrained Language Models Apr 6, 2024 Knowledge Distillation Language Modeling
— Unverified 00 An Evaluation of Memory Optimization Methods for Training Neural Networks Mar 26, 2023 Quantization
— Unverified 00 What Makes Quantization for Large Language Models Hard? An Empirical Study from the Lens of Perturbation Mar 11, 2024 Computational Efficiency Quantization
— Unverified 00 When are 1.58 bits enough? A Bottom-up Exploration of BitNet Quantization Nov 8, 2024 Decoder Quantization
— Unverified 00 When Bio-Inspired Computing meets Deep Learning: Low-Latency, Accurate, & Energy-Efficient Spiking Neural Networks from Artificial Neural Networks Dec 12, 2023 Quantization
— Unverified 00 When Compression Meets Model Compression: Memory-Efficient Double Compression for Large Language Models Feb 21, 2025 Model Compression Quantization
— Unverified 00 When Reasoning Meets Compression: Benchmarking Compressed Large Reasoning Models on Complex Reasoning Tasks Apr 2, 2025 Benchmarking Language Modeling
— Unverified 00 Where Should We Begin? A Low-Level Exploration of Weight Initialization Impact on Quantized Behaviour of Deep Neural Networks Nov 30, 2020 Quantization
— Unverified 00 Which Space Partitioning Tree to Use for Search? Dec 1, 2013 Quantization
— Unverified 00 DQ-Whisper: Joint Distillation and Quantization for Efficient Multilingual Speech Recognition May 18, 2023 Knowledge Distillation Quantization
— Unverified 00 Why Quantization Improves Generalization: NTK of Binary Weight Neural Networks Jun 13, 2022 Quantization
— Unverified 00 Wide Flat Minimum Watermarking for Robust Ownership Verification of GANs Oct 25, 2023 Quantization
— Unverified 00 Widening and Squeezing: Towards Accurate and Efficient QNNs Feb 3, 2020 Quantization
— Unverified 00 Winning Amazon KDD Cup'24 Aug 5, 2024 Data Augmentation Multiple-choice
— Unverified 00 Wireless End-to-End Image Transmission System using Semantic Communications Feb 27, 2023 Decoder Quantization
— Unverified 00 Wireless Quantized Federated Learning: A Joint Computation and Communication Design Mar 11, 2022 Federated Learning Quantization
— Unverified 00 Within-basket Recommendation via Neural Pattern Associator Jan 25, 2024 Quantization
— Unverified 00 Within the Dynamic Context: Inertia-aware 3D Human Modeling with Pose Sequence Mar 28, 2024 Neural Rendering Quantization
— Unverified 00 Witten-type topological field theory of self-organized criticality for stochastic neural networks Jun 21, 2021 Quantization
— Unverified 00 WKVQuant: Quantizing Weight and Key/Value Cache for Large Language Models Gains More Feb 19, 2024 Quantization Text Generation
— Unverified 00 Word-based Domain Adaptation for Neural Machine Translation Jun 7, 2019 Domain Adaptation Language Modeling
— Unverified 00 Work in Progress: Linear Transformers for TinyML Mar 25, 2024 Keyword Spotting Keyword Spotting on Google Speech Commands
— Unverified 00 WrapNet: Neural Net Inference with Ultra-Low-Resolution Arithmetic Jul 26, 2020 Quantization
— Unverified 00 WrapNet: Neural Net Inference with Ultra-Low-Precision Arithmetic Jan 1, 2021 Quantization
— Unverified 00 WRPN: Training and Inference using Wide Reduced-Precision Networks Apr 10, 2017 Quantization
— Unverified 00 WSMN: An optimized multipurpose blind watermarking in Shearlet domain using MLP and NSGA-II May 7, 2020 Quantization SSIM
— Unverified 00 WSNet: Compact and Efficient Networks Through Weight Sampling Nov 28, 2017 Audio Classification General Classification
— Unverified 00 WSNet: Learning Compact and Efficient Networks with Weight Sampling Jan 1, 2018 Audio Classification General Classification
— Unverified 00 Wyner-Ziv Gradient Compression for Federated Learning Nov 16, 2021 Federated Learning Quantization
— Unverified 00 XCAT -- Lightweight Quantized Single Image Super-Resolution using Heterogeneous Group Convolutions and Cross Concatenation Aug 31, 2022 Data Augmentation GPU
— Unverified 00 XNORBIN: A 95 TOp/s/W Hardware Accelerator for Binary Convolutional Neural Networks Mar 5, 2018 Quantization
— Unverified 00 XNOR-Net++: Improved Binary Neural Networks Sep 30, 2019 Binarization Classification with Binary Neural Network
— Unverified 00 YONO: Modeling Multiple Heterogeneous Neural Networks on Microcontrollers Mar 8, 2022 Multi-Task Learning Quantization
— Unverified 00 You Never Know: Quantization Induces Inconsistent Biases in Vision-Language Foundation Models Oct 26, 2024 Quantization
— Unverified 00 YUVMultiNet: Real-time YUV multi-task CNN for autonomous driving Apr 11, 2019 Autonomous Driving Quantization
— Unverified 00 Consistent Signal Reconstruction from Streaming Multivariate Time Series Aug 23, 2023 Quantization Time Series
— Unverified 00 Zero-Delay Gaussian Joint Source-Channel Coding for the Interference Channel Jan 24, 2018 Quantization
— Unverified 00 FDC: Fast KV Dimensionality Compression for Efficient LLM Inference Aug 7, 2024 Quantization
— Unverified 00 ZeRO++: Extremely Efficient Collective Communication for Giant Model Training Jun 16, 2023 GPU Quantization
— Unverified 00 ZeroQuant-FP: A Leap Forward in LLMs Post-Training W4A8 Quantization Using Floating-Point Formats Jul 19, 2023 Computational Efficiency Quantization
— Unverified 00 ZeroQuant-HERO: Hardware-Enhanced Robust Optimized Post-Training Quantization Framework for W8A8 Transformers Oct 26, 2023 Quantization
— Unverified 00 Zero-shot Adversarial Quantization Mar 29, 2021 Data Free Quantization Quantization
— Unverified 00 Zero-Shot Learning of a Conditional Generative Adversarial Network for Data-Free Network Quantization Oct 26, 2022 Data Free Quantization Generative Adversarial Network
— Unverified 00 Zero-shot Quantization: A Comprehensive Survey May 14, 2025 Quantization Survey
— Unverified 00 Zero-Shot Sharpness-Aware Quantization for Pre-trained Language Models Oct 20, 2023 Language Modeling Language Modelling
— Unverified 00 Zeroth-Order Fine-Tuning of LLMs with Extreme Sparsity Jun 5, 2024 GPU Quantization
— Unverified 00 ZipML: Training Linear Models with End-to-End Low Precision, and a Little Bit of Deep Learning Aug 1, 2017 Quantization
— Unverified 00 ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification Oct 11, 2024 MME Quantization
— Unverified 00 ZOBNN: Zero-Overhead Dependable Design of Binary Neural Networks with Deliberately Quantized Parameters Jul 6, 2024 Attribute Quantization
— Unverified 00