BitsFusion: 1.99 bits Weight Quantization of Diffusion Model Jun 6, 2024 Image Generation model
— Unverified 0FineQ: Software-Hardware Co-Design for Low-Bit Fine-Grained Mixed-Precision Quantization of LLMs Apr 28, 2025 Quantization
— Unverified 0DSConv: Efficient Convolution Operator Oct 1, 2019 Quantization
— Unverified 0Finetuning and Quantization of EEG-Based Foundational BioSignal Models on ECG and PPG Data for Blood Pressure Estimation Feb 10, 2025 Blood pressure estimation EEG
— Unverified 0A New Old Idea: Beam-Steering Reflectarrays for Efficient Sub-THz Multiuser MIMO Nov 30, 2023 3D geometry Quantization
— Unverified 0FinGPT-HPC: Efficient Pretraining and Finetuning Large Language Models for Financial Applications with High-Performance Computing Feb 21, 2024 GPU Model Compression
— Unverified 0Adaptation of MobileNetV2 for Face Detection on Ultra-Low Power Platform Aug 23, 2022 Face Detection Quantization
— Unverified 0RATQ: A Universal Fixed-Length Quantizer for Stochastic Optimization Aug 22, 2019 Quantization Stochastic Optimization
— Unverified 0Generative Diffusion Models for Lattice Field Theory Nov 6, 2023 Quantization
— Unverified 0FinLoRA: Finetuning Quantized Financial Large Language Models Using Low-Rank Adaptation Dec 16, 2024 GPU Information Retrieval
— Unverified 0Compressing Language Models for Specialized Domains Feb 25, 2025 Quantization
— Unverified 0Fisher-aware Quantization for DETR Detectors with Critical-category Objectives Jul 3, 2024 object-detection Object Detection
— Unverified 0Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration Feb 23, 2025 3DGS 3D Semantic Segmentation
— Unverified 0FIXAR: A Fixed-Point Deep Reinforcement Learning Platform with Quantization-Aware Training and Adaptive Parallelism Feb 24, 2021 CPU Deep Reinforcement Learning
— Unverified 0DQSGD: DYNAMIC QUANTIZED STOCHASTIC GRADIENT DESCENT FOR COMMUNICATION-EFFICIENT DISTRIBUTED LEARNING Jan 1, 2021 Quantization
— Unverified 0Fixed-point optimization of deep neural networks with adaptive step size retraining Feb 27, 2017 Quantization
— Unverified 0BitPruning: Learning Bitlengths for Aggressive and Accurate Quantization Feb 8, 2020 Quantization
— Unverified 0Fixed-point quantization aware training for on-device keyword-spotting Mar 4, 2023 Keyword Spotting Quantization
— Unverified 0DQ-SGD: Dynamic Quantization in SGD for Communication-Efficient Distributed Learning Jul 30, 2021 Quantization
— Unverified 0Fixed Point Quantization of Deep Convolutional Networks Nov 19, 2015 Quantization
— Unverified 0Fixflow: A Framework to Evaluate Fixed-point Arithmetic in Light-Weight CNN Inference Feb 19, 2023 Classification Quantization
— Unverified 0FLARE: FP-Less PTQ and Low-ENOB ADC Based AMS-PiM for Error-Resilient, Fast, and Efficient Transformer Acceleration Nov 22, 2024 Quantization
— Unverified 0A New Learning Method for Inference Accuracy, Core Occupation, and Performance Co-optimization on TrueNorth Chip Apr 3, 2016 General Classification Quantization
— Unverified 0FlashAttention on a Napkin: A Diagrammatic Approach to Deep Learning IO-Awareness Dec 4, 2024 GPU Quantization
— Unverified 0Compressing Unknown Images With Product Quantizer for Efficient Zero-Shot Classification Jun 1, 2019 General Classification Generalized Zero-Shot Learning
— Unverified 0FlatENN: Train Flat for Enhanced Fault Tolerance of Quantized Deep Neural Networks Dec 29, 2022 Model Compression Quantization
— Unverified 0DQ-Data2vec: Decoupling Quantization for Multilingual Speech Recognition Jan 23, 2025 Quantization Representation Learning
— Unverified 0Flattened one-bit stochastic gradient descent: compressed distributed optimization with controlled variance May 17, 2024 Distributed Optimization Quantization
— Unverified 0FlattenQuant: Breaking Through the Inference Compute-bound for Large Language Models with Per-tensor Quantization Feb 28, 2024 GPU Quantization
— Unverified 0ASER: Activation Smoothing and Error Reconstruction for Large Language Model Quantization Nov 12, 2024 Language Modeling Language Modelling
— Unverified 0BitNet b1.58 Reloaded: State-of-the-art Performance Also on Smaller Networks Jun 24, 2024 Quantization
— Unverified 0DQA: An Efficient Method for Deep Quantization of Deep Neural Network Activations Dec 12, 2024 image-classification Image Classification
— Unverified 0Flexible Unsupervised Learning for Massive MIMO Subarray Hybrid Beamforming Aug 10, 2022 Quantization
— Unverified 0FleXOR: Trainable Fractional Quantization Sep 9, 2020 Quantization
— Unverified 0FlexQuant: Elastic Quantization Framework for Locally Hosted LLM on Edge Devices Jan 13, 2025 Quantization
— Unverified 0A Short Note on Analyzing Sequence Complexity in Trajectory Prediction Benchmarks Mar 27, 2020 Quantization Trajectory Prediction
— Unverified 0FlightLLM: Efficient Large Language Model Inference with a Complete Mapping Flow on FPGAs Jan 8, 2024 Computational Efficiency GPU
— Unverified 0FLightNNs: Lightweight Quantized Deep Neural Networks for Fast and Accurate Inference Apr 5, 2019 Quantization
— Unverified 0A new heuristic algorithm for fast k-segmentation Sep 2, 2020 Quantization Segmentation
— Unverified 0Compression for Better: A General and Stable Lossless Compression Framework Dec 9, 2024 Computational Efficiency Model Compression
— Unverified 0Compression of Acoustic Event Detection Models with Low-rank Matrix Factorization and Quantization Training May 2, 2019 Event Detection Quantization
— Unverified 0FlowPrecision: Advancing FPGA-Based Real-Time Fluid Flow Estimation with Linear Quantization Mar 4, 2024 Quantization
— Unverified 0Auditing Black-Box LLM APIs with a Rank-Based Uniformity Test Jun 8, 2025 Quantization
— Unverified 0FlowVQTalker: High-Quality Emotional Talking Face Generation through Normalizing Flow and Quantization Mar 11, 2024 Face Generation Quantization
— Unverified 0On the Convergence of Differentially Private Federated Learning on Non-Lipschitz Objectives, and with Normalized Client Updates Jun 13, 2021 Benchmarking Federated Learning
— Unverified 0FoldToken2: Learning compact, invariant and generative protein structure language Jun 11, 2024 Decoder Quantization
— Unverified 0FoldToken: Learning Protein Language via Vector Quantization and Beyond Feb 4, 2024 Quantization
— Unverified 0Foothill: A Quasiconvex Regularization for Edge Computing of Deep Neural Networks Jan 18, 2019 Edge-computing General Classification
— Unverified 0Forearm Ultrasound based Gesture Recognition on Edge Sep 16, 2024 Gesture Recognition Hand Gesture Recognition
— Unverified 0DP-Net: Dynamic Programming Guided Deep Neural Network Compression Mar 21, 2020 Clustering Neural Network Compression
— Unverified 0