Fisher-aware Quantization for DETR Detectors with Critical-category Objectives Jul 3, 2024 object-detection Object Detection
— Unverified 00 FIT: A Metric for Model Sensitivity Oct 16, 2022 model Model Compression
— Unverified 00 FIXAR: A Fixed-Point Deep Reinforcement Learning Platform with Quantization-Aware Training and Adaptive Parallelism Feb 24, 2021 CPU Deep Reinforcement Learning
— Unverified 00 Fixed-Point Back-Propagation Training Jun 1, 2020 CPU image-classification
— Unverified 00 Fixed-point optimization of deep neural networks with adaptive step size retraining Feb 27, 2017 Quantization
— Unverified 00 Fixed-Point Performance Analysis of Recurrent Neural Networks Dec 4, 2015 Language Modeling Language Modelling
— Unverified 00 Fixed-point quantization aware training for on-device keyword-spotting Mar 4, 2023 Keyword Spotting Quantization
— Unverified 00 Fixed Point Quantization of Deep Convolutional Networks Nov 19, 2015 Quantization
— Unverified 00 Fixflow: A Framework to Evaluate Fixed-point Arithmetic in Light-Weight CNN Inference Feb 19, 2023 Classification Quantization
— Unverified 00 FLARE: FP-Less PTQ and Low-ENOB ADC Based AMS-PiM for Error-Resilient, Fast, and Efficient Transformer Acceleration Nov 22, 2024 Quantization
— Unverified 00 FlashAttention on a Napkin: A Diagrammatic Approach to Deep Learning IO-Awareness Dec 4, 2024 GPU Quantization
— Unverified 00 FlatENN: Train Flat for Enhanced Fault Tolerance of Quantized Deep Neural Networks Dec 29, 2022 Model Compression Quantization
— Unverified 00 Flattened one-bit stochastic gradient descent: compressed distributed optimization with controlled variance May 17, 2024 Distributed Optimization Quantization
— Unverified 00 FlattenQuant: Breaking Through the Inference Compute-bound for Large Language Models with Per-tensor Quantization Feb 28, 2024 GPU Quantization
— Unverified 00 Flexible Neural Image Compression via Code Editing Sep 19, 2022 Decoder Image Compression
— Unverified 00 Flexible Unsupervised Learning for Massive MIMO Subarray Hybrid Beamforming Aug 10, 2022 Quantization
— Unverified 00 FleXOR: Trainable Fractional Quantization Sep 9, 2020 Quantization
— Unverified 00 FlexQuant: Elastic Quantization Framework for Locally Hosted LLM on Edge Devices Jan 13, 2025 Quantization
— Unverified 00 FlightLLM: Efficient Large Language Model Inference with a Complete Mapping Flow on FPGAs Jan 8, 2024 Computational Efficiency GPU
— Unverified 00 FLightNNs: Lightweight Quantized Deep Neural Networks for Fast and Accurate Inference Apr 5, 2019 Quantization
— Unverified 00 FLIQS: One-Shot Mixed-Precision Floating-Point and Integer Quantization Search Aug 7, 2023 Quantization
— Unverified 00 FlowPrecision: Advancing FPGA-Based Real-Time Fluid Flow Estimation with Linear Quantization Mar 4, 2024 Quantization
— Unverified 00 FlowVQTalker: High-Quality Emotional Talking Face Generation through Normalizing Flow and Quantization Mar 11, 2024 Face Generation Quantization
— Unverified 00 FoldToken2: Learning compact, invariant and generative protein structure language Jun 11, 2024 Decoder Quantization
— Unverified 00 FoldToken: Learning Protein Language via Vector Quantization and Beyond Feb 4, 2024 Quantization
— Unverified 00 Foothill: A Quasiconvex Regularization for Edge Computing of Deep Neural Networks Jan 18, 2019 Edge-computing General Classification
— Unverified 00 Forearm Ultrasound based Gesture Recognition on Edge Sep 16, 2024 Gesture Recognition Hand Gesture Recognition
— Unverified 00 Formal Uncertainty Propagation for Stochastic Dynamical Systems with Additive Noise May 16, 2025 Quantization Stochastic Optimization
— Unverified 00 Forward Link Analysis for Full-Duplex Cellular Networks with Low Resolution ADC/DAC Mar 7, 2022 Quantization
— Unverified 00 Reinforcement Learning with Foundation Priors: Let the Embodied Agent Efficiently Learn on Its Own Oct 4, 2023 Quantization reinforcement-learning
— Unverified 00 FoVolNet: Fast Volume Rendering using Foveated Deep Neural Networks Sep 20, 2022 Data Visualization Image Reconstruction
— Unverified 00 FP8-BERT: Post-Training Quantization for Transformer Dec 10, 2023 Quantization
— Unverified 00 FP8 versus INT8 for efficient deep learning inference Mar 31, 2023 Deep Learning Quantization
— Unverified 00 FPGA Implementations of Layered MinSum LDPC Decoders Using RCQ Message Passing Apr 19, 2021 Decoder Quantization
— Unverified 00 FPGA Resource-aware Structured Pruning for Real-Time Neural Networks Aug 9, 2023 Classification image-classification
— Unverified 00 FPRaker: A Processing Element For Accelerating Neural Network Training Oct 15, 2020 Quantization
— Unverified 00 FPSAttention: Training-Aware FP8 and Sparsity Co-Design for Fast Video Diffusion Jun 5, 2025 Denoising Quantization
— Unverified 00 FPTQ: Fine-grained Post-Training Quantization for Large Language Models Aug 30, 2023 Quantization
— Unverified 00 FPTQuant: Function-Preserving Transforms for LLM Quantization Jun 5, 2025 Quantization
— Unverified 00 FP=xINT:A Low-Bit Series Expansion Algorithm for Post-Training Quantization Dec 9, 2024 Quantization
— Unverified 00 FQ-Conv: Fully Quantized Convolution for Efficient and Accurate Inference Dec 19, 2019 Quantization
— Unverified 00 Frame Quantization of Neural Networks Apr 11, 2024 Quantization
— Unverified 00 Free Bits: Latency Optimization of Mixed-Precision Quantized Neural Networks on the Edge Jul 6, 2023 Navigate Quantization
— Unverified 00 freePruner: A Training-free Approach for Large Multimodal Model Acceleration Nov 23, 2024 Quantization Question Answering
— Unverified 00 Frequency Autoregressive Image Generation with Continuous Tokens Mar 7, 2025 Image Generation Language Modeling
— Unverified 00 Frequency-Biased Synergistic Design for Image Compression and Compensation Jan 1, 2025 Image Compression Quantization
— Unverified 00 Frequency Disentangled Features in Neural Image Compression Aug 4, 2023 Disentanglement Image Compression
— Unverified 00 From Algorithm to Hardware: A Survey on Efficient and Safe Deployment of Deep Neural Networks May 9, 2024 Knowledge Distillation Model Compression
— Unverified 00 From Hard to Soft: Understanding Deep Network Nonlinearities via Vector Quantization and Statistical Inference Oct 22, 2018 Quantization
— Unverified 00 From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs Apr 18, 2025 Knowledge Distillation Model Compression
— Unverified 00