SOTAVerified

Quantization

Quantization is a promising technique to reduce the computation cost of neural network training, which can replace high-cost floating-point numbers (e.g., float32) with low-cost fixed-point numbers (e.g., int8/int16).

Source: Adaptive Precision Training: Quantify Back Propagation in Neural Networks with Fixed-point Numbers

Papers

Showing 24512500 of 4925 papers

TitleStatusHype
Quantization in Relative Gradient Angle Domain For Building Polygon Estimation0
Quantization Loss Re-Learning Method0
Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning0
Quantization Mimic: Towards Very Tiny CNN for Object Detection0
Quantization of Acoustic Model Parameters in Automatic Speech Recognition Framework0
Quantifying Climate Change Impacts on Renewable Energy Generation: A Super-Resolution Recurrent Diffusion Model0
Quantization of Deep Neural Networks for Accumulator-constrained Processors0
Quantization of Deep Neural Networks for Accurate Edge Computing0
Quantization of Deep Neural Networks to facilitate self-correction of weights on Phase Change Memory-based analog hardware0
Quantization of Fully Convolutional Networks for Accurate Biomedical Image Segmentation0
Quantization of Generative Adversarial Networks for Efficient Inference: a Methodological Study0
Quantization of Large Language Models with an Overdetermined Basis0
Quantization optimized with respect to the Haar basis0
Quantization Robust Federated Learning for Efficient Inference on Heterogeneous Devices0
Quantization Under the Real-world Measure: Fast and Accurate Valuation of Long-dated Contracts0
Quantized Adam with Error Feedback0
Quantized Adaptive Subgradient Algorithms and Their Applications0
Quantized and Asynchronous Federated Learning0
Quantized and Interpretable Learning Scheme for Deep Neural Networks in Classification Task0
Quantized Approximately Orthogonal Recurrent Neural Networks0
Quantized Compressive K-Means0
Quantized Consensus under Data-Rate Constraints and DoS Attacks: A Zooming-In and Holding Approach0
Quantized Context Based LIF Neurons for Recurrent Spiking Neural Networks in 45nm0
Quantized control of non-Lipschitz nonlinear systems: a novel control framework with prescribed transient performance and lower design complexity0
Quantized Convolutional Neural Networks Through the Lens of Partial Differential Equations0
Quantized Decoder in Learned Image Compression for Deterministic Reconstruction0
Quantized Deep Path-following Control on a Microcontroller0
Quantized Delta Weight Is Safety Keeper0
Quantized Dissipative Uncertain Model for Fractional T_S Fuzzy systems with Time_Varying Delays Under Networked Control System0
Quantized distributed Nash equilibrium seeking under DoS attacks0
Quantized Distributed Training of Large Models with Convergence Guarantees0
Quantized Embedding Vectors for Controllable Diffusion Language Models0
Quantized Epoch-SGD for Communication-Efficient Distributed Learning0
Quantized Feature Distillation for Network Quantization0
Quantized Federated Learning under Transmission Delay and Outage Constraints0
Quantized Frank-Wolfe: Faster Optimization, Lower Communication, and Projection Free0
Quantized Guided Pruning for Efficient Hardware Implementations of Convolutional Neural Networks0
A Hierarchical Federated Learning Approach for the Internet of Things0
Quantized Kernel Learning for Feature Matching0
Quantized Low-Rank Multivariate Regression with Random Dithering0
Quantized Memory-Augmented Neural Networks0
Quantized Minimum Error Entropy Criterion0
Quantized neural network design under weight capacity constraint0
Quantized neural network for complex hologram generation0
Quantized Neural Network Inference with Precision Batching0
Quantized Neural Networks: Characterization and Holistic Optimization0
Quantized Neural Networks for Low-Precision Accumulation with Guaranteed Overflow Avoidance0
Quantized Neural Networks for Radar Interference Mitigation0
Quantized Nonparametric Estimation over Sobolev Ellipsoids0
Quantized Precoding and RIS-Assisted Modulation for Integrated Sensing and Communications Systems0
Show:102550
← PrevPage 50 of 99Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1FQ-ViT (ViT-L)Top-1 Accuracy (%)85.03Unverified
2FQ-ViT (ViT-B)Top-1 Accuracy (%)83.31Unverified
3FQ-ViT (Swin-B)Top-1 Accuracy (%)82.97Unverified
4FQ-ViT (Swin-S)Top-1 Accuracy (%)82.71Unverified
5FQ-ViT (DeiT-B)Top-1 Accuracy (%)81.2Unverified
6FQ-ViT (Swin-T)Top-1 Accuracy (%)80.51Unverified
7FQ-ViT (DeiT-S)Top-1 Accuracy (%)79.17Unverified
8Xception W8A8Top-1 Accuracy (%)78.97Unverified
9ADLIK-MO-ResNet50-W4A4Top-1 Accuracy (%)77.88Unverified
10ADLIK-MO-ResNet50-W3A4Top-1 Accuracy (%)77.34Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_3MAP160,327.04Unverified
2DTQMAP0.79Unverified
#ModelMetricClaimedVerifiedStatus
1OutEffHop-Bert_basePerplexity6.3Unverified
2OutEffHop-Bert_basePerplexity6.21Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy98.13Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy92.92Unverified
#ModelMetricClaimedVerifiedStatus
1SSD ResNet50 V1 FPN 640x640MAP34.3Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-495.13Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-496.38Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_5All84,809,664Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy99.8Unverified