SOTAVerified

Quantization

Quantization is a promising technique to reduce the computation cost of neural network training, which can replace high-cost floating-point numbers (e.g., float32) with low-cost fixed-point numbers (e.g., int8/int16).

Source: Adaptive Precision Training: Quantify Back Propagation in Neural Networks with Fixed-point Numbers

Papers

Showing 35013525 of 4925 papers

TitleStatusHype
A White Paper on Neural Network Quantization0
Divergence Frontiers for Generative Models: Sample Complexity, Quantization Effects, and Frontier IntegralsCode0
Development of Quantized DNN Library for Exact Hardware Emulation0
Compositional Sketch SearchCode0
Energy-efficient Knowledge Distillation for Spiking Neural Networks0
FastICARL: Fast Incremental Classifier and Representation Learning with Efficient Budget Allocation in Audio Sensing Applications0
Neuroevolution-Enhanced Multi-Objective Optimization for Mixed-Precision Quantization0
On the Convergence of Differentially Private Federated Learning on Non-Lipschitz Objectives, and with Normalized Client Updates0
AI Enlightens Wireless Communication: Analyses, Solutions and Opportunities on CSI Feedback0
Decoupled Greedy Learning of CNNs for Synchronous and Asynchronous Distributed Learning0
Conditional COT-GAN for Video Prediction with Kernel SmoothingCode0
Verifying Quantized Neural Networks using SMT-Based Model Checking0
Cross-Modal Discrete Representation Learning0
SignalNet: A Low Resolution Sinusoid Decomposition and Estimation Network0
Fastening the Initial Access in 5G NR Sidelink for 6G V2X Networks0
Harnessing Unrecognizable Faces for Improving Face Recognition0
Theoretically Better and Numerically Faster Distributed Optimization with Smoothness-Aware Quantization Techniques0
Deep Unsupervised Learning for Joint Antenna Selection and Hybrid Beamforming0
Sigma-Delta and Distributed Noise-Shaping Quantization Methods for Random Fourier Features0
Differentiable Dynamic Quantization with Mixed Precision and Adaptive Resolution0
Granger Causality from Quantized Measurements0
Passive Beamforming Design for Intelligent Reflecting Surface Assisted MIMO Systems0
On the Distribution, Sparsity, and Inference-time Quantization of Attention Values in Transformers0
Gender Bias Amplification During Speed-Quality Optimization in Neural Machine Translation0
Full-Resolution Encoder-Decoder Networks with Multi-Scale Feature Fusion for Human Pose Estimation0
Show:102550
← PrevPage 141 of 197Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1FQ-ViT (ViT-L)Top-1 Accuracy (%)85.03Unverified
2FQ-ViT (ViT-B)Top-1 Accuracy (%)83.31Unverified
3FQ-ViT (Swin-B)Top-1 Accuracy (%)82.97Unverified
4FQ-ViT (Swin-S)Top-1 Accuracy (%)82.71Unverified
5FQ-ViT (DeiT-B)Top-1 Accuracy (%)81.2Unverified
6FQ-ViT (Swin-T)Top-1 Accuracy (%)80.51Unverified
7FQ-ViT (DeiT-S)Top-1 Accuracy (%)79.17Unverified
8Xception W8A8Top-1 Accuracy (%)78.97Unverified
9ADLIK-MO-ResNet50-W4A4Top-1 Accuracy (%)77.88Unverified
10ADLIK-MO-ResNet50-W3A4Top-1 Accuracy (%)77.34Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_3MAP160,327.04Unverified
2DTQMAP0.79Unverified
#ModelMetricClaimedVerifiedStatus
1OutEffHop-Bert_basePerplexity6.3Unverified
2OutEffHop-Bert_basePerplexity6.21Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy98.13Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy92.92Unverified
#ModelMetricClaimedVerifiedStatus
1SSD ResNet50 V1 FPN 640x640MAP34.3Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-495.13Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-496.38Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_5All84,809,664Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy99.8Unverified