SOTAVerified

Quantization

Quantization is a promising technique to reduce the computation cost of neural network training, which can replace high-cost floating-point numbers (e.g., float32) with low-cost fixed-point numbers (e.g., int8/int16).

Source: Adaptive Precision Training: Quantify Back Propagation in Neural Networks with Fixed-point Numbers

Papers

Showing 14511475 of 4925 papers

TitleStatusHype
EQ-Net: A Unified Deep Learning Framework for Log-Likelihood Ratio Estimation and Quantization0
Efficient Discrete Supervised Hashing for Large-scale Cross-modal Retrieval0
Error Analysis of CORDIC Processor with FPGA Implementation0
Efficient Distributed Training through Gradient Compression with Sparsification and Quantization Techniques0
Estimation and Quantization of Expected Persistence Diagrams0
Event-Triggered Quantized Average Consensus via Mass Summation0
BTEL: A Binary Tree Encoding Approach for Visual Localization0
Efficient Error-Tolerant Quantized Neural Network Accelerators0
Efficient Evaluation of Quantization-Effects in Neural Codecs0
Efficient Execution of Quantized Deep Learning Models: A Compiler Approach0
Bullion: A Column Store for Machine Learning0
Efficient Federated Learning over Multiple Access Channel with Differential Privacy Constraints0
Efficient Fine-Tuning of Quantized Models via Adaptive Rank and Bitwidth0
Efficient FPGA-accelerated Convolutional Neural Networks for Cloud Detection on CubeSats0
Efficient Generative Modeling with Residual Vector Quantization-Based Tokens0
Efficient Hardware Implementation of Incremental Learning and Inference on Chip0
Edge Intelligence Optimization for Large Language Model Inference with Batching and Quantization0
Efficient Hybrid Network Architectures for Extremely Quantized Neural Networks Enabling Intelligence at the Edge0
Edge Inference with Fully Differentiable Quantized Mixed Precision Neural Networks0
Efficient Inference via Universal LSH Kernel0
Boost CTR Prediction for New Advertisements via Modeling Visual Content0
EdgeFusion: On-Device Text-to-Image Generation0
Edge-Enabled Real-time Railway Track Segmentation0
Efficient Large-Scale Approximate Nearest Neighbor Search on OpenCL FPGA0
BOMP-NAS: Bayesian Optimization Mixed Precision NAS0
Show:102550
← PrevPage 59 of 197Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1FQ-ViT (ViT-L)Top-1 Accuracy (%)85.03Unverified
2FQ-ViT (ViT-B)Top-1 Accuracy (%)83.31Unverified
3FQ-ViT (Swin-B)Top-1 Accuracy (%)82.97Unverified
4FQ-ViT (Swin-S)Top-1 Accuracy (%)82.71Unverified
5FQ-ViT (DeiT-B)Top-1 Accuracy (%)81.2Unverified
6FQ-ViT (Swin-T)Top-1 Accuracy (%)80.51Unverified
7FQ-ViT (DeiT-S)Top-1 Accuracy (%)79.17Unverified
8Xception W8A8Top-1 Accuracy (%)78.97Unverified
9ADLIK-MO-ResNet50-W4A4Top-1 Accuracy (%)77.88Unverified
10ADLIK-MO-ResNet50-W3A4Top-1 Accuracy (%)77.34Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_3MAP160,327.04Unverified
2DTQMAP0.79Unverified
#ModelMetricClaimedVerifiedStatus
1OutEffHop-Bert_basePerplexity6.3Unverified
2OutEffHop-Bert_basePerplexity6.21Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy98.13Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy92.92Unverified
#ModelMetricClaimedVerifiedStatus
1SSD ResNet50 V1 FPN 640x640MAP34.3Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-495.13Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-496.38Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_5All84,809,664Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy99.8Unverified