SOTAVerified

Quantization

Quantization is a promising technique to reduce the computation cost of neural network training, which can replace high-cost floating-point numbers (e.g., float32) with low-cost fixed-point numbers (e.g., int8/int16).

Source: Adaptive Precision Training: Quantify Back Propagation in Neural Networks with Fixed-point Numbers

Papers

Showing 34013425 of 4925 papers

TitleStatusHype
Model Selection CNN-based VVC QualityEnhancement0
Q-Rater: Non-Convex Optimization for Post-Training Uniform Quantization0
A reconfigurable neural network ASIC for detector front-end data compression at the HL-LHC0
One Model for All Quantization: A Quantized Network Supporting Hot-Swap Bit-Width Adjustment0
Training Quantized Neural Networks to Global Optimality via Semidefinite Programming0
Binarized Aggregated Network with Quantization: Flexible Deep Learning Deployment for CSI Feedback in Massive MIMO SystemCode1
On the Adversarial Robustness of Quantized Neural Networks0
Stealthy Backdoors as Compression ArtifactsCode0
Memory-Efficient Deep Learning Inference in Trusted Execution Environments0
Hessian Aware Quantization of Spiking Neural NetworksCode0
ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed TrainingCode1
NUQSGD: Provably Communication-efficient Data-parallel SGD via Nonuniform Quantization0
HAO: Hardware-aware neural Architecture Optimization for Efficient Inference0
Spatio-Temporal Pruning and Quantization for Low-latency Spiking Neural Networks0
Quantization of Deep Neural Networks for Accurate Edge Computing0
Do All MobileNets Quantize Poorly? Gaining Insights into the Effect of Quantization on Depthwise Separable Convolutional Networks Through the Eyes of Multi-scale Distributional Dynamics0
Differentiable Model Compression via Pseudo Quantization NoiseCode1
FPGA Implementations of Layered MinSum LDPC Decoders Using RCQ Message Passing0
Conditional Coding and Variable Bitrate for Practical Learned Video CodingCode1
Filtering Empty Camera Trap Images in Embedded SystemsCode0
Random and Adversarial Bit Error Robustness: Energy-Efficient and Secure DNN AcceleratorsCode0
Matching-oriented Product Quantization For Ad-hoc RetrievalCode1
Homomorphic Encryption-Enabled Distance-Based Distributed Formation Control with Distance Mismatch Estimators0
Span Pointer Networks for Non-Autoregressive Task-Oriented Semantic Parsing0
End-to-end Keyword Spotting using Neural Architecture Search and Quantization0
Show:102550
← PrevPage 137 of 197Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1FQ-ViT (ViT-L)Top-1 Accuracy (%)85.03Unverified
2FQ-ViT (ViT-B)Top-1 Accuracy (%)83.31Unverified
3FQ-ViT (Swin-B)Top-1 Accuracy (%)82.97Unverified
4FQ-ViT (Swin-S)Top-1 Accuracy (%)82.71Unverified
5FQ-ViT (DeiT-B)Top-1 Accuracy (%)81.2Unverified
6FQ-ViT (Swin-T)Top-1 Accuracy (%)80.51Unverified
7FQ-ViT (DeiT-S)Top-1 Accuracy (%)79.17Unverified
8Xception W8A8Top-1 Accuracy (%)78.97Unverified
9ADLIK-MO-ResNet50-W4A4Top-1 Accuracy (%)77.88Unverified
10ADLIK-MO-ResNet50-W3A4Top-1 Accuracy (%)77.34Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_3MAP160,327.04Unverified
2DTQMAP0.79Unverified
#ModelMetricClaimedVerifiedStatus
1OutEffHop-Bert_basePerplexity6.3Unverified
2OutEffHop-Bert_basePerplexity6.21Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy98.13Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy92.92Unverified
#ModelMetricClaimedVerifiedStatus
1SSD ResNet50 V1 FPN 640x640MAP34.3Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-495.13Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-496.38Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_5All84,809,664Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy99.8Unverified