SOTAVerified

Quantization

Quantization is a promising technique to reduce the computation cost of neural network training, which can replace high-cost floating-point numbers (e.g., float32) with low-cost fixed-point numbers (e.g., int8/int16).

Source: Adaptive Precision Training: Quantify Back Propagation in Neural Networks with Fixed-point Numbers

Papers

Showing 42514300 of 4925 papers

TitleStatusHype
A Mean Field Theory of Quantized Deep Networks: The Quantization-Depth Trade-OffCode0
SHE: A Fast and Accurate Deep Neural Network for Encrypted DataCode0
SeerNet: Predicting Convolutional Neural Network Feature-Map Sparsity Through Low-Bit Quantization0
Fully Quantized Network for Object Detection0
Learning Channel-Wise Interactions for Binary Convolutional Neural Networks0
Compressing Unknown Images With Product Quantizer for Efficient Zero-Shot Classification0
Deep Metric Learning to RankCode0
Enhanced Bayesian Compression via Deep Reinforcement Learning0
Increasing Compactness Of Deep Learning Based Speech Enhancement Models With Parameter Pruning And Quantization Techniques0
Deep Learning for Distributed Optimization: Applications to Wireless Resource Management0
DeepShift: Towards Multiplication-Less Neural NetworksCode0
Quantization Loss Re-Learning Method0
Memory-Driven Mixed Low Precision Quantization For Enabling Deep Network Inference On MicrocontrollersCode0
Mixed Precision Training With 8-bit Floating Point0
Instant Quantization of Neural Networks using Monte Carlo Methods0
A Reconfigurable Dual-Mode Tracking SAR ADC without Analog Subtraction0
Texture CNN for Thermoelectric Metal Pipe Image Classification0
Brain-inspired reverse adversarial examples0
Mixed Precision DNNs: All you need is a good parametrizationCode1
Learning In Practice: Reasoning About Quantization0
A Unified Theory of SGD: Variance Reduction, Sampling, Quantization and Coordinate Descent0
Quantization-Based Regularization for AutoencodersCode0
Natural Compression for Distributed Deep Learning0
Communication-Efficient Distributed Blockwise Momentum SGD with Error-FeedbackCode0
HadaNets: Flexible Quantization Strategies for Neural Networks0
Additive Noise Annealing and Approximation Properties of Quantized Neural NetworksCode0
Structured Compression by Weight Encryption for Unstructured Pruning and Quantization0
One-bit LFMCW Radar: Spectrum Analysis and Target Detection0
NTP : A Neural Network Topology Profiler0
Derived Codebooks for High-Accuracy Nearest Neighbor Search0
DeepCABAC: Context-adaptive binary arithmetic coding for deep neural network compression0
Optimizing MRF-ASL Scan Design for Precise Quantification of Brain Hemodynamics using Neural Network Regression0
Learning to Groove with Inverse Sequence TransformationsCode1
MobiVSR: A Visual Speech Recognition Solution for Mobile Devices0
Compressing Weight-updates for Image Artifacts Removal Neural Networks0
Two-layer Near-lossless HDR Coding with Backward Compatibility to JPEG0
Reconstruction of Privacy-Sensitive Data from Protected Templates0
2-bit Model Compression of Deep Convolutional Neural Network on ASIC Engine for Image Retrieval0
SinReQ: Generalized Sinusoidal Regularization for Low-Bitwidth Deep Quantized Training0
Efficient Discrete Supervised Hashing for Large-scale Cross-modal Retrieval0
Compression of Acoustic Event Detection Models with Low-rank Matrix Factorization and Quantization Training0
Toward Extremely Low Bit and Lossless Accuracy in DNNs with Progressive ADMM0
Double Viterbi: Weight Encoding for High Compression Ratio and Fast On-Chip Reconstruction for Deep Neural Network0
Nested Dithered Quantization for Communication Reduction in Distributed Training0
Dimension-Free Bounds for Low-Precision Training0
Analysis of Quantized Models0
ACIQ: Analytical Clipping for Integer Quantization of neural networksCode0
N-Ary Quantization for CNN Model Compression and Inference Acceleration0
Adaptive Sample-space & Adaptive Probability coding: a neural-network based approach for compression0
Unsupervised automatic classification of Scanning Electron Microscopy (SEM) images of CD4+ cells with varying extent of HIV virion infection0
Show:102550
← PrevPage 86 of 99Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1FQ-ViT (ViT-L)Top-1 Accuracy (%)85.03Unverified
2FQ-ViT (ViT-B)Top-1 Accuracy (%)83.31Unverified
3FQ-ViT (Swin-B)Top-1 Accuracy (%)82.97Unverified
4FQ-ViT (Swin-S)Top-1 Accuracy (%)82.71Unverified
5FQ-ViT (DeiT-B)Top-1 Accuracy (%)81.2Unverified
6FQ-ViT (Swin-T)Top-1 Accuracy (%)80.51Unverified
7FQ-ViT (DeiT-S)Top-1 Accuracy (%)79.17Unverified
8Xception W8A8Top-1 Accuracy (%)78.97Unverified
9ADLIK-MO-ResNet50-W4A4Top-1 Accuracy (%)77.88Unverified
10ADLIK-MO-ResNet50-W3A4Top-1 Accuracy (%)77.34Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_3MAP160,327.04Unverified
2DTQMAP0.79Unverified
#ModelMetricClaimedVerifiedStatus
1OutEffHop-Bert_basePerplexity6.3Unverified
2OutEffHop-Bert_basePerplexity6.21Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy98.13Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy92.92Unverified
#ModelMetricClaimedVerifiedStatus
1SSD ResNet50 V1 FPN 640x640MAP34.3Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-495.13Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-496.38Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_5All84,809,664Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy99.8Unverified