SOTAVerified

Quantization

Quantization is a promising technique for reducing the computational cost of neural network training: it replaces high-cost floating-point numbers (e.g., float32) with low-cost fixed-point numbers (e.g., int8/int16).

Source: Adaptive Precision Training: Quantify Back Propagation in Neural Networks with Fixed-point Numbers
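
As a rough illustration of the float-to-integer replacement described above, the sketch below maps a list of floats to int8 codes with a single shared scale and then maps them back. It assumes symmetric per-tensor quantization, which is one common scheme and not necessarily the method of the cited paper. The names `quantize_int8` and `dequantize` are hypothetical helpers introduced only for this example.

```python
def quantize_int8(values):
    """Map floats to int8 codes in [-127, 127] using one shared scale.

    Assumption: symmetric per-tensor quantization (zero maps to code 0).
    """
    max_abs = max((abs(v) for v in values), default=0.0)
    # Guard against an all-zero input, where any scale would work.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # round() picks the nearest integer; clamping protects against
    # floating-point overshoot at the range boundaries.
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


def dequantize(codes, scale):
    """Recover approximate floats from the integer codes."""
    return [c * scale for c in codes]


weights = [0.5, -1.27, 0.003, 1.0]
codes, scale = quantize_int8(weights)
restored = dequantize(codes, scale)

# The round-trip error per value is bounded by half a quantization step.
max_error = max(abs(w - r) for w, r in zip(weights, restored))
print(codes, scale, max_error)
```

Each code fits in one byte instead of four, and the price is a rounding error of at most half the scale per value. Small values such as 0.003 collapse to code 0 here, which is the kind of precision loss that the quantization methods listed on this page try to manage.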

Papers

Showing 3551-3600 of 4925 papers

| Title | Status | Hype |
| --- | --- | --- |
| 3U-EdgeAI: Ultra-Low Memory Training, Ultra-Low Bitwidth Quantization, and Ultra-Low Latency Acceleration | | 0 |
| Estimation and Quantization of Expected Persistence Diagrams | | 0 |
| In-Hindsight Quantization Range Estimation for Quantized Training | | 0 |
| RBNN: Memory-Efficient Reconfigurable Deep Binary Neural Network with IP Protection for Internet of Things | | 0 |
| Model Selection CNN-based VVC Quality Enhancement | | 0 |
| Q-Rater: Non-Convex Optimization for Post-Training Uniform Quantization | | 0 |
| Training Quantized Neural Networks to Global Optimality via Semidefinite Programming | | 0 |
| A reconfigurable neural network ASIC for detector front-end data compression at the HL-LHC | | 0 |
| One Model for All Quantization: A Quantized Network Supporting Hot-Swap Bit-Width Adjustment | | 0 |
| On the Adversarial Robustness of Quantized Neural Networks | | 0 |
| Stealthy Backdoors as Compression Artifacts | Code | 0 |
| Memory-Efficient Deep Learning Inference in Trusted Execution Environments | | 0 |
| Hessian Aware Quantization of Spiking Neural Networks | Code | 0 |
| NUQSGD: Provably Communication-efficient Data-parallel SGD via Nonuniform Quantization | | 0 |
| HAO: Hardware-aware neural Architecture Optimization for Efficient Inference | | 0 |
| Spatio-Temporal Pruning and Quantization for Low-latency Spiking Neural Networks | | 0 |
| Quantization of Deep Neural Networks for Accurate Edge Computing | | 0 |
| Do All MobileNets Quantize Poorly? Gaining Insights into the Effect of Quantization on Depthwise Separable Convolutional Networks Through the Eyes of Multi-scale Distributional Dynamics | | 0 |
| FPGA Implementations of Layered MinSum LDPC Decoders Using RCQ Message Passing | | 0 |
| Filtering Empty Camera Trap Images in Embedded Systems | Code | 0 |
| Random and Adversarial Bit Error Robustness: Energy-Efficient and Secure DNN Accelerators | Code | 0 |
| Homomorphic Encryption-Enabled Distance-Based Distributed Formation Control with Distance Mismatch Estimators | | 0 |
| Span Pointer Networks for Non-Autoregressive Task-Oriented Semantic Parsing | | 0 |
| End-to-end Keyword Spotting using Neural Architecture Search and Quantization | | 0 |
| NoiseVC: Towards High Quality Zero-Shot Voice Conversion | | 0 |
| Soft then Hard: Rethinking the Quantization in Neural Image Compression | | 0 |
| A Novel Unified Model for Multi-exposure Stereo Coding Based on Low Rank Tucker-ALS and 3D-HEVC | | 0 |
| Q-matrix Unaware Double JPEG Detection using DCT-Domain Deep BiLSTM Network | | 0 |
| Quantized State Feedback Stabilization of Nonlinear Systems under Denial-of-Service | | 0 |
| Functional quantization of rough volatility and applications to volatility derivatives | | 0 |
| Learned transform compression with optimized entropy encoding | Code | 0 |
| Towards On-Device Face Recognition in Body-worn Cameras | | 0 |
| TENT: Efficient Quantization of Neural Networks on the tiny Edge with Tapered FixEd PoiNT | | 0 |
| Binary Neural Network for Speaker Verification | | 0 |
| Unconstrained Face Recognition using ASURF and Cloud-Forest Classifier optimized with VLAD | | 0 |
| Arabic Compact Language Modelling for Resource Limited Devices | | 0 |
| Bit-Mixer: Mixed-precision networks with runtime bit-width selection | | 0 |
| 1-Bit Compressive Sensing for Efficient Federated Learning Over the Air | | 0 |
| Zero-shot Adversarial Quantization | | 0 |
| Automated Backend-Aware Post-Training Quantization | | 0 |
| Scalable and Efficient Neural Speech Coding: A Hybrid Design | | 0 |
| Hierarchical Federated Learning with Quantization: Convergence Analysis and System Design | | 0 |
| A Survey of Quantization Methods for Efficient Neural Network Inference | | 0 |
| DNN Quantization with Attention | | 0 |
| RPVNet: A Deep and Efficient Range-Point-Voxel Fusion Network for LiDAR Point Cloud Segmentation | | 0 |
| The NLP Cookbook: Modern Recipes for Transformer based Deep Learning Architectures | | 0 |
| Decomposing Normal and Abnormal Features of Medical Images into Discrete Latent Codes for Content-Based Image Retrieval | | 0 |
| n-hot: Efficient bit-level sparsity for powers-of-two neural network quantization | | 0 |
| Evaluating Post-Training Compression in GANs using Locality-Sensitive Hashing | | 0 |
| Resilient Control under Quantization and Denial-of-Service: Co-designing a Deadbeat Controller and Transmission Protocol | | 0 |

Benchmark Results

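| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | FQ-ViT (ViT-L) | Top-1 Accuracy (%) | 85.03 | | Unverified |
| 2 | FQ-ViT (ViT-B) | Top-1 Accuracy (%) | 83.31 | | Unverified |
| 3 | FQ-ViT (Swin-B) | Top-1 Accuracy (%) | 82.97 | | Unverified |
| 4 | FQ-ViT (Swin-S) | Top-1 Accuracy (%) | 82.71 | | Unverified |
| 5 | FQ-ViT (DeiT-B) | Top-1 Accuracy (%) | 81.2 | | Unverified |
| 6 | FQ-ViT (Swin-T) | Top-1 Accuracy (%) | 80.51 | | Unverified |
| 7 | FQ-ViT (DeiT-S) | Top-1 Accuracy (%) | 79.17 | | Unverified |
| 8 | Xception W8A8 | Top-1 Accuracy (%) | 78.97 | | Unverified |
| 9 | ADLIK-MO-ResNet50-W4A4 | Top-1 Accuracy (%) | 77.88 | | Unverified |
| 10 | ADLIK-MO-ResNet50-W3A4 | Top-1 Accuracy (%) | 77.34 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | 3DCNN_VIVA_3 | MAP | 160,327.04 | | Unverified |
| 2 | DTQ | MAP | 0.79 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | OutEffHop-Bert_base | Perplexity | 6.3 | | Unverified |
| 2 | OutEffHop-Bert_base | Perplexity | 6.21 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | | Accuracy | 98.13 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | | Accuracy | 92.92 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SSD ResNet50 V1 FPN 640x640 | MAP | 34.3 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | | TAR @ FAR=1e-4 | 95.13 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | | TAR @ FAR=1e-4 | 96.38 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | 3DCNN_VIVA_5 | All | 84,809,664 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | | Accuracy | 99.8 | | Unverified |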