SOTAVerified

Quantization

Quantization is a promising technique to reduce the computation cost of neural network training, which can replace high-cost floating-point numbers (e.g., float32) with low-cost fixed-point numbers (e.g., int8/int16).

Source: Adaptive Precision Training: Quantify Back Propagation in Neural Networks with Fixed-point Numbers

Papers

Showing 26262650 of 4925 papers

TitleStatusHype
Resilient Control under Quantization and Denial-of-Service: Co-designing a Deadbeat Controller and Transmission Protocol0
Resource Allocation and Dithering of Bayesian Parameter Estimation Using Mixed-Resolution Data0
Resource Allocation for Compression-aided Federated Learning with High Distortion Rate0
Resource-aware Mixed-precision Quantization for Enhancing Deployability of Transformers for Time-series Forecasting on Embedded FPGAs0
Resource-efficient Deep Neural Networks for Automotive Radar Interference Mitigation0
Resource-Efficient Language Models: Quantization for Fast and Accessible Inference0
Resource-Efficient Neural Networks for Embedded Systems0
Resource Efficient Neural Networks Using Hessian Based Pruning0
Resource-Efficient Transformer Architecture: Optimizing Memory and Execution Time for Real-Time Applications0
ResQ: Residual Quantization for Video Perception0
Restorative Speech Enhancement: A Progressive Approach Using SE and Codec Modules0
Résumé abstractif à partir d'une transcription audio0
Rethinking Deconvolution for 2D Human Pose Estimation Light yet Accurate Model for Real-time Edge Computing0
Rethinking Diffusion for Text-Driven Human Motion Generation0
Rethinking Diffusion for Text-Driven Human Motion Generation: Redundant Representations, Evaluation, and Masked Autoregression0
Rethinking Discrete Tokens: Treating Them as Conditions for Continuous Autoregressive Image Synthesis0
Rethinking Few-Shot Medical Segmentation: A Vector Quantization View0
Rethinking Generalization in American Sign Language Prediction for Edge Devices with Extremely Low Memory Footprint0
Rethinking Mutual Information for Language Conditioned Skill Discovery on Imitation Learning0
Rethinking Neural Network Quantization0
Rethinking Post-Training Quantization: Introducing a Statistical Pre-Calibration Approach0
Retraining-Based Iterative Weight Quantization for Deep Neural Networks0
Reverse Link Analysis for Full-Duplex Cellular Networks with Low Resolution ADC/DAC0
Reversible Quantization Index Modulation for Static Deep Neural Network Watermarking0
Revisiting Data Augmentation in Model Compression: An Empirical and Comprehensive Study0
Show:102550
← PrevPage 106 of 197Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1FQ-ViT (ViT-L)Top-1 Accuracy (%)85.03Unverified
2FQ-ViT (ViT-B)Top-1 Accuracy (%)83.31Unverified
3FQ-ViT (Swin-B)Top-1 Accuracy (%)82.97Unverified
4FQ-ViT (Swin-S)Top-1 Accuracy (%)82.71Unverified
5FQ-ViT (DeiT-B)Top-1 Accuracy (%)81.2Unverified
6FQ-ViT (Swin-T)Top-1 Accuracy (%)80.51Unverified
7FQ-ViT (DeiT-S)Top-1 Accuracy (%)79.17Unverified
8Xception W8A8Top-1 Accuracy (%)78.97Unverified
9ADLIK-MO-ResNet50-W4A4Top-1 Accuracy (%)77.88Unverified
10ADLIK-MO-ResNet50-W3A4Top-1 Accuracy (%)77.34Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_3MAP160,327.04Unverified
2DTQMAP0.79Unverified
#ModelMetricClaimedVerifiedStatus
1OutEffHop-Bert_basePerplexity6.3Unverified
2OutEffHop-Bert_basePerplexity6.21Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy98.13Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy92.92Unverified
#ModelMetricClaimedVerifiedStatus
1SSD ResNet50 V1 FPN 640x640MAP34.3Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-495.13Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-496.38Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_5All84,809,664Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy99.8Unverified