SOTAVerified

Quantization

Quantization is a promising technique to reduce the computation cost of neural network training, which can replace high-cost floating-point numbers (e.g., float32) with low-cost fixed-point numbers (e.g., int8/int16).

Source: Adaptive Precision Training: Quantify Back Propagation in Neural Networks with Fixed-point Numbers

Papers

Showing 26012650 of 4925 papers

TitleStatusHype
Regularization-based Framework for Quantization-, Fault- and Variability-Aware Training0
Regularization in Relevance Learning Vector Quantization Using l one Norms0
Regularized Residual Quantization: a multi-layer sparse dictionary learning approach0
Regularized Vector Quantization for Tokenized Image Synthesis0
Reinforced Bit Allocation under Task-Driven Semantic Distortion Metrics0
Reinforcement Learning for Finite Space Mean-Field Type Games0
Relationship between Model Compression and Adversarial Robustness: A Review of Current Evidence0
Relative Entropy Regularized Reinforcement Learning for Efficient Encrypted Policy Synthesis0
ReLeQ: A Reinforcement Learning Approach for Deep Quantization of Neural Networks0
Reliability-Aware Quantization for Anti-Aging NPUs0
Reliability of PET/CT shape and heterogeneity features in functional and morphological components of Non-Small Cell Lung Cancer tumors: a repeatability analysis in a prospective multi-center cohort0
Reliable edge machine learning hardware for scientific applications0
ReLU and Addition-based Gated RNN0
ReLU Neural Networks, Polyhedral Decompositions, and Persistent Homolog0
Remember and Recall: Associative-Memory-based Trajectory Prediction0
RepQ: Generalizing Quantization-Aware Training for Re-Parametrized Architectures0
RepQuant: Towards Accurate Post-Training Quantization of Large Transformer Models via Scale Reparameterization0
Representation Collapsing Problems in Vector Quantization0
Representation Learning using Event-based STDP0
Reputation-Driven Asynchronous Federated Learning for Enhanced Trajectory Prediction with Blockchain0
REQ-YOLO: A Resource-Aware, Efficient Quantization Framework for Object Detection on FPGAs0
Rescuing Deep Hashing from Dead Bits Problem0
ResFed: Communication Efficient Federated Learning by Transmitting Deep Compressed Residuals0
Residual Expansion Algorithm: Fast and Effective Optimization for Nonconvex Least Squares Problems0
Resiliency of Deep Neural Networks under Quantization0
Resilient Control under Quantization and Denial-of-Service: Co-designing a Deadbeat Controller and Transmission Protocol0
Resource Allocation and Dithering of Bayesian Parameter Estimation Using Mixed-Resolution Data0
Resource Allocation for Compression-aided Federated Learning with High Distortion Rate0
Resource-aware Mixed-precision Quantization for Enhancing Deployability of Transformers for Time-series Forecasting on Embedded FPGAs0
Resource-efficient Deep Neural Networks for Automotive Radar Interference Mitigation0
Resource-Efficient Language Models: Quantization for Fast and Accessible Inference0
Resource-Efficient Neural Networks for Embedded Systems0
Resource Efficient Neural Networks Using Hessian Based Pruning0
Resource-Efficient Transformer Architecture: Optimizing Memory and Execution Time for Real-Time Applications0
ResQ: Residual Quantization for Video Perception0
Restorative Speech Enhancement: A Progressive Approach Using SE and Codec Modules0
Résumé abstractif à partir d'une transcription audio0
Rethinking Deconvolution for 2D Human Pose Estimation Light yet Accurate Model for Real-time Edge Computing0
Rethinking Diffusion for Text-Driven Human Motion Generation0
Rethinking Diffusion for Text-Driven Human Motion Generation: Redundant Representations, Evaluation, and Masked Autoregression0
Rethinking Discrete Tokens: Treating Them as Conditions for Continuous Autoregressive Image Synthesis0
Rethinking Few-Shot Medical Segmentation: A Vector Quantization View0
Rethinking Generalization in American Sign Language Prediction for Edge Devices with Extremely Low Memory Footprint0
Rethinking Mutual Information for Language Conditioned Skill Discovery on Imitation Learning0
Rethinking Neural Network Quantization0
Rethinking Post-Training Quantization: Introducing a Statistical Pre-Calibration Approach0
Retraining-Based Iterative Weight Quantization for Deep Neural Networks0
Reverse Link Analysis for Full-Duplex Cellular Networks with Low Resolution ADC/DAC0
Reversible Quantization Index Modulation for Static Deep Neural Network Watermarking0
Revisiting Data Augmentation in Model Compression: An Empirical and Comprehensive Study0
Show:102550
← PrevPage 53 of 99Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1FQ-ViT (ViT-L)Top-1 Accuracy (%)85.03Unverified
2FQ-ViT (ViT-B)Top-1 Accuracy (%)83.31Unverified
3FQ-ViT (Swin-B)Top-1 Accuracy (%)82.97Unverified
4FQ-ViT (Swin-S)Top-1 Accuracy (%)82.71Unverified
5FQ-ViT (DeiT-B)Top-1 Accuracy (%)81.2Unverified
6FQ-ViT (Swin-T)Top-1 Accuracy (%)80.51Unverified
7FQ-ViT (DeiT-S)Top-1 Accuracy (%)79.17Unverified
8Xception W8A8Top-1 Accuracy (%)78.97Unverified
9ADLIK-MO-ResNet50-W4A4Top-1 Accuracy (%)77.88Unverified
10ADLIK-MO-ResNet50-W3A4Top-1 Accuracy (%)77.34Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_3MAP160,327.04Unverified
2DTQMAP0.79Unverified
#ModelMetricClaimedVerifiedStatus
1OutEffHop-Bert_basePerplexity6.3Unverified
2OutEffHop-Bert_basePerplexity6.21Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy98.13Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy92.92Unverified
#ModelMetricClaimedVerifiedStatus
1SSD ResNet50 V1 FPN 640x640MAP34.3Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-495.13Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-496.38Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_5All84,809,664Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy99.8Unverified