SOTAVerified

Quantization

Quantization is a promising technique to reduce the computation cost of neural network training, which can replace high-cost floating-point numbers (e.g., float32) with low-cost fixed-point numbers (e.g., int8/int16).

Source: Adaptive Precision Training: Quantify Back Propagation in Neural Networks with Fixed-point Numbers

Papers

Showing 37513800 of 4925 papers

TitleStatusHype
Automatic Gain Control Design for Dynamic Visible Light Communication Systems0
Automatic low-bit hybrid quantization of neural networks through meta learning0
Automatic mixed precision for optimizing gained time with constrained loss mean-squared-error based on model partition to sequential sub-graphs0
Automatic Mixed-Precision Quantization Search of BERT0
Automatic Network Adaptation for Ultra-Low Uniform-Precision Quantization0
Automatic Parameter Tying in Neural Networks0
Automatic Pruning for Quantized Neural Networks0
Automating Nearest Neighbor Search Configuration with Constrained Optimization0
AutoMixQ: Self-Adjusting Quantization for High Performance Memory-Efficient Fine-Tuning0
Automotive Radar Sensing with Sparse Linear Arrays Using One-Bit Hankel Matrix Completion0
AutoQ: Automated Kernel-Wise Neural Network Quantization0
AutoQNN: An End-to-End Framework for Automatically Quantizing Neural Networks0
Autoregressive High-Order Finite Difference Modulo Imaging: High-Dynamic Range for Computer Vision Applications0
Auto-regressive Image Synthesis with Integrated Quantization0
Autoregressive Sign Language Production: A Gloss-Free Approach with Discrete Representations0
Autoregressive Speech Synthesis without Vector Quantization0
Auto-tuning Neural Network Quantization Framework for Collaborative Inference Between the Cloud and Edge0
Auto-ViT-Acc: An FPGA-Aware Automatic Acceleration Framework for Vision Transformer with Mixed-Scheme Quantization0
Avaliação do método dialético na quantização de imagens multiespectrais0
A Video Coding Method Based on Neural Network for CLIC20240
A Vision System for Multi-View Face Recognition0
A Wave is Worth 100 Words: Investigating Cross-Domain Transferability in Time Series0
AWEQ: Post-Training Quantization with Activation-Weight Equalization for Large Language Models0
A White Paper on Neural Network Quantization0
AWP: Activation-Aware Weight Pruning and Quantization with Projected Gradient Descent0
Background Modelling using Octree Color Quantization0
Back to Simplicity: How to Train Accurate BNNs from Scratch?0
Bag of Tricks with Quantized Convolutional Neural Networks for image classification0
Balanced Quantization: An Effective and Efficient Approach to Quantized Neural Networks0
Balance of Number of Embedding and their Dimensions in Vector Quantization0
Balancing Robustness and Efficiency in Embedded DNNs Through Activation Function Selection0
BAMSProd: A Step towards Generalizing the Adaptive Optimization Methods to Deep Binary Model0
Bandlimited signal reconstruction from leaky integrate-and-fire encoding using POCS0
Bandwidth-efficient Inference for Neural Image Compression0
Bang for the Buck: Vector Search on Cloud CPUs0
BasedAI: A decentralized P2P network for Zero Knowledge Large Language Models (ZK-LLMs)0
BasisConv: A method for compressed representation and learning in CNNs0
Bayesian-LoRA: LoRA based Parameter Efficient Fine-Tuning using Optimal Quantization levels and Rank Values trough Differentiable Bayesian Gates0
Bayes Merging of Multiple Vocabularies for Scalable Image Retrieval0
b-bit Marginal Regression0
BBQRec: Behavior-Bind Quantization for Multi-Modal Sequential Recommendation0
BDD4BNN: A BDD-based Quantitative Analysis Framework for Binarized Neural Networks0
BdSLW401: Transformer-Based Word-Level Bangla Sign Language Recognition Using Relative Quantization Encoding (RQE)0
BeamVQ: Aligning Space-Time Forecasting Model via Self-training on Physics-aware Metrics0
BEAST: Efficient Tokenization of B-Splines Encoded Action Sequences for Imitation Learning0
BELT:Bootstrapping Electroencephalography-to-Language Decoding and Zero-Shot Sentiment Classification by Natural Language Supervision0
Benchmarking CFAR and CNN-based Peak Detection Algorithms in ISAC under Hardware Impairments0
Benchmarking quantized LLaMa-based models on the Brazilian Secondary School Exam0
Benchmarking the Reliability of Post-training Quantization: a Particular Focus on Worst-case Performance0
Benchmarking the Robustness of Quantized Models0
Show:102550
← PrevPage 76 of 99Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1FQ-ViT (ViT-L)Top-1 Accuracy (%)85.03Unverified
2FQ-ViT (ViT-B)Top-1 Accuracy (%)83.31Unverified
3FQ-ViT (Swin-B)Top-1 Accuracy (%)82.97Unverified
4FQ-ViT (Swin-S)Top-1 Accuracy (%)82.71Unverified
5FQ-ViT (DeiT-B)Top-1 Accuracy (%)81.2Unverified
6FQ-ViT (Swin-T)Top-1 Accuracy (%)80.51Unverified
7FQ-ViT (DeiT-S)Top-1 Accuracy (%)79.17Unverified
8Xception W8A8Top-1 Accuracy (%)78.97Unverified
9ADLIK-MO-ResNet50-W4A4Top-1 Accuracy (%)77.88Unverified
10ADLIK-MO-ResNet50-W3A4Top-1 Accuracy (%)77.34Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_3MAP160,327.04Unverified
2DTQMAP0.79Unverified
#ModelMetricClaimedVerifiedStatus
1OutEffHop-Bert_basePerplexity6.3Unverified
2OutEffHop-Bert_basePerplexity6.21Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy98.13Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy92.92Unverified
#ModelMetricClaimedVerifiedStatus
1SSD ResNet50 V1 FPN 640x640MAP34.3Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-495.13Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-496.38Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_5All84,809,664Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy99.8Unverified