SOTAVerified

Quantization

Quantization is a promising technique to reduce the computation cost of neural network training, which can replace high-cost floating-point numbers (e.g., float32) with low-cost fixed-point numbers (e.g., int8/int16).

Source: Adaptive Precision Training: Quantify Back Propagation in Neural Networks with Fixed-point Numbers

Papers

Showing 44014450 of 4925 papers

TitleStatusHype
EfQAT: An Efficient Framework for Quantization-Aware Training0
Ef-QuantFace: Streamlined Face Recognition with Small Data and Low-Bit Precision0
Elastic Significant Bit Quantization and Acceleration for Deep Neural Networks0
ELMGS: Enhancing memory and computation scaLability through coMpression for 3D Gaussian Splatting0
Embedded Phase Shifting: Robust Phase Shifting With Embedded Signals0
Embedding Compression for Efficient Re-Identification0
Embedding Compression with Isotropic Iterative Quantization0
Emergent Quantized Communication0
Emotion Recognition Using Speaker Cues0
Empirical Evaluation of Post-Training Quantization Methods for Language Tasks0
Emulation Learning for Neuromimetic Systems0
Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications0
Fast and Efficient 2-bit LLM Inference on GPU: 2/4/16-bit in a Weight Matrix with Asynchronous Dequantization0
Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment0
Enabling On-Device CNN Training by Self-Supervised Instance Filtering and Error Map Pruning0
Enabling On-device Continual Learning with Binary Neural Networks0
Enabling On-Device Medical AI Assistants via Input-Driven Saliency Adaptation0
Enabling Fast Deep Learning on Tiny Energy-Harvesting IoT Devices0
Encoder-Quantization-Motion-based Video Quality Metrics0
End-to-End Autoencoder Communications with Optimized Interference Suppression0
End-to-end Binary Representation Learning via Direct Binary Embedding0
End-to-end codesign of Hessian-aware quantized neural networks for FPGAs and ASICs0
End-to-End Efficient Representation Learning via Cascading Combinatorial Optimization0
End-to-end fully-binarized network design: from Generic Learned Thermometer to Block Pruning0
End-to-end Keyword Spotting using Neural Architecture Search and Quantization0
End-to-End Latent Fingerprint Search0
End-to-End Learned Image Compression with Quantized Weights and Activations0
End-to-end Learning of Compressible Features0
End-to-End Neural Network Compression via _1_2 Regularized Latency Surrogates0
End-to-end optimization of nonlinear transform codes for perceptual quality0
End-to-End Optimized Speech Coding with Deep Neural Networks0
End-to-end Quantized Training via Log-Barrier Extensions0
End-to-End Rate-Distortion Optimization for Bi-Directional Learned Video Compression0
End-to-end workflow for machine learning-based qubit readout with QICK and hls4ml0
Energy-Aware LLMs: A step towards sustainable AI for downstream applications0
Energy awareness in low precision neural networks0
Energy Efficiency Maximization Precoding for Quantized Massive MIMO Systems0
Energy Efficiency Optimization for Millimeter Wave System with Resolution-Adaptive ADCs0
Energy-Efficient Distributed Learning Algorithms for Coarsely Quantized Signals0
Energy-efficient Knowledge Distillation for Spiking Neural Networks0
Energy Efficient Learning with Low Resolution Stochastic Domain Wall Synapse Based Deep Neural Networks0
Energy-Efficient Respiratory Anomaly Detection in Premature Newborn Infants0
Energy-Efficient Transformer Inference: Optimization Strategies for Time Series Classification0
Engineering the Neural Automatic Passenger Counter0
English K_Quantization of LLMs Does Not Disproportionately Diminish Multilingual Performance0
Enhanced Bayesian Compression via Deep Reinforcement Learning0
Enhanced Blind Calibration of Uniform Linear Arrays with One-Bit Quantization by Kullback-Leibler Divergence Covariance Fitting0
Enhance Feature Discrimination for Unsupervised Hashing0
Enhancement Of Coded Speech Using a Mask-Based Post-Filter0
Enhancing Bridge Deck Delamination Detection Based on Aerial Thermography Through Grayscale Morphologic Reconstruction: A Case Study0
Show:102550
← PrevPage 89 of 99Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1FQ-ViT (ViT-L)Top-1 Accuracy (%)85.03Unverified
2FQ-ViT (ViT-B)Top-1 Accuracy (%)83.31Unverified
3FQ-ViT (Swin-B)Top-1 Accuracy (%)82.97Unverified
4FQ-ViT (Swin-S)Top-1 Accuracy (%)82.71Unverified
5FQ-ViT (DeiT-B)Top-1 Accuracy (%)81.2Unverified
6FQ-ViT (Swin-T)Top-1 Accuracy (%)80.51Unverified
7FQ-ViT (DeiT-S)Top-1 Accuracy (%)79.17Unverified
8Xception W8A8Top-1 Accuracy (%)78.97Unverified
9ADLIK-MO-ResNet50-W4A4Top-1 Accuracy (%)77.88Unverified
10ADLIK-MO-ResNet50-W3A4Top-1 Accuracy (%)77.34Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_3MAP160,327.04Unverified
2DTQMAP0.79Unverified
#ModelMetricClaimedVerifiedStatus
1OutEffHop-Bert_basePerplexity6.3Unverified
2OutEffHop-Bert_basePerplexity6.21Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy98.13Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy92.92Unverified
#ModelMetricClaimedVerifiedStatus
1SSD ResNet50 V1 FPN 640x640MAP34.3Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-495.13Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-496.38Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_5All84,809,664Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy99.8Unverified