SOTAVerified

Quantization

Quantization is a promising technique to reduce the computation cost of neural network training, which can replace high-cost floating-point numbers (e.g., float32) with low-cost fixed-point numbers (e.g., int8/int16).

Source: Adaptive Precision Training: Quantify Back Propagation in Neural Networks with Fixed-point Numbers

Papers

Showing 48514900 of 4925 papers

TitleStatusHype
Hybrid Scene Compression for Visual Localization0
Hybrid Spatial-Temporal Entropy Modelling for Neural Video Compression0
Hybrid Weight Representation: A Quantization Method Represented with Ternary and Sparse-Large Weights0
Hyperbolic Residual Quantization: Discrete Representations for Data with Latent Hierarchies0
HyperDepth: Learning Depth From Structured Light Without Matching0
Hyperdrive: A Multi-Chip Systolically Scalable Binary-Weight CNN Inference Engine0
Hyperpoints and Fine Vocabularies for Large-Scale Location Recognition0
Hyperspectral recovery from RGB images using Gaussian Processes0
Hyperspherical Loss-Aware Ternary Quantization0
Hyperspherical Quantization: Toward Smaller and More Accurate Models0
Hyperstroke: A Novel High-quality Stroke Representation for Assistive Artistic Drawing0
HyperVQ: MLR-based Vector Quantization in Hyperbolic Space0
ICQ: A Quantization Scheme for Best-Arm Identification Over Bit-Constrained Channels0
IDF++: Analyzing and Improving Integer Discrete Flows for Lossless Compression0
IDKM: Memory Efficient Neural Network Quantization via Implicit, Differentiable k-Means0
IFQ-Net: Integrated Fixed-point Quantization Networks for Embedded Vision0
I-LLM: Efficient Integer-Only Inference for Fully-Quantized Low-Bit Large Language Models0
Illumination and Temperature-Aware Multispectral Networks for Edge-Computing-Enabled Pedestrian Detection0
ILMPQ : An Intra-Layer Multi-Precision Deep Neural Network Quantization framework for FPGA0
Image Compression Based on Compressive Sensing: End-to-End Comparison with JPEG0
Image Compression using only Attention based Neural Networks0
Image Compression with Product Quantized Masked Image Modeling0
Image De-Quantization Using Generative Models as Priors0
Image processing in DNA0
Image Shadow Removal Using End-to-End Deep Convolutional Neural Networks0
Image Splicing Detection, Localization and Attribution via JPEG Primary Quantization Matrix Estimation and Clustering0
Image Storage on Synthetic DNA Using Autoencoders0
Image storage on synthetic DNA using compressive autoencoders and DNA-adapted entropy coders0
IM-Loss: Information Maximization Loss for Spiking Neural Networks0
Impact of Low-bitwidth Quantization on the Adversarial Robustness for Embedded Neural Networks0
Impact of ML Optimization Tactics on Greener Pre-Trained ML Models0
Implementation and Evaluation of Physical Layer Key Generation on SDR based LoRa Platform0
Implementation of a framework for deploying AI inference engines in FPGAs0
Implicit Dual-domain Convolutional Network for Robust Color Image Compression Artifact Reduction0
Implicit Neural Representations for Image Compression0
Improved Convergence Rate for a Distributed Two-Time-Scale Gradient Method under Random Quantization0
Improved Quantization Strategies for Managing Heavy-tailed Gradients in Distributed Learning0
Improved Residual Vector Quantization for High-dimensional Approximate Nearest Neighbor Search0
Improved training of binary networks for human pose estimation and image recognition0
Improving Acoustic Scene Classification in Low-Resource Conditions0
Improving Adversarial Robustness in Weight-quantized Neural Networks0
Improving Approximate Optimal Transport Distances using Quantization0
Improving Autoregressive Image Generation through Coarse-to-Fine Token Prediction0
Improving Bilayer Product Quantization for Billion-Scale Approximate Nearest Neighbors in High Dimensions0
Improving Block-Wise LLM Quantization by 4-bit Block-Wise Optimal Float (BOF4): Analysis and Variations0
Improving Conversational Abilities of Quantized Large Language Models via Direct Preference Alignment0
Improving K-Nearest Neighbor Efficacy for Farsi Text Classification0
Improving Low-Precision Network Quantization via Bin Regularization0
Improving LPCNet-based Text-to-Speech with Linear Prediction-structured Mixture Density Network0
Improving Multi-generation Robustness of Learned Image Compression0
Show:102550
← PrevPage 98 of 99Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1FQ-ViT (ViT-L)Top-1 Accuracy (%)85.03Unverified
2FQ-ViT (ViT-B)Top-1 Accuracy (%)83.31Unverified
3FQ-ViT (Swin-B)Top-1 Accuracy (%)82.97Unverified
4FQ-ViT (Swin-S)Top-1 Accuracy (%)82.71Unverified
5FQ-ViT (DeiT-B)Top-1 Accuracy (%)81.2Unverified
6FQ-ViT (Swin-T)Top-1 Accuracy (%)80.51Unverified
7FQ-ViT (DeiT-S)Top-1 Accuracy (%)79.17Unverified
8Xception W8A8Top-1 Accuracy (%)78.97Unverified
9ADLIK-MO-ResNet50-W4A4Top-1 Accuracy (%)77.88Unverified
10ADLIK-MO-ResNet50-W3A4Top-1 Accuracy (%)77.34Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_3MAP160,327.04Unverified
2DTQMAP0.79Unverified
#ModelMetricClaimedVerifiedStatus
1OutEffHop-Bert_basePerplexity6.3Unverified
2OutEffHop-Bert_basePerplexity6.21Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy98.13Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy92.92Unverified
#ModelMetricClaimedVerifiedStatus
1SSD ResNet50 V1 FPN 640x640MAP34.3Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-495.13Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-496.38Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_5All84,809,664Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy99.8Unverified