SOTAVerified

Quantization

Quantization is a promising technique to reduce the computation cost of neural network training, which can replace high-cost floating-point numbers (e.g., float32) with low-cost fixed-point numbers (e.g., int8/int16).

Source: Adaptive Precision Training: Quantify Back Propagation in Neural Networks with Fixed-point Numbers

Papers

Showing 43514400 of 4925 papers

TitleStatusHype
LSQ++: Lower running time and higher recall in multi-codebook quantizationCode0
Nearly Lossless Adaptive Bit SwitchingCode0
Scaling Image Tokenizers with Grouped Spherical QuantizationCode0
Additive Noise Annealing and Approximation Properties of Quantized Neural NetworksCode0
Discrete Cross-Modal Alignment Enables Zero-Shot Speech TranslationCode0
NestQuant: Post-Training Integer-Nesting Quantization for On-Device DNNCode0
LRQ: Optimizing Post-Training Quantization for Large Language Models by Learning Low-Rank Weight-Scaling MatricesCode0
Who's a Good Boy? Reinforcing Canine Behavior in Real-Time using Machine LearningCode0
Unsupervised Speech Representation Pooling Using Vector QuantizationCode0
xCOMET-lite: Bridging the Gap Between Efficiency and Quality in Learned MT Evaluation MetricsCode0
LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural NetworksCode0
Discrete, compositional, and symbolic representations through attractor dynamicsCode0
NeUQI: Near-Optimal Uniform Quantization Parameter InitializationCode0
DiscQuant: A Quantization Method for Neural Networks Inspired by Discrepancy TheoryCode0
Neural Architecture Codesign for Fast Physics ApplicationsCode0
Quantization-Based Regularization for AutoencodersCode0
Digital and Hybrid Precoding Designs in Massive MIMO with Low-Resolution ADCsCode0
Quantization Effects on Neural Networks Perception: How would quantization change the perceptual field of vision models?Code0
Diffusion Models as Stochastic Quantization in Lattice Field TheoryCode0
Comprehensive SNN Compression Using ADMM Optimization and Activity RegularizationCode0
Comprehensive Comparisons of Uniform Quantization in Deep Image CompressionCode0
Extracting Usable Predictions from Quantized Networks through Uncertainty Quantification for OOD DetectionCode0
Neural Network Activation Quantization with Bitwise Information BottlenecksCode0
Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural NetworksCode0
Neural Network Assisted Lifting Steps For Improved Fully Scalable Lossy Image Compression in JPEG 2000Code0
Exploring Quantization and Mapping Synergy in Hardware-Aware Deep Neural Network AcceleratorsCode0
Quantization for OpenAI's Whisper Models: A Comparative AnalysisCode0
Unveiling Environmental Impacts of Large Language Model Serving: A Functional Unit ViewCode0
Low-Precision Stochastic Gradient Langevin DynamicsCode0
Quantization-Free Autoregressive Action TransformerCode0
Exploring Post-Training Quantization of Protein Language ModelsCode0
Differentiable Product Quantization for End-to-End Embedding CompressionCode0
Quantization Guided JPEG Artifact CorrectionCode0
Wideband and Entropy-Aware Deep Soft Bit QuantizationCode0
Exploring Embedding Methods in Binary Hyperdimensional Computing: A Case Study for Motor-Imagery based Brain-Computer InterfacesCode0
A Mean Field Theory of Quantized Deep Networks: The Quantization-Depth Trade-OffCode0
SC-VAE: Sparse Coding-based Variational Autoencoder with Learned ISTACode0
Neural Networks with Quantization ConstraintsCode0
Quantization in Spiking Neural NetworksCode0
Comprehensive Analysis of the Object Detection Pipeline on UAVsCode0
Low-Precision Random Fourier Features for Memory-Constrained Kernel ApproximationCode0
Low Precision Decentralized Distributed Training over IID and non-IID DataCode0
An Integrated Approach to Produce Robust Models with High EfficiencyCode0
Bees Local Phase Quantization Feature Selection for RGB-D Facial Expressions RecognitionCode0
Quantization NetworksCode0
NeuroSim V1.5: Improved Software Backbone for Benchmarking Compute-in-Memory Accelerators with Device and Circuit-level Non-idealitiesCode0
Low dimensional representation of multi-patient flow cytometry datasets using optimal transport for minimal residual disease detection in leukemiaCode0
Accelerating Large-Scale Inference with Anisotropic Vector QuantizationCode0
BdSLW60: A Word-Level Bangla Sign Language DatasetCode0
BatchQuant: Quantized-for-all Architecture Search with Robust QuantizerCode0
Show:102550
← PrevPage 88 of 99Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1FQ-ViT (ViT-L)Top-1 Accuracy (%)85.03Unverified
2FQ-ViT (ViT-B)Top-1 Accuracy (%)83.31Unverified
3FQ-ViT (Swin-B)Top-1 Accuracy (%)82.97Unverified
4FQ-ViT (Swin-S)Top-1 Accuracy (%)82.71Unverified
5FQ-ViT (DeiT-B)Top-1 Accuracy (%)81.2Unverified
6FQ-ViT (Swin-T)Top-1 Accuracy (%)80.51Unverified
7FQ-ViT (DeiT-S)Top-1 Accuracy (%)79.17Unverified
8Xception W8A8Top-1 Accuracy (%)78.97Unverified
9ADLIK-MO-ResNet50-W4A4Top-1 Accuracy (%)77.88Unverified
10ADLIK-MO-ResNet50-W3A4Top-1 Accuracy (%)77.34Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_3MAP160,327.04Unverified
2DTQMAP0.79Unverified
#ModelMetricClaimedVerifiedStatus
1OutEffHop-Bert_basePerplexity6.3Unverified
2OutEffHop-Bert_basePerplexity6.21Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy98.13Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy92.92Unverified
#ModelMetricClaimedVerifiedStatus
1SSD ResNet50 V1 FPN 640x640MAP34.3Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-495.13Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-496.38Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_5All84,809,664Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy99.8Unverified