SOTAVerified

Quantization

Quantization is a promising technique to reduce the computation cost of neural network training, which can replace high-cost floating-point numbers (e.g., float32) with low-cost fixed-point numbers (e.g., int8/int16).

Source: Adaptive Precision Training: Quantify Back Propagation in Neural Networks with Fixed-point Numbers

Papers

Showing 776800 of 4925 papers

TitleStatusHype
Generating Diverse Structure for Image Inpainting With Hierarchical VQ-VAECode1
APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision TransformersCode1
Raw Image Reconstruction with Learned Compact MetadataCode1
Matching-oriented Embedding Quantization For Ad-hoc RetrievalCode1
Generative Adversarial Super-Resolution at the Edge with Knowledge DistillationCode1
Generative Low-bitwidth Data Free QuantizationCode1
Genetic Quantization-Aware Approximation for Non-Linear Operations in TransformersCode1
Designing Large Foundation Models for Efficient Training and Inference: A SurveyCode1
Confounding Tradeoffs for Neural Network QuantizationCode1
Making DensePose fast and lightCode1
Context-aware Communication for Multi-agent Reinforcement LearningCode1
MagR: Weight Magnitude Reduction for Enhancing Post-Training QuantizationCode1
APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU Tensor CoresCode1
Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech TranslationCode1
Gradient-based Automatic Mixed Precision Quantization for Neural Networks On-ChipCode1
COMQ: A Backpropagation-Free Algorithm for Post-Training QuantizationCode1
Algorithm-hardware Co-design for Deformable ConvolutionCode1
CondiQuant: Condition Number Based Low-Bit Quantization for Image Super-ResolutionCode1
RepCodec: A Speech Representation Codec for Speech TokenizationCode1
LSQ+: Improving low-bit quantization through learnable offsets and better initializationCode1
Conditional Coding and Variable Bitrate for Practical Learned Video CodingCode1
Guided Hybrid Quantization for Object detection in Multimodal Remote Sensing Imagery via One-to-one Self-teachingCode1
Resolution Switchable Networks for Runtime Efficient Image RecognitionCode1
M^3GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and GenerationCode1
Machine Unlearning of Federated ClustersCode1
Show:102550
← PrevPage 32 of 197Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1FQ-ViT (ViT-L)Top-1 Accuracy (%)85.03Unverified
2FQ-ViT (ViT-B)Top-1 Accuracy (%)83.31Unverified
3FQ-ViT (Swin-B)Top-1 Accuracy (%)82.97Unverified
4FQ-ViT (Swin-S)Top-1 Accuracy (%)82.71Unverified
5FQ-ViT (DeiT-B)Top-1 Accuracy (%)81.2Unverified
6FQ-ViT (Swin-T)Top-1 Accuracy (%)80.51Unverified
7FQ-ViT (DeiT-S)Top-1 Accuracy (%)79.17Unverified
8Xception W8A8Top-1 Accuracy (%)78.97Unverified
9ADLIK-MO-ResNet50-W4A4Top-1 Accuracy (%)77.88Unverified
10ADLIK-MO-ResNet50-W3A4Top-1 Accuracy (%)77.34Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_3MAP160,327.04Unverified
2DTQMAP0.79Unverified
#ModelMetricClaimedVerifiedStatus
1OutEffHop-Bert_basePerplexity6.3Unverified
2OutEffHop-Bert_basePerplexity6.21Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy98.13Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy92.92Unverified
#ModelMetricClaimedVerifiedStatus
1SSD ResNet50 V1 FPN 640x640MAP34.3Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-495.13Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-496.38Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_5All84,809,664Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy99.8Unverified