SOTAVerified

Quantization

Quantization is a promising technique to reduce the computation cost of neural network training, which can replace high-cost floating-point numbers (e.g., float32) with low-cost fixed-point numbers (e.g., int8/int16).

Source: Adaptive Precision Training: Quantify Back Propagation in Neural Networks with Fixed-point Numbers

Papers

Showing 39514000 of 4925 papers

TitleStatusHype
QGAN: Quantized Generative Adversarial Networks0
QGAN: Quantize Generative Adversarial Networks to Extreme low-bits0
QGen: On the Ability to Generalize in Quantization Aware Training0
QKD: Quantization-aware Knowledge Distillation0
Q-Learning for MDPs with General Spaces: Convergence and Near Optimality via Quantization under Weak Continuity0
Q-Learning for Stochastic Control under General Information Structures and Non-Markovian Environments0
Q-LIC: Quantizing Learned Image Compression with Channel Splitting0
QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation0
Q-MambaIR: Accurate Quantized Mamba for Efficient Image Restoration0
QMamba: Post-Training Quantization for Vision State Space Models0
Q-matrix Unaware Double JPEG Detection using DCT-Domain Deep BiLSTM Network0
QMGeo: Differentially Private Federated Learning via Stochastic Quantization with Mixed Truncated Geometric Distribution0
QNNRepair: Quantized Neural Network Repair0
QNNVerifier: A Tool for Verifying Neural Networks using SMT-Based Model Checking0
Q-PETR: Quant-aware Position Embedding Transformation for Multi-View 3D Object Detection0
QPP: Real-Time Quantization Parameter Prediction for Deep Neural Networks0
QPruner: Probabilistic Decision Quantization for Structured Pruning in Large Language Models0
Q-Rater: Non-Convex Optimization for Post-Training Uniform Quantization0
Qrazor: Reliable and effortless 4-bit llm quantization by significant data razoring0
QReg: On Regularization Effects of Quantization0
QRMODA and BRMODA: Novel Models for Face Recognition Accuracy in Computer Vision Systems with Adapted Video Streams0
Qronos: Correcting the Past by Shaping the Future... in Post-Training Quantization0
QS4D: Quantization-aware training for efficient hardware deployment of structured state-space sequential models0
Q-SAM2: Accurate Quantization for Segment Anything Model 20
Q-SHED: Distributed Optimization at the Edge via Hessian Eigenvectors Quantization0
QSID-MPC: Model Predictive Control with System Identification from Quantized Data0
Q-SNNs: Quantized Spiking Neural Networks0
Qsparse-local-SGD: Distributed SGD with Quantization, Sparsification, and Local Computations0
QSpec: Speculative Decoding with Complementary Quantization Schemes0
Q-SpiNN: A Framework for Quantizing Spiking Neural Networks0
QSViT: A Methodology for Quantizing Spiking Vision Transformers0
QTI Submission to DCASE 2021: residual normalization for device-imbalanced acoustic scene classification with efficient design0
Qua^2SeDiMo: Quantifiable Quantization Sensitivity of Diffusion Models0
Quadapter: Adapter for GPT-2 Quantization0
QuAILoRA: Quantization-Aware Initialization for LoRA0
QuaLA-MiniLM: a Quantized Length Adaptive MiniLM0
Qualitative detection of oil adulteration with machine learning approaches0
Quality Adaptive Low-Rank Based JPEG Decoding with Applications0
Quality Scalable Quantization Methodology for Deep Learning on Edge0
QuanCrypt-FL: Quantized Homomorphic Encryption with Pruning for Secure Federated Learning0
QUANOS- Adversarial Noise Sensitivity Driven Hybrid Quantization of Neural Networks0
QuantEase: Optimization-based Quantization for Language Models0
QuantFormer: Learning to Quantize for Neural Activity Forecasting in Mouse Visual Cortex0
Quantifying the Capabilities of LLMs across Scale and Precision0
Quantifying the Capacity Gains in Coarsely Quantized SISO Systems with Nonlinear Analog Operators0
Quantifying the Complexity of Standard Benchmarking Datasets for Long-Term Human Trajectory Prediction0
Quantisation-aware Precoding for MU-MIMO with Limited-capacity Fronthaul0
Quantitative Analysis of Deeply Quantized Tiny Neural Networks Robust to Adversarial Attacks0
Quantitative Error Feedback for Quantization Noise Reduction of Filtering over Graphs0
Quantity over Quality: Dithered Quantization for Compressive Radar Systems0
Show:102550
← PrevPage 80 of 99Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1FQ-ViT (ViT-L)Top-1 Accuracy (%)85.03Unverified
2FQ-ViT (ViT-B)Top-1 Accuracy (%)83.31Unverified
3FQ-ViT (Swin-B)Top-1 Accuracy (%)82.97Unverified
4FQ-ViT (Swin-S)Top-1 Accuracy (%)82.71Unverified
5FQ-ViT (DeiT-B)Top-1 Accuracy (%)81.2Unverified
6FQ-ViT (Swin-T)Top-1 Accuracy (%)80.51Unverified
7FQ-ViT (DeiT-S)Top-1 Accuracy (%)79.17Unverified
8Xception W8A8Top-1 Accuracy (%)78.97Unverified
9ADLIK-MO-ResNet50-W4A4Top-1 Accuracy (%)77.88Unverified
10ADLIK-MO-ResNet50-W3A4Top-1 Accuracy (%)77.34Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_3MAP160,327.04Unverified
2DTQMAP0.79Unverified
#ModelMetricClaimedVerifiedStatus
1OutEffHop-Bert_basePerplexity6.3Unverified
2OutEffHop-Bert_basePerplexity6.21Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy98.13Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy92.92Unverified
#ModelMetricClaimedVerifiedStatus
1SSD ResNet50 V1 FPN 640x640MAP34.3Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-495.13Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-496.38Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_5All84,809,664Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy99.8Unverified