SOTAVerified

Quantization

Quantization is a promising technique to reduce the computation cost of neural network training, which can replace high-cost floating-point numbers (e.g., float32) with low-cost fixed-point numbers (e.g., int8/int16).

Source: Adaptive Precision Training: Quantify Back Propagation in Neural Networks with Fixed-point Numbers

Papers

Showing 35013550 of 4925 papers

TitleStatusHype
A White Paper on Neural Network Quantization0
Divergence Frontiers for Generative Models: Sample Complexity, Quantization Effects, and Frontier IntegralsCode0
Development of Quantized DNN Library for Exact Hardware Emulation0
Compositional Sketch SearchCode0
Energy-efficient Knowledge Distillation for Spiking Neural Networks0
FastICARL: Fast Incremental Classifier and Representation Learning with Efficient Budget Allocation in Audio Sensing Applications0
Neuroevolution-Enhanced Multi-Objective Optimization for Mixed-Precision Quantization0
On the Convergence of Differentially Private Federated Learning on Non-Lipschitz Objectives, and with Normalized Client Updates0
AI Enlightens Wireless Communication: Analyses, Solutions and Opportunities on CSI Feedback0
Decoupled Greedy Learning of CNNs for Synchronous and Asynchronous Distributed Learning0
Conditional COT-GAN for Video Prediction with Kernel SmoothingCode0
Verifying Quantized Neural Networks using SMT-Based Model Checking0
Cross-Modal Discrete Representation Learning0
SignalNet: A Low Resolution Sinusoid Decomposition and Estimation Network0
Fastening the Initial Access in 5G NR Sidelink for 6G V2X Networks0
Harnessing Unrecognizable Faces for Improving Face Recognition0
Theoretically Better and Numerically Faster Distributed Optimization with Smoothness-Aware Quantization Techniques0
Deep Unsupervised Learning for Joint Antenna Selection and Hybrid Beamforming0
Sigma-Delta and Distributed Noise-Shaping Quantization Methods for Random Fourier Features0
Differentiable Dynamic Quantization with Mixed Precision and Adaptive Resolution0
Granger Causality from Quantized Measurements0
Passive Beamforming Design for Intelligent Reflecting Surface Assisted MIMO Systems0
On the Distribution, Sparsity, and Inference-time Quantization of Attention Values in Transformers0
Gender Bias Amplification During Speed-Quality Optimization in Neural Machine Translation0
Full-Resolution Encoder-Decoder Networks with Multi-Scale Feature Fusion for Human Pose Estimation0
Weak target detection with multi-bit quantization in colocated MIMO radar0
Improved Convergence Rate for a Distributed Two-Time-Scale Gradient Method under Random Quantization0
Integer-Only Neural Network Quantization Scheme Based on Shift-Batch-NormalizationCode0
Low-complexity acoustic scene classification for multi-device audio: analysis of DCASE 2021 Challenge systemsCode0
HDRUNet: Single Image HDR Reconstruction with Denoising and DequantizationCode0
Towards Efficient Full 8-bit Integer DNN Online Training on Resource-limited Devices without Batch Normalization0
Quantization and Deployment of Deep Neural Networks on MicrocontrollersCode0
Dynamic Probabilistic Pruning: A general framework for hardware-constrained pruning at different granularities0
DTNN: Energy-efficient Inference with Dendrite Tree Inspired Neural Networks for Edge Vision Applications0
Denoising Noisy Neural Networks: A Bayesian Approach with CompensationCode0
DoStoVoQ: Doubly Stochastic Voronoi Vector Quantization SGD for Federated Learning0
Revisiting Multi-Codebook QuantizationCode0
EuclidNets: Combining hardware and architecture design for Efficient Inference and Training0
Model Compression0
Distribution Agnostic Symbolic Representations for Time Series Dimensionality Reduction and Online Anomaly DetectionCode0
Towards Quantized Model Parallelism for Graph-Augmented MLPs Based on Gradient-Free ADMM FrameworkCode0
BatchQuant: Quantized-for-all Architecture Search with Robust QuantizerCode0
Self-supervised Remote Sensing Images Change Detection at Pixel-level0
Is In-Domain Data Really Needed? A Pilot Study on Cross-Domain Calibration for Network Quantization0
Lightweight Compression of Intermediate Neural Network Features for Collaborative Intelligence0
FDDH: Fast Discriminative Discrete Hashing for Large-Scale Cross-Modal RetrievalCode0
Quantized Proximal Averaging Network for Analysis Sparse Coding0
Deep and Shallow Covariance Feature Quantization for 3D Facial Expression Recognition0
A CNN-based Prediction-Aware Quality Enhancement Framework for VVC0
Discrete representations in neural models of spoken languageCode0
Show:102550
← PrevPage 71 of 99Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1FQ-ViT (ViT-L)Top-1 Accuracy (%)85.03Unverified
2FQ-ViT (ViT-B)Top-1 Accuracy (%)83.31Unverified
3FQ-ViT (Swin-B)Top-1 Accuracy (%)82.97Unverified
4FQ-ViT (Swin-S)Top-1 Accuracy (%)82.71Unverified
5FQ-ViT (DeiT-B)Top-1 Accuracy (%)81.2Unverified
6FQ-ViT (Swin-T)Top-1 Accuracy (%)80.51Unverified
7FQ-ViT (DeiT-S)Top-1 Accuracy (%)79.17Unverified
8Xception W8A8Top-1 Accuracy (%)78.97Unverified
9ADLIK-MO-ResNet50-W4A4Top-1 Accuracy (%)77.88Unverified
10ADLIK-MO-ResNet50-W3A4Top-1 Accuracy (%)77.34Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_3MAP160,327.04Unverified
2DTQMAP0.79Unverified
#ModelMetricClaimedVerifiedStatus
1OutEffHop-Bert_basePerplexity6.3Unverified
2OutEffHop-Bert_basePerplexity6.21Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy98.13Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy92.92Unverified
#ModelMetricClaimedVerifiedStatus
1SSD ResNet50 V1 FPN 640x640MAP34.3Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-495.13Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-496.38Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_5All84,809,664Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy99.8Unverified