SOTAVerified

Quantization

Quantization is a promising technique to reduce the computation cost of neural network training, which can replace high-cost floating-point numbers (e.g., float32) with low-cost fixed-point numbers (e.g., int8/int16).

Source: Adaptive Precision Training: Quantify Back Propagation in Neural Networks with Fixed-point Numbers

Papers

Showing 35513600 of 4925 papers

TitleStatusHype
A Low Memory Footprint Quantized Neural Network for Depth Completion of Very Sparse Time-of-Flight Depth Maps0
AlphaTuning: Quantization-Aware Parameter-Efficient Adaptation of Large-Scale Pre-Trained Language Models0
Alternate Learning based Sparse Semantic Communications for Visual Transmission0
Alternating Co-Quantization for Cross-Modal Hashing0
Alternating Direction Method of Multipliers for Quantization0
Alternating Multi-bit Quantization for Recurrent Neural Networks0
A Markovian Model-Driven Deep Learning Framework for Massive MIMO CSI Feedback0
A Memory-Efficient Learning Framework for SymbolLevel Precoding with Quantized NN Weights0
Amenable Sparse Network Investigator0
A method of using RSVD in residual calculation of LowBit GEMM0
A Modular Neural Network Based Deep Learning Approach for MIMO Signal Detection0
A multi-layer image representation using Regularized Residual Quantization: application to compression and denoising0
AMXFP4: Taming Activation Outliers with Asymmetric Microscaling Floating-Point for 4-bit LLM Inference0
An Adaptive Device-Edge Co-Inference Framework Based on Soft Actor-Critic0
An adaptive random experiment design method for engineering experiment0
An Adaptive Statistical Non-uniform Quantizer for Detail Wavelet Components in Lossy JPEG2000 Image Compression0
An Additive Latent Feature Model for Transparent Object Recognition0
Analog-digital Scheduling for Federated Learning: A Communication-Efficient Approach0
Analog Spiking Neuron in CMOS 28 nm Towards Large-Scale Neuromorphic Processors0
Analysis and Optimization for RIS-Aided Multi-Pair Communications Relying on Statistical CSI0
Analysis of Null Related Beampattern Measures and Signal Quantization Effects for Linear Differential Microphone Arrays0
Analysis of Oversampling in Uplink Massive MIMO-OFDM with Low-Resolution ADCs0
Analysis of Quantization on MLP-based Vision Models0
Analysis of Quantized Models0
Analysis of the influence of final resolution on ADC accuracy0
Analytical aspects of non-differentiable neural networks0
Analyzing Compression Techniques for Computer Vision0
Analyzing Quantization in TVM0
An Analysis on Quantizing Diffusion Transformers0
An anatomy-based V1 model: Extraction of Low-level Features, Reduction of distortion and a V1-inspired SOM0
An Application of Backpropagation Artificial Neural Network Method for Measuring The Severity of Osteoarthritis0
An Automata-Theoretic Approach to Synthesizing Binarized Neural Networks0
An Efficient Index for Visual Search in Appearance-based SLAM0
An Efficient Network with Novel Quantization Designed for Massive MIMO CSI Feedback0
An Embedded Iris Recognition System Optimization using Dynamically ReconfigurableDecoder with LDPC Codes0
An Empirical Study of Low Precision Quantization for TinyML0
An Empirical Study towards Characterizing Deep Learning Development and Deployment across Different Frameworks and Platforms0
An End-to-End DNN Inference Framework for the SpiNNaker2 Neuromorphic MPSoC0
A "Network Pruning Network" Approach to Deep Model Compression0
A new heuristic algorithm for fast k-segmentation0
A New Learning Method for Inference Accuracy, Core Occupation, and Performance Co-optimization on TrueNorth Chip0
A New Old Idea: Beam-Steering Reflectarrays for Efficient Sub-THz Multiuser MIMO0
An Exact Quantized Decentralized Gradient Descent Algorithm0
An Experimental Study: Assessing the Combined Framework of WavLM and BEST-RQ for Text-to-Speech Synthesis0
An Experimental Study of Reduced-Voltage Operation in Modern FPGAs for Neural Network Acceleration0
An Extra RMSNorm is All You Need for Fine Tuning to 1.58 Bits0
Angle Estimation of a Single Source with Massive Uniform Circular Arrays0
Angular Quantization-based Binary Codes for Fast Similarity Search0
An Implementation of Vector Quantization using the Genetic Algorithm Approach0
An Improved BKW Algorithm for LWE with Applications to Cryptography and Lattices0
Show:102550
← PrevPage 72 of 99Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1FQ-ViT (ViT-L)Top-1 Accuracy (%)85.03Unverified
2FQ-ViT (ViT-B)Top-1 Accuracy (%)83.31Unverified
3FQ-ViT (Swin-B)Top-1 Accuracy (%)82.97Unverified
4FQ-ViT (Swin-S)Top-1 Accuracy (%)82.71Unverified
5FQ-ViT (DeiT-B)Top-1 Accuracy (%)81.2Unverified
6FQ-ViT (Swin-T)Top-1 Accuracy (%)80.51Unverified
7FQ-ViT (DeiT-S)Top-1 Accuracy (%)79.17Unverified
8Xception W8A8Top-1 Accuracy (%)78.97Unverified
9ADLIK-MO-ResNet50-W4A4Top-1 Accuracy (%)77.88Unverified
10ADLIK-MO-ResNet50-W3A4Top-1 Accuracy (%)77.34Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_3MAP160,327.04Unverified
2DTQMAP0.79Unverified
#ModelMetricClaimedVerifiedStatus
1OutEffHop-Bert_basePerplexity6.3Unverified
2OutEffHop-Bert_basePerplexity6.21Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy98.13Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy92.92Unverified
#ModelMetricClaimedVerifiedStatus
1SSD ResNet50 V1 FPN 640x640MAP34.3Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-495.13Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-496.38Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_5All84,809,664Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy99.8Unverified