SOTAVerified

Quantization

Quantization is a promising technique to reduce the computation cost of neural network training, which can replace high-cost floating-point numbers (e.g., float32) with low-cost fixed-point numbers (e.g., int8/int16).

Source: Adaptive Precision Training: Quantify Back Propagation in Neural Networks with Fixed-point Numbers

Papers

Showing 42014250 of 4925 papers

TitleStatusHype
Resource Allocation and Dithering of Bayesian Parameter Estimation Using Mixed-Resolution Data0
Resource Allocation for Compression-aided Federated Learning with High Distortion Rate0
Resource-aware Mixed-precision Quantization for Enhancing Deployability of Transformers for Time-series Forecasting on Embedded FPGAs0
Resource-efficient Deep Neural Networks for Automotive Radar Interference Mitigation0
Resource-Efficient Language Models: Quantization for Fast and Accessible Inference0
Resource-Efficient Neural Networks for Embedded Systems0
Resource Efficient Neural Networks Using Hessian Based Pruning0
Resource-Efficient Transformer Architecture: Optimizing Memory and Execution Time for Real-Time Applications0
ResQ: Residual Quantization for Video Perception0
Restorative Speech Enhancement: A Progressive Approach Using SE and Codec Modules0
Résumé abstractif à partir d'une transcription audio0
Rethinking Deconvolution for 2D Human Pose Estimation Light yet Accurate Model for Real-time Edge Computing0
Rethinking Diffusion for Text-Driven Human Motion Generation0
Rethinking Diffusion for Text-Driven Human Motion Generation: Redundant Representations, Evaluation, and Masked Autoregression0
Rethinking Discrete Tokens: Treating Them as Conditions for Continuous Autoregressive Image Synthesis0
Rethinking Few-Shot Medical Segmentation: A Vector Quantization View0
Rethinking Generalization in American Sign Language Prediction for Edge Devices with Extremely Low Memory Footprint0
Rethinking Mutual Information for Language Conditioned Skill Discovery on Imitation Learning0
Rethinking Neural Network Quantization0
Rethinking Post-Training Quantization: Introducing a Statistical Pre-Calibration Approach0
Retraining-Based Iterative Weight Quantization for Deep Neural Networks0
Reverse Link Analysis for Full-Duplex Cellular Networks with Low Resolution ADC/DAC0
Reversible Quantization Index Modulation for Static Deep Neural Network Watermarking0
Revisiting Data Augmentation in Model Compression: An Empirical and Comprehensive Study0
Revisiting DNN Training for Intermittently-Powered Energy-Harvesting Micro-Computers0
Revisiting Locality-Sensitive Binary Codes from Random Fourier Features0
Revisiting Quantization Error in Face Alignment0
Revisiting Uncertainty Estimation and Calibration of Large Language Models0
Revolutionizing Mobile Interaction: Enabling a 3 Billion Parameter GPT LLM on Mobile0
REx: Data-Free Residual Quantization Error Expansion0
RFI Mitigation for One-bit UWB Radar Systems0
Riemannian Manifold Embeddings for Straight-Through Estimator0
RIS-Assisted Energy Harvesting Gains for Bistatic Backscatter Networks: Performance Analysis and RIS Phase Optimization0
RIS-Assisted Self-Interference Mitigation for In-Band Full-Duplex Transceivers0
Risk Assessment for Connected Vehicles under Stealthy Attacks on Vehicle-to-Vehicle Networks0
Risk Bounds for Learning Multiple Components with Permutation-Invariant Losses0
RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression0
RLRC: Reinforcement Learning-based Recovery for Compressed Vision-Language-Action Models0
RMSMP: A Novel Deep Neural Network Quantization Framework with Row-wise Mixed Schemes and Multiple Precisions0
RobSurv: Vector Quantization-Based Multi-Modal Learning for Robust Cancer Survival Prediction0
Robust Anomaly-Based Ship Proposals Detection from Pan-sharpened High-Resolution Satellite Image0
Mixed-TD: Efficient Neural Network Accelerator with Layer-Specific Tensor DecompositionCode0
Compressing 3D Gaussian Splatting by Noise-Substituted Vector QuantizationCode0
Mixed-Precision Quantization for Deep Vision Models with Integer Quadratic ProgrammingCode0
BitMoD: Bit-serial Mixture-of-Datatype LLM AccelerationCode0
Model-Aware Deep Architectures for One-Bit Compressive Variational AutoencodingCode0
An Underexplored Dilemma between Confidence and Calibration in Quantized Neural NetworksCode0
Distributed dual vigilance fuzzy adaptive resonance theory learns online, retrieves arbitrarily-shaped clusters, and mitigates order dependenceCode0
QNCD: Quantization Noise Correction for Diffusion ModelsCode0
TransPL: VQ-Code Transition Matrices for Pseudo-Labeling of Time Series Unsupervised Domain AdaptationCode0
Show:102550
← PrevPage 85 of 99Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1FQ-ViT (ViT-L)Top-1 Accuracy (%)85.03Unverified
2FQ-ViT (ViT-B)Top-1 Accuracy (%)83.31Unverified
3FQ-ViT (Swin-B)Top-1 Accuracy (%)82.97Unverified
4FQ-ViT (Swin-S)Top-1 Accuracy (%)82.71Unverified
5FQ-ViT (DeiT-B)Top-1 Accuracy (%)81.2Unverified
6FQ-ViT (Swin-T)Top-1 Accuracy (%)80.51Unverified
7FQ-ViT (DeiT-S)Top-1 Accuracy (%)79.17Unverified
8Xception W8A8Top-1 Accuracy (%)78.97Unverified
9ADLIK-MO-ResNet50-W4A4Top-1 Accuracy (%)77.88Unverified
10ADLIK-MO-ResNet50-W3A4Top-1 Accuracy (%)77.34Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_3MAP160,327.04Unverified
2DTQMAP0.79Unverified
#ModelMetricClaimedVerifiedStatus
1OutEffHop-Bert_basePerplexity6.3Unverified
2OutEffHop-Bert_basePerplexity6.21Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy98.13Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy92.92Unverified
#ModelMetricClaimedVerifiedStatus
1SSD ResNet50 V1 FPN 640x640MAP34.3Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-495.13Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-496.38Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_5All84,809,664Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy99.8Unverified