SOTAVerified

Quantization

Quantization is a promising technique to reduce the computation cost of neural network training, which can replace high-cost floating-point numbers (e.g., float32) with low-cost fixed-point numbers (e.g., int8/int16).

Source: Adaptive Precision Training: Quantify Back Propagation in Neural Networks with Fixed-point Numbers

Papers

Showing 41014150 of 4925 papers

TitleStatusHype
DCNGAN: A Deformable Convolutional-Based GAN with QP Adaptation for Perceptual Quality Enhancement of Compressed Video0
DC-PCN: Point Cloud Completion Network with Dual-Codebook Guided Quantization0
Discrete Cosine Transform Based Decorrelated Attention for Vision Transformers0
Decentralized Low-Rank Fine-Tuning of Large Language Models0
Decentralized Optimization on Compact Submanifolds by Quantized Riemannian Gradient Tracking0
Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression0
Decomposing Normal and Abnormal Features of Medical Images into Discrete Latent Codes for Content-Based Image Retrieval0
DeCoR: Defy Knowledge Forgetting by Predicting Earlier Audio Codes0
Decoupled Greedy Learning of CNNs for Synchronous and Asynchronous Distributed Learning0
DEED: A General Quantization Scheme for Communication Efficiency in Bits0
Deep activity propagation via weight initialization in spiking neural networks0
Deep and Shallow Covariance Feature Quantization for 3D Facial Expression Recognition0
Deep Asymmetric Hashing with Dual Semantic Regression and Class Structure Quantization0
Deep Attentive Generative Adversarial Network for Photo-Realistic Image De-Quantization0
Deep Autoencoder-based Z-Interference Channels with Perfect and Imperfect CSI0
Deep Compression of Neural Networks for Fault Detection on Tennessee Eastman Chemical Processes0
Deep Conditional Measure Quantization0
Deep Convolutional Compression for Massive MIMO CSI Feedback0
Deep data compression for approximate ultrasonic image formation0
DeepEvolution: A Search-Based Testing Approach for Deep Neural Networks0
DEEPEYE: A Compact and Accurate Video Comprehension at Terminal Devices Compressed with Quantization and Tensorization0
Deep Fusion Model for Brain Tumor Classification Using Fine-Grained Gradient Preservation0
DeepGEMM: Accelerated Ultra Low-Precision Inference on CPU Architectures using Lookup Tables0
Deep Hashing With Minimal-Distance-Separated Hash Centers0
Deep Hashing with Triplet Quantization Loss0
Deep Hierarchy Quantization Compression algorithm based on Dynamic Sampling0
DeepHQ: Learned Hierarchical Quantizer for Progressive Deep Image Coding0
DeepHunter: Hunting Deep Neural Network Defects via Coverage-Guided Fuzzing0
Deep Imbalanced Regression via Hierarchical Classification Adjustment0
Deep-Learned Compression for Radio-Frequency Signal Classification0
Deep Learning-Aided Perturbation Model-Based Fiber Nonlinearity Compensation0
Deep-Learning-Based Channel Estimation for Distributed MIMO with 1-bit Radio-Over-Fiber Fronthaul0
Deep Learning based Fast and Accurate Beamforming for Millimeter-Wave Systems0
Deep Learning-based Image Compression with Trellis Coded Quantization0
Deep Learning-Based Intra Mode Derivation for Versatile Video Coding0
Knowledge Distillation-aided End-to-End Learning for Linear Precoding in Multiuser MIMO Downlink Systems with Finite-Rate Feedback0
Deep Learning-Based Real-Time Quality Control of Standard Video Compression for Live Streaming0
Deep Learning for Distributed Optimization: Applications to Wireless Resource Management0
Deep Learning for Estimation and Pilot Signal Design in Few-Bit Massive MIMO Systems0
Deep Learning for Radio Resource Allocation with Diverse Quality-of-Service Requirements in 5G0
Deep Learning Inference on Embedded Devices: Fixed-Point vs Posit0
Deep Learning Methods for Joint Optimization of Beamforming and Fronthaul Quantization in Cloud Radio Access Networks0
Deep Learning Model Acceleration and Optimization Strategies for Real-Time Recommendation Systems0
Deep learning model compression using network sensitivity and gradients0
Deep Learning to Ternary Hash Codes by Continuation0
DeepliteRT: Computer Vision at the Edge0
Deep Model Compression Via Two-Stage Deep Reinforcement Learning0
Deep Multi-modality Soft-decoding of Very Low Bit-rate Face Videos0
Deep Multiple Description Coding by Learning Scalar Quantization0
Deep Neural Network-Based Quantized Signal Reconstruction for DOA Estimation0
Show:102550
← PrevPage 83 of 99Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1FQ-ViT (ViT-L)Top-1 Accuracy (%)85.03Unverified
2FQ-ViT (ViT-B)Top-1 Accuracy (%)83.31Unverified
3FQ-ViT (Swin-B)Top-1 Accuracy (%)82.97Unverified
4FQ-ViT (Swin-S)Top-1 Accuracy (%)82.71Unverified
5FQ-ViT (DeiT-B)Top-1 Accuracy (%)81.2Unverified
6FQ-ViT (Swin-T)Top-1 Accuracy (%)80.51Unverified
7FQ-ViT (DeiT-S)Top-1 Accuracy (%)79.17Unverified
8Xception W8A8Top-1 Accuracy (%)78.97Unverified
9ADLIK-MO-ResNet50-W4A4Top-1 Accuracy (%)77.88Unverified
10ADLIK-MO-ResNet50-W3A4Top-1 Accuracy (%)77.34Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_3MAP160,327.04Unverified
2DTQMAP0.79Unverified
#ModelMetricClaimedVerifiedStatus
1OutEffHop-Bert_basePerplexity6.3Unverified
2OutEffHop-Bert_basePerplexity6.21Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy98.13Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy92.92Unverified
#ModelMetricClaimedVerifiedStatus
1SSD ResNet50 V1 FPN 640x640MAP34.3Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-495.13Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-496.38Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_5All84,809,664Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy99.8Unverified