SOTAVerified

Quantization

Quantization is a promising technique to reduce the computation cost of neural network training, which can replace high-cost floating-point numbers (e.g., float32) with low-cost fixed-point numbers (e.g., int8/int16).

Source: Adaptive Precision Training: Quantify Back Propagation in Neural Networks with Fixed-point Numbers

Papers

Showing 16011650 of 4925 papers

TitleStatusHype
Pyramid Vector Quantization for LLMs0
Self-calibration for Language Model Quantization and Pruning0
Can General-Purpose Large Language Models Generalize to English-Thai Machine Translation ?0
Continuous Speech Synthesis using per-token Latent Diffusion0
Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces0
LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec0
Large Deviation Upper Bounds and Improved MSE Rates of Nonlinear SGD: Heavy-tailed Noise and Power of Symmetry0
SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training0
Lossless KV Cache Compression to 2%0
Understanding the Difficulty of Low-Precision Post-Training Quantization for LLMs0
Evaluating Quantized Large Language Models for Code Generation on Low-Resource Language BenchmarksCode0
AsymKV: Enabling 1-Bit Quantization of KV Cache with Layer-Wise Asymmetric Quantization Configurations0
Nonlinear Stochastic Gradient Descent and Heavy-tailed Noise: A Unified Framework and High-probability Guarantees0
Harnessing Your DRAM and SSD for Sustainable and Accessible LLM Inference with Mixed-Precision and Multi-level Caching0
Progressive Mixed-Precision Decoding for Efficient LLM Inference0
A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models0
Optimal Quantization for Matrix MultiplicationCode0
DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech0
COMET: Towards Partical W4A4KV4 LLMs Serving0
ERVQ: Enhanced Residual Vector Quantization with Intra-and-Inter-Codebook Optimization for Neural Audio Codecs0
Channel-Wise Mixed-Precision Quantization for Large Language Models0
FairGLVQ: Fairness in Partition-Based ClassificationCode0
DAQ: Density-Aware Post-Training Weight-Only Quantization For LLMsCode0
QSpec: Speculative Decoding with Complementary Quantization Schemes0
Efficiera Residual Networks: Hardware-Friendly Fully Binary Weight with 2-bit Activation Model Achieves Practical ImageNet AccuracyCode0
Scaling Laws for Post Training Quantized Large Language Models0
Real-Time Stress Detection via Photoplethysmogram Signals: Implementation of a Combined Continuous Wavelet Transform and Convolutional Neural Network on Resource-Constrained Microcontrollers0
SLaNC: Static LayerNorm Calibration0
Gaussian Mixture Vector Quantization with Aggregated Categorical Posterior0
GALA: Geometry-Aware Local Adaptive Grids for Detailed 3D Generation0
Gradient-Free Neural Network Training on the Edge0
PrivQuant: Communication-Efficient Private Inference with Quantized Network/Protocol Co-Optimization0
ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification0
QEFT: Quantization for Efficient Fine-Tuning of LLMsCode0
DeltaDQ: Ultra-High Delta Compression for Fine-Tuned LLMs via Group-wise Dropout and Separate Quantization0
Scalable Representation Learning for Multimodal Tabular Transactions0
ACCEPT: Adaptive Codebook for Composite and Efficient Prompt TuningCode0
M^2-ViT: Accelerating Hybrid Vision Transformers with Two-Level Mixed Quantization0
MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete DiffusionCode0
DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation0
CrossQuant: A Post-Training Quantization Method with Smaller Quantization Kernel for Precise Large Language Model Compression0
Perceptual Quality Assessment of Trisoup-Lifting Encoded 3D Point CloudsCode0
QuAILoRA: Quantization-Aware Initialization for LoRA0
Scaling Laws for Mixed quantization in Large Language Models0
JPEG Inspired Deep LearningCode0
Covering Numbers for Deep ReLU Networks with Applications to Function Approximation and Nonparametric Regression0
Gesture2Text: A Generalizable Decoder for Word-Gesture Keyboards in XR Through Trajectory Coarse Discretization and Pre-training0
Accelerating Error Correction Code TransformersCode0
QERA: an Analytical Framework for Quantization Error Reconstruction0
Variable Bitrate Residual Vector Quantization for Audio Coding0
Show:102550
← PrevPage 33 of 99Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1FQ-ViT (ViT-L)Top-1 Accuracy (%)85.03Unverified
2FQ-ViT (ViT-B)Top-1 Accuracy (%)83.31Unverified
3FQ-ViT (Swin-B)Top-1 Accuracy (%)82.97Unverified
4FQ-ViT (Swin-S)Top-1 Accuracy (%)82.71Unverified
5FQ-ViT (DeiT-B)Top-1 Accuracy (%)81.2Unverified
6FQ-ViT (Swin-T)Top-1 Accuracy (%)80.51Unverified
7FQ-ViT (DeiT-S)Top-1 Accuracy (%)79.17Unverified
8Xception W8A8Top-1 Accuracy (%)78.97Unverified
9ADLIK-MO-ResNet50-W4A4Top-1 Accuracy (%)77.88Unverified
10ADLIK-MO-ResNet50-W3A4Top-1 Accuracy (%)77.34Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_3MAP160,327.04Unverified
2DTQMAP0.79Unverified
#ModelMetricClaimedVerifiedStatus
1OutEffHop-Bert_basePerplexity6.3Unverified
2OutEffHop-Bert_basePerplexity6.21Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy98.13Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy92.92Unverified
#ModelMetricClaimedVerifiedStatus
1SSD ResNet50 V1 FPN 640x640MAP34.3Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-495.13Unverified
#ModelMetricClaimedVerifiedStatus
1TAR @ FAR=1e-496.38Unverified
#ModelMetricClaimedVerifiedStatus
13DCNN_VIVA_5All84,809,664Unverified
#ModelMetricClaimedVerifiedStatus
1Accuracy99.8Unverified