SOTAVerified

Benchmarking

Papers

Showing 4311-4320 of 5548 papers

Title | Status | Hype
Efficient, Uncertainty-based Moderation of Neural Networks Text Classifiers | Code | 0
pmuBAGE: The Benchmarking Assortment of Generated PMU Data for Power System Events -- Part I: Overview and Results | Code | 0
Intelligence at the Extreme Edge: A Survey on Reformable TinyML | - | 0
Unitail: Detecting, Reading, and Matching in Retail Scene | - | 0
Assessing the risk of re-identification arising from an attack on anonymised data | - | 0
Is Word Error Rate a good evaluation metric for Speech Recognition in Indic Languages? | - | 0
To Find Waldo You Need Contextual Cues: Debiasing Who's Waldo | Code | 0
Treatment Learning Causal Transformer for Noisy Image Classification | - | 0
A Unified Study of Machine Learning Explanation Evaluation Metrics | - | 0
Benchmarking Deep AUROC Optimization: Loss Functions and Algorithmic Choices | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | - | Unverified