SOTAVerified

Benchmarking

Papers

Showing 39013910 of 5548 papers

TitleStatusHype
Benchmarking bias: Expanding clinical AI model card to incorporate bias reporting of social and non-social factors0
Benchmarking Bayesian Deep Learning on Diabetic Retinopathy Detection Tasks0
Official-NV: An LLM-Generated News Video Dataset for Multimodal Fake News Detection0
Off-policy Evaluation for Payments at Adyen0
Benchmarking Bayesian Causal Discovery Methods for Downstream Treatment Effect Estimation0
TransBench: Benchmarking Machine Translation for Industrial-Scale Applications0
OIBench: Benchmarking Strong Reasoning Models with Olympiad in Informatics0
IBB Traffic Graph Data: Benchmarking and Road Traffic Prediction Model0
Benchmarking Azerbaijani Neural Machine Translation0
Benchmarking a wide range of optimisers for solving the Fermi-Hubbard model using the variational quantum eigensolver0
Show:102550
← PrevPage 391 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified