SOTAVerified

Benchmarking

Papers

Showing 1711-1720 of 5548 papers

Title | Status | Hype
Benchmarking Bias in Large Language Models during Role-Playing | | 0
A New Approach for Image Authentication Framework for Media Forensics Purpose | | 0
Abnormality-Driven Representation Learning for Radiology Imaging | | 0
Determinants of Performance in European ATM -- How to Analyze a Diverse Industry | | 0
Benchmarking bias: Expanding clinical AI model card to incorporate bias reporting of social and non-social factors | | 0
An Evolutionary Algorithm For the Vehicle Routing Problem with Drones with Interceptions | | 0
Benchmarking Bayesian Deep Learning on Diabetic Retinopathy Detection Tasks | | 0
Benchmarking Bayesian Causal Discovery Methods for Downstream Treatment Effect Estimation | | 0
An evaluation framework for comparing causal inference models | | 0
Benchmarking Azerbaijani Neural Machine Translation | | 0
Page 172 of 555

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified