SOTAVerified

Benchmarking

Papers

Showing 46214630 of 5548 papers

TitleStatusHype
Beyond Optimism: Exploration With Partially Observable RewardsCode0
M3Dsynth: A dataset of medical 3D images with AI-generated local manipulationsCode0
M4Fog: A Global Multi-Regional, Multi-Modal, and Multi-Stage Dataset for Marine Fog Detection and Forecasting to Bridge Ocean and AtmosphereCode0
The Elusive Pursuit of Reproducing PATE-GAN: Benchmarking, Auditing, DebuggingCode0
Back to Basics: Benchmarking Canonical Evolution Strategies for Playing AtariCode0
Machine-assisted quantitizing designs: augmenting humanities and social sciences with artificial intelligenceCode0
Beyond Marginal Uncertainty: How Accurately can Bayesian Regression Models Estimate Posterior Predictive Correlations?Code0
Machine learning classification of non-Markovian noise disturbing quantum dynamicsCode0
Machine Learning Automation Toolbox (MLaut)Code0
3D fluorescence microscopy data synthesis for segmentation and benchmarkingCode0
Show:102550
← PrevPage 463 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified