SOTAVerified

Benchmarking

Papers

Showing 41714180 of 5548 papers

TitleStatusHype
Benchmarking Data-driven Surrogate Simulators for Artificial Electromagnetic MaterialsCode1
A new baseline for retinal vessel segmentation: Numerical identification and correction of methodological inconsistencies affecting 100+ papersCode0
Benchmarking Multimodal AutoML for Tabular Data with Text FieldsCode3
B-Pref: Benchmarking Preference-Based Reinforcement LearningCode1
OpenFWI: Large-Scale Multi-Structural Benchmark Datasets for Seismic Full Waveform InversionCode1
Is Bang-Bang Control All You Need? Solving Continuous Control with Bernoulli Policies0
Virus-MNIST: Machine Learning Baseline Calculations for Image Classification0
Procedural Generalization by Planning with Self-Supervised World Models0
Don’t be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue SystemCode1
Constructing a Psychometric Testbed for Fair Natural Language ProcessingCode0
Show:102550
← PrevPage 418 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified