SOTAVerified

Benchmarking

Papers

Showing 14811490 of 5548 papers

TitleStatusHype
Benchmarking Natural Language Understanding Services for building Conversational AgentsCode1
NAS-Bench-101: Towards Reproducible Neural Architecture SearchCode1
The StarCraft Multi-Agent ChallengeCode1
The Liver Tumor Segmentation Benchmark (LiTS)Code1
LEAF: A Benchmark for Federated SettingsCode1
GuacaMol: Benchmarking Models for De Novo Molecular DesignCode1
IOHprofiler: A Benchmarking and Profiling Tool for Iterative Optimization HeuristicsCode1
On Evaluation of Embodied Navigation AgentsCode1
Benchmarking Neural Network Robustness to Common Corruptions and Surface VariationsCode1
Texygen: A Benchmarking Platform for Text Generation ModelsCode1
Show:102550
← PrevPage 149 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified