SOTAVerified

Benchmarking

Papers

Showing 36113620 of 5548 papers

TitleStatusHype
OPTION: OPTImization Algorithm Benchmarking ONtology0
OPTION: OPTImization Algorithm Benchmarking ONtology0
OReole-FM: successes and challenges toward billion-parameter foundation models for high-resolution satellite imagery0
Organ-aware Multi-scale Medical Image Segmentation Using Text Prompt Engineering0
Orthogonal Deep Features Decomposition for Age-Invariant Face Recognition0
OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents0
oTTC: Object Time-to-Contact for Motion Estimation in Autonomous Driving0
Out of Distribution Performance of State of Art Vision Model0
Overconfident Oracles: Limitations of In Silico Sequence Design Benchmarking0
Overview and practical recommendations on using Shapley Values for identifying predictive biomarkers via CATE modeling0
Show:102550
← PrevPage 362 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified