SOTAVerified

Benchmarking

Papers

Showing 35813590 of 5548 papers

TitleStatusHype
On the Use of Quality Diversity Algorithms for The Traveling Thief Problem0
On the Utility of Equivariance and Symmetry Breaking in Deep Learning Architectures on Point Clouds0
On the Value of ML Models0
OOD-CV-v2: An extended Benchmark for Robustness to Out-of-Distribution Shifts of Individual Nuisances in Natural Images0
OODFace: Benchmarking Robustness of Face Recognition under Common Corruptions and Appearance Variations0
OOD-Speech: A Large Bengali Speech Recognition Dataset for Out-of-Distribution Benchmarking0
Open-CD: A Comprehensive Toolbox for Change Detection0
OpenContrails: Benchmarking Contrail Detection on GOES-16 ABI0
Open Datasets for Satellite Radio Resource Control0
OpenDeception: Benchmarking and Investigating AI Deceptive Behaviors via Open-ended Interaction Simulation0
Show:102550
← PrevPage 359 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified