SOTAVerified

Benchmarking

Papers

Showing 201210 of 5548 papers

TitleStatusHype
Segmenting France Across Four CenturiesCode0
GenSpace: Benchmarking Spatially-Aware Image Generation0
Progressive Class-level Distillation0
Bench4KE: Benchmarking Automated Competency Question GenerationCode1
Geospatial Foundation Models to Enable Progress on Sustainable Development Goals0
MetaFaith: Faithful Natural Language Uncertainty Expression in LLMsCode0
Benchmarking Foundation Models for Zero-Shot Biometric Tasks0
ByzFL: Research Framework for Robust Federated LearningCode1
Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image GenerationCode1
Benchmarking Large Language Models for Cryptanalysis and Mismatched-Generalization0
Show:102550
← PrevPage 21 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified